The flag y allows to perform the search at the given position in the source string.
To grasp the use case of y flag, and better understand the ways of regexps, letâs explore a practical example.
One of common tasks for regexps is âlexical analysisâ: we get a text, e.g. in a programming language, and need to find its structural elements. For instance, HTML has tags and attributes, JavaScript code has functions, variables, and so on.
Writing lexical analyzers is a special area, with its own tools and algorithms, so we donât go deep in there, but thereâs a common task: to read something at the given position.
E.g. we have a code string let varName = "value", and we need to read the variable name from it, that starts at position 4.
Weâll look for variable name using regexp \w+. Actually, JavaScript variable names need a bit more complex regexp for accurate matching, but here it doesnât matter.
- A call to
str.match(/\w+/)will find only the first word in the line (let). Thatâs not it. - We can add the flag
g. But then the callstr.match(/\w+/g)will look for all words in the text, while we need one word at position4. Again, not what we need.
So, how to search for a regexp exactly at the given position?
Letâs try using method regexp.exec(str).
For a regexp without flags g and y, this method looks only for the first match, it works exactly like str.match(regexp).
â¦But if thereâs flag g, then it performs the search in str, starting from position stored in the regexp.lastIndex property. And, if it finds a match, then sets regexp.lastIndex to the index immediately after the match.
In other words, regexp.lastIndex serves as a starting point for the search, that each regexp.exec(str) call resets to the new value (âafter the last matchâ). Thatâs only if thereâs g flag, of course.
So, successive calls to regexp.exec(str) return matches one after another.
Hereâs an example of such calls:
let str = 'let varName'; // Let's find all words in this string
let regexp = /\w+/g;
alert(regexp.lastIndex); // 0 (initially lastIndex=0)
let word1 = regexp.exec(str);
alert(word1[0]); // let (1st word)
alert(regexp.lastIndex); // 3 (position after the match)
let word2 = regexp.exec(str);
alert(word2[0]); // varName (2nd word)
alert(regexp.lastIndex); // 11 (position after the match)
let word3 = regexp.exec(str);
alert(word3); // null (no more matches)
alert(regexp.lastIndex); // 0 (resets at search end)
We can get all matches in the loop:
let str = 'let varName';
let regexp = /\w+/g;
let result;
while (result = regexp.exec(str)) {
alert( `Found ${result[0]} at position ${result.index}` );
// Found let at position 0, then
// Found varName at position 4
}
Such use of regexp.exec is an alternative to method str.matchAll, with a bit more control over the process.
Letâs go back to our task.
We can manually set lastIndex to 4, to start the search from the given position!
Like this:
let str = 'let varName = "value"';
let regexp = /\w+/g; // without flag "g", property lastIndex is ignored
regexp.lastIndex = 4;
let word = regexp.exec(str);
alert(word); // varName
Hooray! Problem solved!
We performed a search of \w+, starting from position regexp.lastIndex = 4.
The result is correct.
â¦But wait, not so fast.
Please note: the regexp.exec call starts searching at position lastIndex and then goes further. If thereâs no word at position lastIndex, but itâs somewhere after it, then it will be found:
let str = 'let varName = "value"';
let regexp = /\w+/g;
// start the search from position 3
regexp.lastIndex = 3;
let word = regexp.exec(str);
// found the match at position 4
alert(word[0]); // varName
alert(word.index); // 4
For some tasks, including the lexical analysis, thatâs just wrong. We need to find a match exactly at the given position at the text, not somewhere after it. And thatâs what the flag y is for.
The flag y makes regexp.exec to search exactly at position lastIndex, not âstarting fromâ it.
Hereâs the same search with flag y:
let str = 'let varName = "value"';
let regexp = /\w+/y;
regexp.lastIndex = 3;
alert( regexp.exec(str) ); // null (there's a space at position 3, not a word)
regexp.lastIndex = 4;
alert( regexp.exec(str) ); // varName (word at position 4)
As we can see, regexp /\w+/y doesnât match at position 3 (unlike the flag g), but matches at position 4.
Not only thatâs what we need, thereâs an important performance gain when using flag y.
Imagine, we have a long text, and there are no matches in it, at all. Then a search with flag g will go till the end of the text and find nothing, and this will take significantly more time than the search with flag y, that checks only the exact position.
In tasks like lexical analysis, there are usually many searches at an exact position, to check what we have there. Using flag y is the key for correct implementations and a good performance.
Comments
<code>tag, for several lines â wrap them in<pre>tag, for more than 10 lines â use a sandbox (plnkr, jsbin, codepenâ¦)