Computes a new version of a String value in which certain characters have been escaped, so that the regular expression engine will interpret any metacharacters that it may contain as character literals.
When the escape function is called with one argument string, the following steps are taken:
- Let string be ToString(string).
- ReturnIfAbrupt(string).
- Let length be the number of characters in string.
- Let R be the empty string.
- Let k be 0.
- Repeat, while k < length,
1. Let C be the character at position k within string.
1. If C is one of the 16 nonblank characters "-[]{}()*+?.,\^$|" then,
- Let S be a String containing two characters "\x" where x is a C character. 1. Else,
- Let S be a String containing the single C character. 1. Let R be a new String value computed by concatenating the previous value of R and S. 1. Increase k by 1.
- Return R.
@kangax re code unit, i think @inexorabletash was referring to step 3 and 6i. In particular, since regular expressions now have a "unicode" mode, you probably need to decide how you want to represent astral characters -- is that single "code point" encoded as two UTF-16 "code units" in an ordinary regexp, or as a single code point in a unicode mode regexp?