Skip to main content

Unicode

Unicode-aware mode

JavaScript RegExp object offers Unicode-aware.

Character escapes

function unicodeChar(codePoint: number): CharacterEscape;

Alias: char.

Regex syntax:

  • \uXXXX: 4-digit hex escape for code points below 0x10000.
  • \u{XXXXXX}: Unicode code point escape for code points above 0xFFFF.

Note: for code points above 0xFFFF, the regex engine must be unicode-aware.

Unicode character property escapes

function unicodeProperty(property: string, value?: string): CharacterEscape;

Unicode character property escape matching a set of characters specified by a Unicode property.

Regex syntax: \p{Property} or \p{Property=Value}

See: