Read XSLT 2.0 and XPath 2.0 Programmer's Reference, 4th Edition Online
Authors: Michael Kay
The list of categories, with a few examples of characters found in each, is listed in the table below.
Category | Description | Examples |
L | Letters | |
Lu | Upper case | A, B, Φ , Φ |
Ll | Lower case | a, b, ö, λ |
Lt | Title case | Dz (x01C5) |
Lm | Modifier | Arabic TATWEEL (x0640) |
Lo | Other | Hebrew ALEF (x05D0) |
M | Marks | |
Mn | Non-spacing | Combining acute accent (x0301) |
Mc | Spacing | Gujarati vowel sign AA (x0ABE) |
Me | Enclosing | Combining enclosing circle (x20DD) |
N | Numbers | |
Nd | Decimal digits | 1, 2, 3, 4, |
Nl | Numeric letters | Roman numeral ten thousand (x2182) |
No | Other | Superscript 2 3 (x00B2, x00B3) |
P | Punctuation | |
Pc | Connector | _ (x005F) |
Pd | Dash | em dash (x2014) |
Ps | Open | ( [ { |
Pe | Close | ) ]} |
Pi | Initial quote | (x00AB) |
Pf | Final quote | (x00BB) |
Po | Other | ! ? ¿ (x00BF) |
Z | Separators | |
Zs | Space | space (x0020), non-breaking space (x00A0) |
Zl | Line | line separator (x2028) |
Zp | Paragraph | paragraph separator (x2029) |
S | Symbols | |
Sm | Mathematical | +<=>| ∼ ¬ ± |
Sc | Currency | $ ¢ £ ¥ € |
Sk | Modifier | acute accent ’ , cedilla ¸ |
So | Other | ¦ § © ° |
C | Others | |
Cc | Control | tab (x0009), newline (X000A) |
Cf | Format | soft hyphen (x00AD) |
Co | Private use | |
Cn | Not assigned |
Flags
The effect of a regular expression can be modified by setting one or more flags. These are supplied in a separate argument of the XPath function
matches()
,
replace()
, or
tokenize()
, or in a separate attribute of the XSLT instruction
i
,
m
,
s
, and
x
(they can be written in any order, and no harm is done if a character is repeated). This section explains the meanings of the flags.