Read XSLT 2.0 and XPath 2.0 Programmer's Reference, 4th Edition Online
Authors: Michael Kay
Some very traditional HTML
then the spaces afterSome
and before
traditional
are significant, even though they appear respectively at the end and the beginning of a text node.
XSLT Example
The following key declaration indexes the titles of books with whitespace normalized:
This may then be used to locate books by title as follows:
The effect is that it will be possible, without knowing how many spaces and newlines there are, to retrieve a book appearing in the source document as:
Basic Principles and Programming Techniques
See Also
concat()
on page 729
substring-after()
on page 885
substring-before()
on page 887
normalize-unicode
The
normalize-unicode()
function returns a canonical representation of a string in which different ways of representing the same Unicode character have been reduced to a common representation. This makes it possible to compare two strings accurately.
Signature
Argument | Type | Meaning |
input | xs:string? | The string to be normalized. If an empty sequence is supplied, the function returns a zero-length string. |
normalization-form (optional) | xs:string | The normalization algorithm to be used. |
Result | xs:string | The result of normalizing the string . |
Effect
The function applies a Unicode normalization algorithm to the input string, and returns the normalized string as its result. If the
normalization-form
argument is omitted, the default is NFC. The only normalization form that all implementations must support is NFC. Other normalization forms can be requested (including but not limited toNFC
,
NFD
,
NFKC
,
NFKD
,
FULLY-NORMALIZED
) using the
normalization-form
argument; a runtime error is reported if the requested normalization form is not supported by the implementation. For the meanings of these normalization forms, see the
Usage
section below.