Package net.sf.saxon.codenorm
Class Normalizer
java.lang.Object
net.sf.saxon.codenorm.Normalizer
Implements Unicode Normalization Forms C, D, KC, KD.
Copyright (c) 1991-2005 Unicode, Inc.
For terms of use, see http://www.unicode.org/terms_of_use.html
For documentation, see UAX#15.
The Unicode Consortium makes no expressed or implied warranty of any kind, and assumes no liability for errors or omissions. No liability is assumed for incidental and consequential damages in connection with or arising out of the use of the information here.
The Unicode Consortium makes no expressed or implied warranty of any kind, and assumes no liability for errors or omissions. No liability is assumed for incidental and consequential damages in connection with or arising out of the use of the information here.
- Author:
- Mark Davis Updates for supplementary code points: Vladimir Weinstein invalid input: '&' Markus Scherer Modified to remove dependency on ICU code: Michael Kay
-
Field Summary
Fields -
Constructor Summary
ConstructorsConstructorDescriptionNormalizer
(byte form) Create a normalizer for a given form.Normalizer
(CharSequence formCS) Create a normalizer for a given form, expressed as a character string -
Method Summary
Modifier and TypeMethodDescriptionnormalize
(CharSequence source) Normalizes text according to the chosen form
-
Field Details
-
D
public static final byte DNormalization Form Selector- See Also:
-
C
public static final byte CNormalization Form Selector- See Also:
-
KD
public static final byte KDNormalization Form Selector- See Also:
-
KC
public static final byte KCNormalization Form Selector- See Also:
-
NO_ACTION
public static final byte NO_ACTIONNormalization Form Selector- See Also:
-
-
Constructor Details
-
Normalizer
public Normalizer(byte form) Create a normalizer for a given form. -
Normalizer
Create a normalizer for a given form, expressed as a character string- Parameters:
formCS
- the normalization form required: for example "NFC" or "NFD"- Throws:
XPathException
-
-
Method Details
-
normalize
Normalizes text according to the chosen form- Parameters:
source
- the original text, unnormalized- Returns:
- target the resulting normalized text
-