| Interface | Description |
|---|---|
| Normalizer |
Normalization transforms text into a canonical form by removing unwanted
variations.
|
| Class | Description |
|---|---|
| SimpleNormalizer |
A baseline normalizer for processing Unicode text:
Apply Unicode normalization form NFKC.
Strip, trim, normalize, and compress whitespace.
Remove control and formatting characters.
Normalize dash, double and single quotes.
|