See: Description
Class | Description |
---|---|
DefaultICUTokenizerConfig |
Default
ICUTokenizerConfig that is generally applicable
to many languages. |
ICUTokenizer |
Breaks text into words according to UAX #29: Unicode Text Segmentation
(http://www.unicode.org/reports/tr29/)
|
ICUTokenizerConfig |
Class that allows for tailored Unicode Text Segmentation on
a per-writing system basis.
|
ICUTokenizerFactory |
Factory for
ICUTokenizer . |
Copyright © 2000-2021 Apache Software Foundation. All Rights Reserved.