public final class CapitalizationFilter extends TokenFilter
This filter is particularly useful to build nice looking facet parameters. This filter is not appropriate if you intend to use a prefix query.
AttributeSource.State
Modifier and Type | Field and Description |
---|---|
static int |
DEFAULT_MAX_TOKEN_LENGTH |
static int |
DEFAULT_MAX_WORD_COUNT |
input
DEFAULT_TOKEN_ATTRIBUTE_FACTORY
Constructor and Description |
---|
CapitalizationFilter(TokenStream in)
Creates a CapitalizationFilter with the default parameters.
|
CapitalizationFilter(TokenStream in,
boolean onlyFirstWord,
CharArraySet keep,
boolean forceFirstLetter,
Collection<char[]> okPrefix,
int minWordLength,
int maxWordCount,
int maxTokenLength)
Creates a CapitalizationFilter with the specified parameters.
|
Modifier and Type | Method and Description |
---|---|
boolean |
incrementToken() |
close, end, reset
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, endAttributes, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, removeAllAttributes, restoreState, toString
public static final int DEFAULT_MAX_WORD_COUNT
public static final int DEFAULT_MAX_TOKEN_LENGTH
public CapitalizationFilter(TokenStream in)
public CapitalizationFilter(TokenStream in, boolean onlyFirstWord, CharArraySet keep, boolean forceFirstLetter, Collection<char[]> okPrefix, int minWordLength, int maxWordCount, int maxTokenLength)
in
- input tokenstreamonlyFirstWord
- should each word be capitalized or all of the words?keep
- a keep word list. Each word that should be kept separated by whitespace.forceFirstLetter
- Force the first letter to be capitalized even if it is in the keep list.okPrefix
- do not change word capitalization if a word begins with something in this list.minWordLength
- how long the word needs to be to get capitalization applied. If the
minWordLength is 3, "and" > "And" but "or" stays "or".maxWordCount
- if the token contains more then maxWordCount words, the capitalization is
assumed to be correct.maxTokenLength
- ???public boolean incrementToken() throws IOException
incrementToken
in class TokenStream
IOException
Copyright © 2000-2021 Apache Software Foundation. All Rights Reserved.