public class ShingleFilterFactory extends TokenFilterFactory
ShingleFilter.
<fieldType name="text_shingle" class="solr.TextField" positionIncrementGap="100">
<analyzer>
<tokenizer class="solr.WhitespaceTokenizerFactory"/>
<filter class="solr.ShingleFilterFactory" minShingleSize="2" maxShingleSize="2"
outputUnigrams="true" outputUnigramsIfNoShingles="false" tokenSeparator=" " fillerToken="_"/>
</analyzer>
</fieldType>| Modifier and Type | Field and Description |
|---|---|
private java.lang.String |
fillerToken |
private int |
maxShingleSize |
private int |
minShingleSize |
private boolean |
outputUnigrams |
private boolean |
outputUnigramsIfNoShingles |
private java.lang.String |
tokenSeparator |
LUCENE_MATCH_VERSION_PARAM, luceneMatchVersion| Constructor and Description |
|---|
ShingleFilterFactory(java.util.Map<java.lang.String,java.lang.String> args)
Creates a new ShingleFilterFactory
|
| Modifier and Type | Method and Description |
|---|---|
ShingleFilter |
create(TokenStream input)
Transform the specified input TokenStream
|
availableTokenFilters, forName, lookupClass, normalize, reloadTokenFiltersget, get, get, get, get, getBoolean, getChar, getClassArg, getFloat, getInt, getLines, getLuceneMatchVersion, getOriginalArgs, getPattern, getSet, getSnowballWordSet, getWordSet, isExplicitLuceneMatchVersion, require, require, require, requireBoolean, requireChar, requireFloat, requireInt, setExplicitLuceneMatchVersion, splitAt, splitFileNamesprivate final int minShingleSize
private final int maxShingleSize
private final boolean outputUnigrams
private final boolean outputUnigramsIfNoShingles
private final java.lang.String tokenSeparator
private final java.lang.String fillerToken
public ShingleFilterFactory(java.util.Map<java.lang.String,java.lang.String> args)
public ShingleFilter create(TokenStream input)
TokenFilterFactorycreate in class TokenFilterFactory