public class SmartChineseWordTokenFilterFactory extends BaseTokenFilterFactory
WordTokenFilter
Note: this class will currently emit tokens for punctuation. So you should either add
a WordDelimiterFilter after to remove these (with concatenate off), or use the
SmartChinese stoplist with a StopFilterFactory via:
words="org/apache/lucene/analysis/cn/smart/stopwords.txt"
logargs, luceneMatchVersion| Constructor and Description |
|---|
SmartChineseWordTokenFilterFactory() |
| Modifier and Type | Method and Description |
|---|---|
org.apache.lucene.analysis.TokenFilter |
create(org.apache.lucene.analysis.TokenStream input)
Transform the specified input TokenStream
|
assureMatchVersion, getArgs, getBoolean, getBoolean, getInt, getInt, getInt, getSnowballWordSet, getWordSet, init, warnDeprecatedclone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitgetArgs, initpublic org.apache.lucene.analysis.TokenFilter create(org.apache.lucene.analysis.TokenStream input)
TokenFilterFactory