public class CzechStemmer
extends java.lang.Object
Implements the algorithm described in: Indexing and stemming approaches for the Czech language http://portal.acm.org/citation.cfm?id=1598600
| Constructor and Description |
|---|
CzechStemmer() |
| Modifier and Type | Method and Description |
|---|---|
private int |
normalize(char[] s,
int len) |
private int |
removeCase(char[] s,
int len) |
private int |
removePossessives(char[] s,
int len) |
int |
stem(char[] s,
int len)
Stem an input buffer of Czech text.
|
public int stem(char[] s,
int len)
s - input bufferlen - length of input bufferNOTE: Input is expected to be in lowercase, but with diacritical marks
private int removeCase(char[] s,
int len)
private int removePossessives(char[] s,
int len)
private int normalize(char[] s,
int len)