|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectorg.apache.lucene.analysis.Analyzer
org.apache.lucene.analysis.ru.RussianAnalyzer
public final class RussianAnalyzer
Analyzer for Russian language.
Supports an external list of stopwords (words that will not be indexed at all). A default set of stopwords is used unless an alternative list is specified.
| Field Summary |
|---|
| Fields inherited from class org.apache.lucene.analysis.Analyzer |
|---|
overridesTokenStreamMethod |
| Constructor Summary | |
|---|---|
RussianAnalyzer(Version matchVersion)
|
|
RussianAnalyzer(Version matchVersion,
Map<?,?> stopwords)
Deprecated. use RussianAnalyzer(Version, Set) instead |
|
RussianAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words |
|
RussianAnalyzer(Version matchVersion,
String... stopwords)
Deprecated. use RussianAnalyzer(Version, Set) instead |
|
| Method Summary | |
|---|---|
TokenStream |
reusableTokenStream(String fieldName,
Reader reader)
Returns a (possibly reused) TokenStream which tokenizes all the text
in the provided Reader. |
TokenStream |
tokenStream(String fieldName,
Reader reader)
Creates a TokenStream which tokenizes all the text in the
provided Reader. |
| Methods inherited from class org.apache.lucene.analysis.Analyzer |
|---|
close, getOffsetGap, getPositionIncrementGap, getPreviousTokenStream, setOverridesTokenStreamMethod, setPreviousTokenStream |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
|---|
public RussianAnalyzer(Version matchVersion)
public RussianAnalyzer(Version matchVersion,
String... stopwords)
RussianAnalyzer(Version, Set) instead
public RussianAnalyzer(Version matchVersion,
Set<?> stopwords)
matchVersion - lucene compatibility versionstopwords - a stopword set
public RussianAnalyzer(Version matchVersion,
Map<?,?> stopwords)
RussianAnalyzer(Version, Set) instead
| Method Detail |
|---|
public TokenStream tokenStream(String fieldName,
Reader reader)
TokenStream which tokenizes all the text in the
provided Reader.
tokenStream in class AnalyzerTokenStream built from a
RussianLetterTokenizer filtered with
RussianLowerCaseFilter, StopFilter,
and RussianStemFilter
public TokenStream reusableTokenStream(String fieldName,
Reader reader)
throws IOException
TokenStream which tokenizes all the text
in the provided Reader.
reusableTokenStream in class AnalyzerTokenStream built from a
RussianLetterTokenizer filtered with
RussianLowerCaseFilter, StopFilter,
and RussianStemFilter
IOException
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||