Closeable
, AutoCloseable
@Deprecated public class RussianLetterTokenizer extends CharTokenizer
StandardTokenizer
instead, which has the same functionality.
This filter will be removed in Lucene 5.0Tokenizer
that extends LetterTokenizer
by also allowing the basic Latin digits 0-9.
You must specify the required Version
compatibility when creating
RussianLetterTokenizer
:
CharTokenizer
uses an int based API to normalize and
detect token characters. See CharTokenizer.isTokenChar(int)
and
CharTokenizer.normalize(int)
for details.AttributeSource.AttributeFactory, AttributeSource.State
Constructor | Description |
---|---|
RussianLetterTokenizer(Reader in) |
Deprecated.
use
RussianLetterTokenizer(Version, Reader) instead. |
RussianLetterTokenizer(AttributeSource.AttributeFactory factory,
Reader in) |
Deprecated.
|
RussianLetterTokenizer(AttributeSource source,
Reader in) |
Deprecated.
|
RussianLetterTokenizer(Version matchVersion,
Reader in) |
Deprecated.
Construct a new RussianLetterTokenizer.
|
RussianLetterTokenizer(Version matchVersion,
AttributeSource.AttributeFactory factory,
Reader in) |
Deprecated.
Construct a new RussianLetterTokenizer using a given
AttributeSource.AttributeFactory . |
RussianLetterTokenizer(Version matchVersion,
AttributeSource source,
Reader in) |
Deprecated.
Construct a new RussianLetterTokenizer using a given
AttributeSource . |
Modifier and Type | Method | Description |
---|---|---|
protected boolean |
isTokenChar(int c) |
Deprecated.
Collects only characters which satisfy
Character.isLetter(int) . |
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toString
end, incrementToken, isTokenChar, normalize, normalize, reset
clone, finalize, getClass, notify, notifyAll, wait, wait, wait
close, correctOffset
reset
public RussianLetterTokenizer(Version matchVersion, Reader in)
in
- the input to split up into tokenspublic RussianLetterTokenizer(Version matchVersion, AttributeSource source, Reader in)
AttributeSource
.public RussianLetterTokenizer(Version matchVersion, AttributeSource.AttributeFactory factory, Reader in)
AttributeSource.AttributeFactory
. * @param
matchVersion Lucene version to match See
{@link above}factory
- the attribute factory to use for this Tokenizer
in
- the input to split up into tokens@Deprecated public RussianLetterTokenizer(Reader in)
RussianLetterTokenizer(Version, Reader)
instead. This will
be removed in Lucene 4.0.@Deprecated public RussianLetterTokenizer(AttributeSource source, Reader in)
RussianLetterTokenizer(Version, AttributeSource, Reader)
instead. This will be removed in Lucene 4.0.AttributeSource
.@Deprecated public RussianLetterTokenizer(AttributeSource.AttributeFactory factory, Reader in)
RussianLetterTokenizer(Version, AttributeSource.AttributeFactory, Reader)
instead. This will be removed in Lucene 4.0.AttributeSource.AttributeFactory
.protected boolean isTokenChar(int c)
Character.isLetter(int)
.isTokenChar
in class CharTokenizer
Copyright © 2000-2018 Apache Software Foundation. All Rights Reserved.