org.apache.lucene.analysis.ja
Class JapaneseTokenizer

java.lang.Object
  extended by org.apache.lucene.util.AttributeSource
      extended by org.apache.lucene.analysis.TokenStream
          extended by org.apache.lucene.analysis.Tokenizer
              extended by org.apache.lucene.analysis.ja.JapaneseTokenizer
All Implemented Interfaces:
Closeable

public final class JapaneseTokenizer
extends org.apache.lucene.analysis.Tokenizer

This is a Japanese tokenizer which uses "Sen" morphological analyzer.

sets the surface form as the term text, but also sets these attributes:


Nested Class Summary
 
Nested classes/interfaces inherited from class org.apache.lucene.util.AttributeSource
org.apache.lucene.util.AttributeSource.AttributeFactory, org.apache.lucene.util.AttributeSource.State
 
Field Summary
 
Fields inherited from class org.apache.lucene.analysis.Tokenizer
input
 
Constructor Summary
JapaneseTokenizer(Reader in)
           
JapaneseTokenizer(Reader in, StreamFilter filter)
           
JapaneseTokenizer(Reader in, StreamFilter filter, String dictionaryDir)
           
 
Method Summary
 void end()
           
 boolean incrementToken()
           
 void reset(Reader in)
           
 
Methods inherited from class org.apache.lucene.analysis.Tokenizer
close, correctOffset
 
Methods inherited from class org.apache.lucene.analysis.TokenStream
reset
 
Methods inherited from class org.apache.lucene.util.AttributeSource
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toString
 
Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, wait, wait, wait
 

Constructor Detail

JapaneseTokenizer

public JapaneseTokenizer(Reader in)

JapaneseTokenizer

public JapaneseTokenizer(Reader in,
                         StreamFilter filter)

JapaneseTokenizer

public JapaneseTokenizer(Reader in,
                         StreamFilter filter,
                         String dictionaryDir)
Method Detail

incrementToken

public boolean incrementToken()
                       throws IOException
Specified by:
incrementToken in class org.apache.lucene.analysis.TokenStream
Throws:
IOException

reset

public void reset(Reader in)
           throws IOException
Overrides:
reset in class org.apache.lucene.analysis.Tokenizer
Throws:
IOException

end

public void end()
         throws IOException
Overrides:
end in class org.apache.lucene.analysis.TokenStream
Throws:
IOException


Copyright © 2012. All Rights Reserved.