Class TokenInfoFST


  • public final class TokenInfoFST
    extends Object
    Thin wrapper around an FST with root-arc caching for Japanese.

    Depending upon fasterButMoreRam, either just kana (191 arcs), or kana and han (28,607 arcs) are cached. The latter offers additional performance at the cost of more RAM.

    • Field Detail

      • NO_OUTPUT

        public final Long NO_OUTPUT
    • Constructor Detail

      • TokenInfoFST

        public TokenInfoFST​(org.apache.lucene.util.fst.FST<Long> fst,
                            boolean fasterButMoreRam)
                     throws IOException
        Throws:
        IOException
    • Method Detail

      • findTargetArc

        public org.apache.lucene.util.fst.FST.Arc<Long> findTargetArc​(int ch,
                                                                      org.apache.lucene.util.fst.FST.Arc<Long> follow,
                                                                      org.apache.lucene.util.fst.FST.Arc<Long> arc,
                                                                      boolean useCache,
                                                                      org.apache.lucene.util.fst.FST.BytesReader fstReader)
                                                               throws IOException
        Throws:
        IOException
      • getFirstArc

        public org.apache.lucene.util.fst.FST.Arc<Long> getFirstArc​(org.apache.lucene.util.fst.FST.Arc<Long> arc)
      • getBytesReader

        public org.apache.lucene.util.fst.FST.BytesReader getBytesReader​(int pos)