Class WordSegmenter
java.lang.Object
org.apache.lucene.analysis.cn.smart.WordSegmenter
Segment a sentence of Chinese text into words.
-
Field Summary
Fields -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionconvertSegToken
(SegToken st, String sentence, int sentenceStartOffset) Process aSegToken
so that it is ready for indexing.segmentSentence
(String sentence, int startOffset) Segment a sentence into words withHHMMSegmenter
-
Field Details
-
hhmmSegmenter
-
tokenFilter
-
-
Constructor Details
-
WordSegmenter
WordSegmenter()
-
-
Method Details
-
segmentSentence
Segment a sentence into words withHHMMSegmenter
-
convertSegToken
Process aSegToken
so that it is ready for indexing.This method calculates offsets and normalizes the token with
SegTokenFilter
.
-