Class IndonesianStemmer

java.lang.Object
org.apache.lucene.analysis.id.IndonesianStemmer

public class IndonesianStemmer extends Object
Stemmer for Indonesian.

Stems Indonesian words with the algorithm presented in: A Study of Stemming Effects on Information Retrieval in Bahasa Indonesia, Fadillah Z Tala. http://www.illc.uva.nl/Publications/ResearchReports/MoL-2003-02.text.pdf

  • Field Details

  • Constructor Details

    • IndonesianStemmer

      public IndonesianStemmer()
  • Method Details

    • stem

      public int stem(char[] text, int length, boolean stemDerivational)
      Stem a term (returning its new length).

      Use stemDerivational to control whether full stemming or only light inflectional stemming is done.

    • stemDerivational

      private int stemDerivational(char[] text, int length)
    • isVowel

      private boolean isVowel(char ch)
    • removeParticle

      private int removeParticle(char[] text, int length)
    • removePossessivePronoun

      private int removePossessivePronoun(char[] text, int length)
    • removeFirstOrderPrefix

      private int removeFirstOrderPrefix(char[] text, int length)
    • removeSecondOrderPrefix

      private int removeSecondOrderPrefix(char[] text, int length)
    • removeSuffix

      private int removeSuffix(char[] text, int length)