czech-stemmer

Czech stemmer is a pure Ruby port of the CzechStemmer Java class from Lucene.

Installation

gem install czech-stemmer

Usage

require 'czech-stemmer'

CzechStemmer.stem("předseda") # => "předsd"
CzechStemmer.stem("mladými") # => "mlad"

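The stemmer recognizes suffixes only when they are written in lowercase letters, so lowercase your input before stemming. It is based on the Lucene CzechStemmer, and all tests pass. Note the difference between stemming and lemmatization: a stem, such as "předsd" for "předseda", is a truncated form that need not be a dictionary word.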
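
Because suffixes are matched in lowercase only, mixed-case or uppercase input may need normalizing first. A minimal sketch of one way to do that follows; stem_any_case is a hypothetical helper, not part of this gem, and it takes the stemmer as an argument so it works with any object that responds to stem.

```ruby
# Hypothetical helper, not part of the czech-stemmer gem.
# `stemmer` is assumed to be any object responding to `stem(word)`;
# with the gem loaded, that would be CzechStemmer.
def stem_any_case(word, stemmer)
  # String#downcase maps non-ASCII letters such as "Ý" on Ruby 2.4+.
  stemmer.stem(word.downcase)
end
```

With the gem loaded, stem_any_case("MLADÝMI", CzechStemmer) hands "mladými" to the stemmer, the same input as in the usage example above.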

Copyright © 2014 Ondrej Odchazel. See LICENSE.txt for further details.