Package org.apache.lucene.analysis.fa
Class PersianNormalizer
java.lang.Object
org.apache.lucene.analysis.fa.PersianNormalizer
Normalizer for Persian.
Normalization is done in-place for efficiency, operating on a termbuffer.
Normalization is defined as:
- Normalization of various heh + hamza forms and heh goal to heh.
- Normalization of farsi yeh and yeh barree to arabic yeh
- Normalization of persian keheh to arabic kaf
-
Field Summary
FieldsModifier and TypeFieldDescription(package private) static final char
(package private) static final char
(package private) static final char
(package private) static final char
(package private) static final char
(package private) static final char
(package private) static final char
(package private) static final char
(package private) static final char
-
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescription(package private) int
normalize
(char[] s, int len) Normalize an input buffer of Persian text
-
Field Details
-
YEH
static final char YEH- See Also:
-
FARSI_YEH
static final char FARSI_YEH- See Also:
-
YEH_BARREE
static final char YEH_BARREE- See Also:
-
KEHEH
static final char KEHEH- See Also:
-
KAF
static final char KAF- See Also:
-
HAMZA_ABOVE
static final char HAMZA_ABOVE- See Also:
-
HEH_YEH
static final char HEH_YEH- See Also:
-
HEH_GOAL
static final char HEH_GOAL- See Also:
-
HEH
static final char HEH- See Also:
-
-
Constructor Details
-
PersianNormalizer
PersianNormalizer()
-
-
Method Details
-
normalize
int normalize(char[] s, int len) Normalize an input buffer of Persian text- Parameters:
s
- input bufferlen
- length of input buffer- Returns:
- length of input buffer after normalization
-