class documentation
ARLSTem stemmer: a light Arabic stemming algorithm that does not use a dictionary. Developed at the Department of Telecommunication & Information Processing, USTHB University, Algiers, Algeria. ARLSTem.stem(token) returns the Arabic stem of the input token. The stemmer requires all tokens to be Unicode strings.
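As a rough illustration of the stem(token) call described above, the sketch below applies a stemmer to a list of tokens. The helper names stem_tokens and demo and the sample words are assumptions made for this example; the import path nltk.stem.arlstem assumes the class is used through NLTK.

```python
def stem_tokens(tokens, stemmer):
    """Return (token, stem) pairs using any object that exposes stem(token).

    Hypothetical helper for illustration; it is not part of ARLSTem.
    """
    return [(token, stemmer.stem(token)) for token in tokens]


def demo():
    # Imported here so the helper above stays usable without NLTK installed.
    from nltk.stem.arlstem import ARLSTem

    stemmer = ARLSTem()
    # Tokens must be Unicode strings, as the class documentation requires.
    sample = ["المعلمون", "والكتاب"]
    for token, stem in stem_tokens(sample, stemmer):
        print(token, "->", stem)
```

Calling demo() prints each sample token next to the stem returned by ARLSTem.stem.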
Method | __init__ |
Undocumented |
Method | fem2masc |
transform the word from the feminine form to the masculine form. |
Method | norm |
normalize the word by removing diacritics, replacing hamzated Alif with Alif, replacing AlifMaqsura with Yaa, and removing Waaw at the beginning. |
Method | plur2sing |
transform the word from the plural form to the singular form. |
Method | pref |
remove prefixes from the word's beginning. |
Method | stem |
call this function to get the word's stem based on ARLSTem. |
Method | suff |
remove suffixes from the word's end. |
Method | verb |
stem the verb's prefixes, suffixes, or both
Method | verb |
stem the present prefixes and suffixes |
Method | verb |
stem the future prefixes and suffixes |
Method | verb |
stem the present suffixes |
Method | verb |
stem the present prefixes |
Method | verb |
stem the future prefixes |
Method | verb |
stem the imperative (order) prefixes
Instance Variable | pl |
Undocumented |
Instance Variable | pl |
Undocumented |
Instance Variable | pr2 |
Undocumented |
Instance Variable | pr3 |
Undocumented |
Instance Variable | pr32 |
Undocumented |
Instance Variable | pr4 |
Undocumented |
Instance Variable | re_alif |
Undocumented |
Instance Variable | re |
Undocumented |
Instance Variable | re |
Undocumented |
Instance Variable | su2 |
Undocumented |
Instance Variable | su22 |
Undocumented |
Instance Variable | su3 |
Undocumented |
Instance Variable | su32 |
Undocumented |
Instance Variable | verb |
Undocumented |
Instance Variable | verb |
Undocumented |
Instance Variable | verb |
Undocumented |
Instance Variable | verb |
Undocumented |
Instance Variable | verb |
Undocumented |
Instance Variable | verb |
Undocumented |
Instance Variable | verb |
Undocumented |
normalize the word by removing diacritics, replacing hamzated Alif with Alif, replacing AlifMaqsura with Yaa, and removing Waaw at the beginning.
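The four normalization steps can be sketched as a standalone function. This is a minimal sketch, not the library's code: the function name normalize, the exact code point ranges, and the length guard on the leading Waaw are assumptions chosen for illustration.

```python
import re

# Assumed character classes; the real stemmer may use different ranges.
RE_DIACRITICS = re.compile("[\u064B-\u065F]")          # tanween, short vowels, shadda, sukun
RE_HAMZATED_ALIF = re.compile("[\u0622\u0623\u0625]")  # Alif with madda or hamza above/below
RE_ALIF_MAQSURA = re.compile("\u0649")

ALIF = "\u0627"
YAA = "\u064A"
WAAW = "\u0648"


def normalize(token: str) -> str:
    """Normalize one Arabic token (hypothetical stand-in for norm)."""
    # 1. Remove diacritics.
    token = RE_DIACRITICS.sub("", token)
    # 2. Replace hamzated Alif with bare Alif.
    token = RE_HAMZATED_ALIF.sub(ALIF, token)
    # 3. Replace Alif Maqsura with Yaa.
    token = RE_ALIF_MAQSURA.sub(YAA, token)
    # 4. Remove a leading Waaw; the length guard (an assumption) keeps
    #    short words that merely begin with Waaw intact.
    if token.startswith(WAAW) and len(token) > 3:
        token = token[1:]
    return token
```

Writing the characters as Unicode escapes keeps the sketch independent of the source file's encoding; the ordering matters only in that diacritics are removed before the length check on the leading Waaw.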