class documentation
ARLSTem stemmer : a light Arabic Stemming algorithm without any dictionary. Department of Telecommunication & Information Processing. USTHB University, Algiers, Algeria. ARLSTem.stem(token) returns the Arabic stem for the input token. The ARLSTem Stemmer requires that all tokens are encoded using Unicode encoding.
| Method | __init__ |
Undocumented |
| Method | fem2masc |
transform the word from the feminine form to the masculine form. |
| Method | norm |
normalize the word by removing diacritics, replacing hamzated Alif with Alif replacing AlifMaqsura with Yaa and removing Waaw at the beginning. |
| Method | plur2sing |
transform the word from the plural form to the singular form. |
| Method | pref |
remove prefixes from the words' beginning. |
| Method | stem |
call this function to get the word's stem based on ARLSTem . |
| Method | suff |
remove suffixes from the word's end. |
| Method | verb |
stem the verb prefixes and suffixes or both |
| Method | verb |
stem the present prefixes and suffixes |
| Method | verb |
stem the future prefixes and suffixes |
| Method | verb |
stem the present suffixes |
| Method | verb |
stem the present prefixes |
| Method | verb |
stem the future prefixes |
| Method | verb |
stem the order prefixes |
| Instance Variable | pl |
Undocumented |
| Instance Variable | pl |
Undocumented |
| Instance Variable | pr2 |
Undocumented |
| Instance Variable | pr3 |
Undocumented |
| Instance Variable | pr32 |
Undocumented |
| Instance Variable | pr4 |
Undocumented |
| Instance Variable | re_alif |
Undocumented |
| Instance Variable | re |
Undocumented |
| Instance Variable | re |
Undocumented |
| Instance Variable | su2 |
Undocumented |
| Instance Variable | su22 |
Undocumented |
| Instance Variable | su3 |
Undocumented |
| Instance Variable | su32 |
Undocumented |
| Instance Variable | verb |
Undocumented |
| Instance Variable | verb |
Undocumented |
| Instance Variable | verb |
Undocumented |
| Instance Variable | verb |
Undocumented |
| Instance Variable | verb |
Undocumented |
| Instance Variable | verb |
Undocumented |
| Instance Variable | verb |
Undocumented |
normalize the word by removing diacritics, replacing hamzated Alif with Alif replacing AlifMaqsura with Yaa and removing Waaw at the beginning.