class documentation
class EnglishStemmer(_StandardStemmer): (source)
Constructor: EnglishStemmer(ignore_stopwords)
The English Snowball stemmer.
| Note | |
| A detailed description of the English stemming algorithm can be found under http://snowball.tartarus.org/algorithms/english/stemmer.html |
| Method | stem |
Stem an English word and return the stemmed form. |
| Class Variable | __double |
The English double consonants. |
| Class Variable | __li |
Letters that may directly appear before a word final 'li'. |
| Class Variable | __special |
A dictionary containing words which have to be stemmed specially. |
| Class Variable | __step0 |
Suffixes to be deleted in step 0 of the algorithm. |
| Class Variable | __step1a |
Suffixes to be deleted in step 1a of the algorithm. |
| Class Variable | __step1b |
Suffixes to be deleted in step 1b of the algorithm. |
| Class Variable | __step2 |
Suffixes to be deleted in step 2 of the algorithm. |
| Class Variable | __step3 |
Suffixes to be deleted in step 3 of the algorithm. |
| Class Variable | __step4 |
Suffixes to be deleted in step 4 of the algorithm. |
| Class Variable | __step5 |
Suffixes to be deleted in step 5 of the algorithm. |
| Class Variable | __vowels |
The English vowels. |
Inherited from _StandardStemmer:
| Method | _r1r2 |
Return the standard interpretations of the string regions R1 and R2. |
| Method | _rv |
Return the standard interpretation of the string region RV. |
Inherited from _LanguageSpecificStemmer (via _StandardStemmer):
| Method | __init__ |
Undocumented |
| Method | __repr__ |
Print out the string representation of the respective class. |
| Instance Variable | stopwords |
Undocumented |
overrides
nltk.stem.api.StemmerI.stemStem an English word and return the stemmed form.
| Parameters | |
| word:str or unicode | The word that is stemmed. |
| Returns | |
| unicode | The stemmed form. |