class documentation
class EnglishStemmer(_StandardStemmer): (source)
Constructor: EnglishStemmer(ignore_stopwords)
The English Snowball stemmer.
Note | |
A detailed description of the English stemming algorithm can be found under http://snowball.tartarus.org/algorithms/english/stemmer.html |
Method | stem |
Stem an English word and return the stemmed form. |
Class Variable | __double |
The English double consonants. |
Class Variable | __li |
Letters that may directly appear before a word final 'li'. |
Class Variable | __special |
A dictionary containing words which have to be stemmed specially. |
Class Variable | __step0 |
Suffixes to be deleted in step 0 of the algorithm. |
Class Variable | __step1a |
Suffixes to be deleted in step 1a of the algorithm. |
Class Variable | __step1b |
Suffixes to be deleted in step 1b of the algorithm. |
Class Variable | __step2 |
Suffixes to be deleted in step 2 of the algorithm. |
Class Variable | __step3 |
Suffixes to be deleted in step 3 of the algorithm. |
Class Variable | __step4 |
Suffixes to be deleted in step 4 of the algorithm. |
Class Variable | __step5 |
Suffixes to be deleted in step 5 of the algorithm. |
Class Variable | __vowels |
The English vowels. |
Inherited from _StandardStemmer
:
Method | _r1r2 |
Return the standard interpretations of the string regions R1 and R2. |
Method | _rv |
Return the standard interpretation of the string region RV. |
Inherited from _LanguageSpecificStemmer
(via _StandardStemmer
):
Method | __init__ |
Undocumented |
Method | __repr__ |
Print out the string representation of the respective class. |
Instance Variable | stopwords |
Undocumented |
overrides
nltk.stem.api.StemmerI.stem
Stem an English word and return the stemmed form.
Parameters | |
word:str or unicode | The word that is stemmed. |
Returns | |
unicode | The stemmed form. |