class documentation
class TestTokenize: (source)
Undocumented
Method | test |
Test LegalitySyllableTokenizer tokenizer. |
Method | test |
Test padding of asterisk for word tokenization. |
Method | test |
Test padding of dotdot* for word tokenization. |
Method | test |
Test a string that resembles a phone number but contains a newline |
Method | test |
Undocumented |
Method | test |
Undocumented |
Method | test |
Undocumented |
Method | test |
Undocumented |
Method | test |
Undocumented |
Method | test |
Test remove_handle() from casual.py with specially crafted edge cases |
Method | test |
Test SyllableTokenizer tokenizer. |
Method | test |
Test the Stanford Word Segmenter for Arabic (default config) |
Method | test |
Test the Stanford Word Segmenter for Chinese (default config) |
Method | test |
Test TreebankWordTokenizer.span_tokenize function |
Method | test |
Test TweetTokenizer using words with special and accented characters. |
Method | test |
Test word_tokenize function |