class documentation
class CJKChars(object):
An object that enumerates the code points of the CJK characters, as listed at http://en.wikipedia.org/wiki/Basic_Multilingual_Plane#Basic_Multilingual_Plane
This is a Python port of the CJK code point enumerations in the Moses tokenizer's detokenizer script: https://github.com/moses-smt/mosesdecoder/blob/master/scripts/tokenizer/detokenizer.perl#L309
| Class Variable | | Undocumented |
| Class Variable | | Undocumented |
| Class Variable | | Undocumented |
| Class Variable | | Undocumented |
| Class Variable | | Undocumented |
| Class Variable | | Undocumented |
| Class Variable | | Undocumented |
| Class Variable | ranges | Undocumented |
| Class Variable | | Undocumented |
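Because the class variables are undocumented here, the following is only a minimal sketch of how a `ranges`-style enumeration of code points could be used to test whether a character is CJK. The class name `CJKRangesSketch`, its attribute names, and the helper `is_cjk` are hypothetical, and the ranges shown are standard Unicode block boundaries chosen for illustration; they are not necessarily the values stored in `CJKChars`.

```python
class CJKRangesSketch:
    """Hypothetical stand-in for illustration; not the documented CJKChars class."""

    # Each entry is an inclusive (first, last) pair of code points.
    # Values are Unicode block boundaries, assumed here for the example.
    HANGUL_JAMO = (0x1100, 0x11FF)
    HANGUL_SYLLABLES = (0xAC00, 0xD7AF)
    CJK_COMPATIBILITY_IDEOGRAPHS = (0xF900, 0xFAFF)

    # Collect all ranges in one list so callers can iterate over them.
    ranges = [HANGUL_JAMO, HANGUL_SYLLABLES, CJK_COMPATIBILITY_IDEOGRAPHS]


def is_cjk(char, ranges=CJKRangesSketch.ranges):
    """Return True if the single character falls inside any listed range."""
    code_point = ord(char)
    return any(start <= code_point <= end for start, end in ranges)
```

With these assumed ranges, `is_cjk("한")` is true because U+D55C lies in the Hangul Syllables block, while `is_cjk("a")` is false.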