class documentation
class CJKChars(object): (source)
An object that enumerates the code points of the CJK characters as listed on http://en.wikipedia.org/wiki/Basic_Multilingual_Plane#Basic_Multilingual_Plane
This is a Python port of the CJK code point enumerations of Moses tokenizer: https://github.com/moses-smt/mosesdecoder/blob/master/scripts/tokenizer/detokenizer.perl#L309
Class Variable |
|
Undocumented |
Class Variable |
|
Undocumented |
Class Variable |
|
Undocumented |
Class Variable |
|
Undocumented |
Class Variable |
|
Undocumented |
Class Variable |
|
Undocumented |
Class Variable |
|
Undocumented |
Class Variable | ranges |
Undocumented |
Class Variable |
|
Undocumented |