module documentation

UDHR corpus reader. It mostly deals with encodings.