class documentation

Logic common to all interpolated language models.

The idea to abstract this comes from Chen & Goodman 1995. Do not instantiate this class directly!

Method __init__ Creates new LanguageModel.
Method unmasked_score Score a word given some optional context.
Instance Variable estimator Undocumented

Inherited from LanguageModel:

Method context_counts Helper method for retrieving counts for a given context.
Method entropy Calculate cross-entropy of model for given evaluation text.
Method fit Trains the model on a text.
Method generate Generate words from the model.
Method logscore Evaluate the log score of this word in this context.
Method perplexity Calculates the perplexity of the given text.
Method score Masks out of vocab (OOV) words and computes their model score.
Instance Variable counts Undocumented
Instance Variable order Undocumented
Instance Variable vocab Undocumented
def __init__(self, smoothing_cls, order, **kwargs): (source)

Creates new LanguageModel.

of creating a new one when training. :type vocabulary: nltk.lm.Vocabulary or None :param counter: If provided, use this object to count ngrams. :type vocabulary: nltk.lm.NgramCounter or None :param ngrams_fn: If given, defines how sentences in training text are turned to ngram

sequences.
Parameters
smoothing_clsUndocumented
orderUndocumented
ngrams_fn:function or NoneUndocumented
pad_fn:function or NoneIf given, defines how senteces in training text are padded.
vocabularyIf provided, this vocabulary will be used instead
**kwargsUndocumented
def unmasked_score(self, word, context=None): (source)

Score a word given some optional context.

Concrete models are expected to provide an implementation. Note that this method does not mask its arguments with the OOV label. Use the score method for that.

If None, compute unigram score. :param context: tuple(str) or None :rtype: float

Parameters
wordUndocumented
contextUndocumented
str wordWord for which we want the score
tuple(str) contextContext the word is in.
estimator = (source)

Undocumented