module documentation

Undocumented

Class Dendrogram Represents a dendrogram, a tree with a specified branching order. This must be initialised with the leaf items, then iteratively call merge for each branch. This class constructs a tree representing the order of calls to the merge function.
Class VectorSpaceClusterer Abstract clusterer which takes tokens and maps them into a vector space. Optionally performs singular value decomposition to reduce the dimensionality.
Function cosine_distance Returns 1 minus the cosine of the angle between vectors v and u. This is equal to 1 - (u.v / |u||v|).
Function euclidean_distance Returns the euclidean distance between vectors u and v. This is equivalent to the length of the vector (u - v).
Class _DendrogramNode Tree node of a dendrogram.
def cosine_distance(u, v): (source)

Returns 1 minus the cosine of the angle between vectors v and u. This is equal to 1 - (u.v / |u||v|).

def euclidean_distance(u, v): (source)

Returns the euclidean distance between vectors u and v. This is equivalent to the length of the vector (u - v).