Skip navigation links

Package com.wcohen.ss

This package contains a bunch of approximate string comparators, plus code for performing controlled experiments with this.

See: Description

Package com.wcohen.ss Description

This package contains a bunch of approximate string comparators, plus code for performing controlled experiments with this.

A StringDistance is the basic class for computing distances. The score() function of this class outputs a distance measure between its two arguments. The other methods are there for efficiency, so that preprocessing steps (like tokenization) can be amortized over multiple comparisons with the same string.

The way that preprocessing steps are saved is by creating a StringWrapper object which contains the preprocessed string, plus whatever else needs to be cached. To do this, extend default implementation of StringWrapper.

Almost everything in this package implements StringDistance. The only (public) exceptions are StringWrapper; PrintfFormat, pilfered from Sun to make the explanations easier; CharMatchScore, which is a character-based distance metric; and MemoMatrix, a utility for defining edit-distance-based methods.

Skip navigation links

Copyright © 2016. All rights reserved.