A B C D E F G H I J L M N O P R S T U V W _ 

A

a - Variable in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
 
AbbreviationAlignment - Class in com.wcohen.ss
Abbreviation distance metric which evaluates the probability of a short-form string being an abbreviation/acronym of another long-form string.
AbbreviationAlignment() - Constructor for class com.wcohen.ss.AbbreviationAlignment
 
AbbreviationAlignmentContainer<T extends Enum<T>,S extends Enum<S>> - Class in com.wcohen.ss.abbvGapsHmm
 
AbbreviationsBlocker - Class in com.wcohen.ss.expt
Produces candidate pairs that share not-too-common character tokens.
AbbreviationsBlocker(Tokenizer, double) - Constructor for class com.wcohen.ss.expt.AbbreviationsBlocker
 
AbbreviationsBlocker() - Constructor for class com.wcohen.ss.expt.AbbreviationsBlocker
 
AbbvGapsHMM - Class in com.wcohen.ss.abbvGapsHmm
 
AbbvGapsHMM() - Constructor for class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
 
AbbvGapsHMM(String) - Constructor for class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
 
AbbvGapsHMM(String, boolean) - Constructor for class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
 
AbbvGapsHMM.Emissions - Enum in com.wcohen.ss.abbvGapsHmm
 
AbbvGapsHMM.States - Enum in com.wcohen.ss.abbvGapsHmm
 
AbbvGapsHMM.Transitions - Enum in com.wcohen.ss.abbvGapsHmm
 
AbbvGapsHmmBackwardsEvaluator - Class in com.wcohen.ss.abbvGapsHmm
 
AbbvGapsHmmBackwardsEvaluator(AbbvGapsHMM) - Constructor for class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
 
AbbvGapsHmmBackwardsEvaluator.BackwardEvalParam - Class in com.wcohen.ss.abbvGapsHmm
 
AbbvGapsHmmBackwardsViterbiEvaluator - Class in com.wcohen.ss.abbvGapsHmm
 
AbbvGapsHmmBackwardsViterbiEvaluator(AbbvGapsHMM) - Constructor for class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsViterbiEvaluator
 
AbbvGapsHMMEvaluator - Class in com.wcohen.ss.abbvGapsHmm
 
AbbvGapsHMMEvaluator(AbbvGapsHMM) - Constructor for class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
 
AbbvGapsHMMEvaluator.EvalParam - Class in com.wcohen.ss.abbvGapsHmm
 
AbbvGapsHmmExpectationEvaluator - Class in com.wcohen.ss.abbvGapsHmm
 
AbbvGapsHmmExpectationEvaluator(AbbvGapsHMM) - Constructor for class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmExpectationEvaluator
 
AbbvGapsHmmForwardEvaluator - Class in com.wcohen.ss.abbvGapsHmm
 
AbbvGapsHmmForwardEvaluator(AbbvGapsHMM) - Constructor for class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
 
AbbvGapsHmmForwardEvaluator.ForwardEvalParam - Class in com.wcohen.ss.abbvGapsHmm
 
AbstractSourcedStatisticalTokenDistance - Class in com.wcohen.ss
Abstract token distance metric that uses frequency statistics.
AbstractSourcedStatisticalTokenDistance(SourcedTokenizer) - Constructor for class com.wcohen.ss.AbstractSourcedStatisticalTokenDistance
 
AbstractSourcedStatisticalTokenDistance() - Constructor for class com.wcohen.ss.AbstractSourcedStatisticalTokenDistance
 
AbstractSourcedTokenizedStringDistance - Class in com.wcohen.ss
Abstract distance metric for tokenized strings.
AbstractSourcedTokenizedStringDistance(Tokenizer) - Constructor for class com.wcohen.ss.AbstractSourcedTokenizedStringDistance
 
AbstractSourcedTokenizedStringDistance() - Constructor for class com.wcohen.ss.AbstractSourcedTokenizedStringDistance
 
AbstractStatisticalTokenDistance - Class in com.wcohen.ss
Abstract token distance metric that uses frequency statistics.
AbstractStatisticalTokenDistance(Tokenizer) - Constructor for class com.wcohen.ss.AbstractStatisticalTokenDistance
 
AbstractStatisticalTokenDistance() - Constructor for class com.wcohen.ss.AbstractStatisticalTokenDistance
 
AbstractStringDistance - Class in com.wcohen.ss
Abstract class which implements StringDistanceLearner as well as StringDistance.
AbstractStringDistance() - Constructor for class com.wcohen.ss.AbstractStringDistance
 
AbstractTokenizedStringDistance - Class in com.wcohen.ss
Abstract distance metric for tokenized strings.
AbstractTokenizedStringDistance(Tokenizer) - Constructor for class com.wcohen.ss.AbstractTokenizedStringDistance
 
AbstractTokenizedStringDistance() - Constructor for class com.wcohen.ss.AbstractTokenizedStringDistance
 
Acronym - Class in com.wcohen.ss.abbvGapsHmm
 
Acronym(String, String) - Constructor for class com.wcohen.ss.abbvGapsHmm.Acronym
 
Acronym(String, String, Integer) - Constructor for class com.wcohen.ss.abbvGapsHmm.Acronym
 
Acronym.AcronymFrequencyComparator - Class in com.wcohen.ss.abbvGapsHmm
 
Acronym.AcronymShortFormComparator - Class in com.wcohen.ss.abbvGapsHmm
 
AcronymFrequencyComparator() - Constructor for class com.wcohen.ss.abbvGapsHmm.Acronym.AcronymFrequencyComparator
 
acronymsArrayToMap(Collection<Acronym>) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
AcronymShortFormComparator() - Constructor for class com.wcohen.ss.abbvGapsHmm.Acronym.AcronymShortFormComparator
 
add(int, int, int, double) - Method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
 
addAbbreviationPairs(Map<String, Acronym>) - Method in class com.wcohen.ss.expt.ExtractAbbreviations
 
addCandidatePair(List<Acronym>, String, String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
addExample(DistanceInstance) - Method in class com.wcohen.ss.AbstractStringDistance
Implements StringDistanceLearner api by accepting new DistanceInstance labels.
addExample(DistanceInstance) - Method in interface com.wcohen.ss.api.StringDistanceLearner
Accept the answer to the last query.
addExample(DistanceInstance) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Pass new labels to the sublearners.
addInstance(String, String, String) - Method in class com.wcohen.ss.expt.MatchData
Add a single instance, with given src and id, to the datafile
addInstance(String, String, String) - Method in class com.wcohen.ss.expt.SourcedMatchData
Add a single instance, with given src and id, to the datafile
advanceEvalRange() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator.BackwardEvalParam
 
advanceEvalRange() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
Advance within evaluation range
advanceEvalRange() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator.ForwardEvalParam
 
AffineGap - Class in com.wcohen.ss
Affine-gap string distance, following Durbin et al.
AffineGap() - Constructor for class com.wcohen.ss.AffineGap
 
AffineGap(CharMatchScore, double, double, double) - Constructor for class com.wcohen.ss.AffineGap
 
AffineGap.MatrixTrio - Class in com.wcohen.ss
 
AffineGap.MatrixTrio.InsertSMatrix - Class in com.wcohen.ss
 
AffineGap.MatrixTrio.InsertTMatrix - Class in com.wcohen.ss
 
AlignmentPredictionModel - Class in com.wcohen.ss.abbvGapsHmm
 
AlignmentPredictionModel() - Constructor for class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
andToSymbolMatch(char) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
 
ApproxMemoMatrix - Class in com.wcohen.ss
Variant of MemoMatrix that only stores values near the diagonal, for better efficiency.
ApproxNeedlemanWunsch - Class in com.wcohen.ss
Needleman-Wunsch string distance, following Durbin et al.
ApproxNeedlemanWunsch() - Constructor for class com.wcohen.ss.ApproxNeedlemanWunsch
 
ApproxNeedlemanWunsch(CharMatchScore, double) - Constructor for class com.wcohen.ss.ApproxNeedlemanWunsch
 
asBagOfSourcedTokens(SourcedStringWrapper) - Method in class com.wcohen.ss.AbstractSourcedTokenizedStringDistance
 
asBagOfTokens(StringWrapper) - Method in class com.wcohen.ss.AbstractTokenizedStringDistance
 
asMultiDistanceInstance(DistanceInstance) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
 
asMultiDistanceInstanceList(DistanceInstanceIterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
 
asMultiStringWrapper(StringWrapper) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
 
asMultiStringWrapper(StringWrapper) - Method in class com.wcohen.ss.MultiStringDistance
Lazily prepare a string.
asMultiStringWrapperList(StringWrapperIterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
 
asUnitVector(SourcedStringWrapper) - Method in class com.wcohen.ss.SourcedTFIDF
 
asUnitVector(StringWrapper) - Method in class com.wcohen.ss.TagLink
 
asUnitVector(StringWrapper) - Method in class com.wcohen.ss.TFIDF
 
at(int, int, int) - Method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
 
AveragedStringDistanceLearner - Class in com.wcohen.ss
Abstract StringDistanceLearner class which averages results of a number of inner distance metrics, learned by a number of inner distance learners.
AveragedStringDistanceLearner() - Constructor for class com.wcohen.ss.AveragedStringDistanceLearner
 
AveragedStringDistanceLearner(StringDistanceLearner[], String) - Constructor for class com.wcohen.ss.AveragedStringDistanceLearner
 
averagePrecision() - Method in class com.wcohen.ss.expt.MatchExpt
non-interpolated average precision
averagePrecision() - Method in class com.wcohen.ss.expt.SpecialMatchExpt
non-interpolated average precision
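
The averagePrecision() entries above refer to non-interpolated average precision. As a rough illustration of that measure (not the MatchExpt implementation), the sketch below averages the precision observed at each rank where a correct pair appears. The class name, the method signature, and the boolean-array input are hypothetical, and the sketch assumes the denominator is the number of correct pairs present in the ranking.

```java
// Hypothetical helper for illustration; not part of com.wcohen.ss.
final class AveragePrecisionSketch {
    // ranked[i] is true when the pair at rank i+1 is a correct match.
    static double averagePrecision(boolean[] ranked) {
        int correctSoFar = 0;
        double sumOfPrecisions = 0.0;
        for (int i = 0; i < ranked.length; i++) {
            if (ranked[i]) {
                correctSoFar++;
                // Precision at this rank: correct pairs seen / pairs seen.
                sumOfPrecisions += (double) correctSoFar / (i + 1);
            }
        }
        // Assumption: normalize by the correct pairs found in the ranking.
        return correctSoFar == 0 ? 0.0 : sumOfPrecisions / correctSoFar;
    }

    public static void main(String[] args) {
        boolean[] ranked = {true, false, true};
        System.out.println(averagePrecision(ranked));
    }
}
```

For the ranking {correct, incorrect, correct}, the precisions at the two correct ranks are 1/1 and 2/3, so the sketch returns their mean, 5/6.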

B

b - Variable in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
 
backgroundProb(Token) - Method in class com.wcohen.ss.JensenShannonDistance
Probability of token in the background language model
BackwardEvalParam(String) - Constructor for class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator.BackwardEvalParam
 
backwardEvaluate(Acronym, List<Double>, List<Double>) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
 
backwardViterbiEvaluate(Acronym, List<Double>, List<Double>) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsViterbiEvaluator
 
BasicDistanceInstanceIterator - Class in com.wcohen.ss
A simple DistanceInstanceIterator implementation.
BasicDistanceInstanceIterator(Iterator) - Constructor for class com.wcohen.ss.BasicDistanceInstanceIterator
 
BasicSourcedStringWrapperIterator - Class in com.wcohen.ss
A simple StringWrapperIterator implementation.
BasicSourcedStringWrapperIterator(Iterator) - Constructor for class com.wcohen.ss.BasicSourcedStringWrapperIterator
 
BasicSourcedToken - Class in com.wcohen.ss.tokens
An interned version of a string, with provenance information.
BasicStringWrapper - Class in com.wcohen.ss
An extendible (non-final) class that implements some of the functionality of a string.
BasicStringWrapper(String) - Constructor for class com.wcohen.ss.BasicStringWrapper
 
BasicStringWrapperIterator - Class in com.wcohen.ss
A simple StringWrapperIterator implementation.
BasicStringWrapperIterator(Iterator) - Constructor for class com.wcohen.ss.BasicStringWrapperIterator
 
BasicTeacher - Class in com.wcohen.ss.expt
Train a StringDistanceLearner.
BasicTeacher(Blocker, MatchData) - Constructor for class com.wcohen.ss.expt.BasicTeacher
Create a teacher from a blocker and a dataset.
BasicTeacher(StringWrapperIterator, DistanceInstanceIterator, DistanceInstanceIterator) - Constructor for class com.wcohen.ss.expt.BasicTeacher
Create a teacher using specific values for the various iterators.
BasicToken - Class in com.wcohen.ss.tokens
An interned version of a string.
block(MatchData) - Method in class com.wcohen.ss.expt.AbbreviationsBlocker
 
block(MatchData) - Method in class com.wcohen.ss.expt.Blocker
Load matchdata and prepare it for production of candidate pairs.
block(MatchData) - Method in class com.wcohen.ss.expt.NullBlocker
 
block(MatchData) - Method in class com.wcohen.ss.expt.TokenBlocker
 
Blocker - Class in com.wcohen.ss.expt
Produces candidate pairs from a MatchData structure, and provides access to those candidate pairs.
Blocker() - Constructor for class com.wcohen.ss.expt.Blocker
 
blocker(String) - Method in class com.wcohen.ss.expt.MatchExptScript
Load a blocker.
blocker(String, String, String) - Method in class com.wcohen.ss.expt.MatchExptScript
Load a blocker, with optional boolean value
Blocker.Pair - Class in com.wcohen.ss.expt
Holds a pair of instances, with mutable distance between them.
BLOCKER_PACKAGE - Static variable in class com.wcohen.ss.expt.MatchExpt
 
BLOCKER_PACKAGE - Static variable in class com.wcohen.ss.expt.MatchExptScript
 
BLOCKER_PACKAGE - Static variable in class com.wcohen.ss.expt.SpecialMatchExpt
 
blockerRecall() - Method in class com.wcohen.ss.expt.MatchExpt
performance of the blocker
blockerRecall() - Method in class com.wcohen.ss.expt.SpecialMatchExpt
performance of the blocker
build(String) - Static method in class com.wcohen.ss.DistanceLearnerFactory
Generate a StringDistance from a class name, or a sequence of classnames separated by slashes.
build(String[]) - Static method in class com.wcohen.ss.DistanceLearnerFactory
Generate a StringDistance from a sequence of classnames.
buildArray(String) - Static method in class com.wcohen.ss.DistanceLearnerFactory
Generate a StringDistanceArray given a sequence of classnames separated by slashes.
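
The Blocker entries above describe producing candidate pairs from a dataset, and AbbreviationsBlocker is described as keeping pairs that share not-too-common tokens. A minimal sketch of that general idea follows, assuming whitespace tokenization and a maximum document-frequency fraction. It does not use the Blocker or MatchData API: the class name, method, and "i,j" pair encoding are all hypothetical.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration of token-based blocking; not the Blocker API.
final class TokenBlockingSketch {
    // Returns index pairs "i,j" (i < j) for strings that share a token
    // occurring in at most maxFraction of all strings.
    static Set<String> candidatePairs(List<String> strings, double maxFraction) {
        // Inverted index: token -> indices of the strings containing it.
        Map<String, List<Integer>> index = new HashMap<>();
        for (int i = 0; i < strings.size(); i++) {
            Set<String> tokens = new LinkedHashSet<>();
            for (String tok : strings.get(i).toLowerCase().split("\\s+")) {
                if (!tok.isEmpty()) tokens.add(tok);
            }
            for (String tok : tokens) {
                index.computeIfAbsent(tok, k -> new ArrayList<>()).add(i);
            }
        }
        Set<String> pairs = new LinkedHashSet<>();
        double maxCount = maxFraction * strings.size();
        for (List<Integer> ids : index.values()) {
            if (ids.size() > maxCount) continue; // token too common to be useful
            for (int a = 0; a < ids.size(); a++) {
                for (int b = a + 1; b < ids.size(); b++) {
                    pairs.add(ids.get(a) + "," + ids.get(b));
                }
            }
        }
        return pairs;
    }
}
```

Skipping very common tokens keeps the candidate set small, at the cost of possibly missing pairs that share only common tokens.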

C

Candidates(int, int, double) - Constructor for class com.wcohen.ss.TagLink.Candidates
Candidates constructor.
cellFormat - Variable in class com.wcohen.ss.MemoMatrix
 
CharacterTokenizer - Class in com.wcohen.ss.tokens
Character tokenizer implementation.
CharacterTokenizer(boolean, boolean) - Constructor for class com.wcohen.ss.tokens.CharacterTokenizer
 
charAt(int) - Method in interface com.wcohen.ss.api.StringWrapper
Return the i-th char of the wrapped string
charAt(int) - Method in class com.wcohen.ss.BasicStringWrapper
 
charEqualIgnoreCase(char, char) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
 
CharMatchScore - Class in com.wcohen.ss
Abstract distance between characters.
CharMatchScore() - Constructor for class com.wcohen.ss.CharMatchScore
 
checkTrainingHasHappened(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.AbstractSourcedStatisticalTokenDistance
 
checkTrainingHasHappened(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.AbstractStatisticalTokenDistance
 
chunkLongForm(String, int) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
clear(String) - Method in class com.wcohen.ss.expt.MatchExptScript
Clear datasets, blockers, or learners.
clusterMode - Variable in class com.wcohen.ss.expt.Blocker
 
ClusterNGramBlocker - Class in com.wcohen.ss.expt
TokenBlocker for clustering based on NGram co-occurrence.
ClusterNGramBlocker() - Constructor for class com.wcohen.ss.expt.ClusterNGramBlocker
 
ClusterTokenBlocker - Class in com.wcohen.ss.expt
TokenBlocker for clustering.
ClusterTokenBlocker() - Constructor for class com.wcohen.ss.expt.ClusterTokenBlocker
 
ClusterTokenBlocker(Tokenizer, double) - Constructor for class com.wcohen.ss.expt.ClusterTokenBlocker
 
collectionSize - Variable in class com.wcohen.ss.AbstractSourcedStatisticalTokenDistance
 
collectionSize - Variable in class com.wcohen.ss.AbstractStatisticalTokenDistance
 
com.wcohen.ss - package com.wcohen.ss
This package contains a bunch of approximate string comparators, plus code for performing controlled experiments with them.
com.wcohen.ss.abbvGapsHmm - package com.wcohen.ss.abbvGapsHmm
 
com.wcohen.ss.api - package com.wcohen.ss.api
 
com.wcohen.ss.expt - package com.wcohen.ss.expt
 
com.wcohen.ss.lookup - package com.wcohen.ss.lookup
 
com.wcohen.ss.tokens - package com.wcohen.ss.tokens
 
CombinedStringDistance(StringDistance[], MultiStringWrapper) - Constructor for class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
 
CombinedStringDistanceLearner - Class in com.wcohen.ss
Abstract StringDistanceLearner class which combines results of a number of inner distance metrics, learned by a number of inner distance learners.
CombinedStringDistanceLearner() - Constructor for class com.wcohen.ss.CombinedStringDistanceLearner
 
CombinedStringDistanceLearner(StringDistanceLearner[], String) - Constructor for class com.wcohen.ss.CombinedStringDistanceLearner
 
CombinedStringDistanceLearner.CombinedStringDistance - Class in com.wcohen.ss
Abstract class for combining innerDistances.
CombinedStringDistanceLearner.JthDistanceInstanceIterator - Class in com.wcohen.ss
Iterate over the j-th field of MultiStringWrapper's in a DistanceInstance of MultiStringWrapper's
CombinedStringDistanceLearner.JthStringWrapperValueIterator - Class in com.wcohen.ss
Iterate over the j-th field of MultiStringWrapper
CombinedStringDistanceLearner.MyDistanceInstance - Class in com.wcohen.ss
 
CombinedStringDistanceLearner.MyMultiDistanceInstance - Class in com.wcohen.ss
 
comboAddExample(DistanceInstance) - Method in class com.wcohen.ss.AveragedStringDistanceLearner
 
comboAddExample(DistanceInstance) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Pass a labeled example to the score-combination learner.
comboHasNextQuery() - Method in class com.wcohen.ss.AveragedStringDistanceLearner
 
comboHasNextQuery() - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Poll the routine that learns to combine inner distance scores to see if it wants to make more queries.
comboNextQuery() - Method in class com.wcohen.ss.AveragedStringDistanceLearner
 
comboNextQuery() - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Get the next query from the score-combination learner.
comboSetDistanceInstancePool(Iterator) - Method in class com.wcohen.ss.AveragedStringDistanceLearner
 
comboSetDistanceInstancePool(Iterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Set up a pool of (possibly unlabeled) instance distances, for the learner to make queries from.
comboSetStringWrapperPool(Iterator) - Method in class com.wcohen.ss.AveragedStringDistanceLearner
 
comboSetStringWrapperPool(Iterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Pass an iterator over unlabeled string wrappers to the score-combination learner, just in case that's useful.
compare(Acronym, Acronym) - Method in class com.wcohen.ss.abbvGapsHmm.Acronym.AcronymFrequencyComparator
 
compare(Acronym, Acronym) - Method in class com.wcohen.ss.abbvGapsHmm.Acronym.AcronymShortFormComparator
 
compareTo(Acronym) - Method in class com.wcohen.ss.abbvGapsHmm.Acronym
 
compareTo(Object) - Method in class com.wcohen.ss.expt.Blocker.Pair
 
compareTo(Object) - Method in class com.wcohen.ss.tokens.BasicSourcedToken
 
compareTo(Object) - Method in class com.wcohen.ss.tokens.BasicToken
 
compute(int, int) - Method in class com.wcohen.ss.AffineGap.MatrixTrio
 
compute(int, int) - Method in class com.wcohen.ss.AffineGap.MatrixTrio.InsertSMatrix
 
compute(int, int) - Method in class com.wcohen.ss.AffineGap.MatrixTrio.InsertTMatrix
 
compute() - Method in class com.wcohen.ss.expt.MatchExptScript
Compute learners.
correct - Variable in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
 
countCorrectPairs(MatchData) - Method in class com.wcohen.ss.expt.Blocker
Compute number of correct pairs between src1 and src2, where src2>src1.

D

dataset(String) - Method in class com.wcohen.ss.expt.MatchExptScript
Load a dataset.
DEFAULT_SOURCED_TOKENIZER - Static variable in class com.wcohen.ss.tokens.SimpleSourcedTokenizer
 
DEFAULT_TOKENIZER - Static variable in class com.wcohen.ss.tokens.CharacterTokenizer
 
DEFAULT_TOKENIZER - Static variable in class com.wcohen.ss.tokens.NGramTokenizer
 
DEFAULT_TOKENIZER - Static variable in class com.wcohen.ss.tokens.SimpleTokenizer
 
delim - Variable in class com.wcohen.ss.CombinedStringDistanceLearner
 
dimension1() - Method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
 
dimension2() - Method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
 
dimension3() - Method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
 
DirichletJS - Class in com.wcohen.ss
Jensen-Shannon distance of two unigram language models, smoothed using Dirichlet prior.
DirichletJS(Tokenizer, double) - Constructor for class com.wcohen.ss.DirichletJS
 
DirichletJS() - Constructor for class com.wcohen.ss.DirichletJS
 
displayResults(boolean, PrintStream) - Method in class com.wcohen.ss.expt.MatchExpt
Show results in a simple format.
displayResults(boolean, PrintStream) - Method in class com.wcohen.ss.expt.SpecialMatchExpt
Show results in a simple format.
DIST_01 - Static variable in class com.wcohen.ss.CharMatchScore
Scores match as 0, mismatch as -1.
DIST_21 - Static variable in class com.wcohen.ss.CharMatchScore
Scores match as +2, mismatch as -1.
distance - Variable in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
 
distance(String) - Method in class com.wcohen.ss.expt.MatchExptScript
Load a distance learner.
DISTANCE_PACKAGE - Static variable in class com.wcohen.ss.expt.MatchExpt
 
DISTANCE_PACKAGE - Static variable in class com.wcohen.ss.expt.MatchExptScript
 
DISTANCE_PACKAGE - Static variable in class com.wcohen.ss.expt.SpecialMatchExpt
 
distanceExamplePool() - Method in class com.wcohen.ss.api.StringDistanceTeacher
A pool of labeled pairs of strings over which distances will be computed, to be used for supervised learning.
distanceExamplePool() - Method in class com.wcohen.ss.expt.BasicTeacher
 
distanceExamplePool() - Method in class com.wcohen.ss.expt.MatchDataTeacher
 
DistanceInstance - Interface in com.wcohen.ss.api
An 'instance' for a StringDistance, analogous to an 'instance' for a classification learner.
DistanceInstanceIterator - Interface in com.wcohen.ss.api
An iterator over DistanceInstance objects.
distanceInstancePool() - Method in class com.wcohen.ss.api.StringDistanceTeacher
A pool of unlabeled pairs of strings over which distances will be computed, to be used for active or semi-supervised learning.
distanceInstancePool() - Method in class com.wcohen.ss.expt.BasicTeacher
 
distanceInstancePool() - Method in class com.wcohen.ss.expt.MatchDataTeacher
 
DistanceLearnerFactory - Class in com.wcohen.ss
Creates distance metric learners from string descriptions.
DistanceLearnerFactory() - Constructor for class com.wcohen.ss.DistanceLearnerFactory
 
documentFrequency - Variable in class com.wcohen.ss.AbstractSourcedStatisticalTokenDistance
 
documentFrequency - Variable in class com.wcohen.ss.AbstractStatisticalTokenDistance
 
doMain(StringDistance, String[]) - Static method in class com.wcohen.ss.AbstractStringDistance
Default main routine for testing
doMain(StringDistance, String[]) - Static method in class com.wcohen.ss.MultiStringDistance
Default main routine for testing
doScore(MultiStringWrapper, MultiStringWrapper) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
Produce a score, assuming ms and mt are the correct sizes, and fully prepared.
dumpResults(PrintStream) - Method in class com.wcohen.ss.expt.MatchExpt
Show results in an easily machine-readable format.
dumpResults(PrintStream) - Method in class com.wcohen.ss.expt.SpecialMatchExpt
Show results in an easily machine-readable format.
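
The DIST_01 and DIST_21 entries above describe two character scoring schemes. The sketch below shows how such schemes could feed a global (Needleman-Wunsch style) alignment score with a constant gap cost. It is an illustration only: the interface, constants, and method here are hypothetical stand-ins, and the library's CharMatchScore signature and gap conventions are not assumed.

```java
// Hypothetical sketch; names and signatures are not taken from com.wcohen.ss.
final class CharScoreSketch {
    interface CharScore {
        double match(char a, char b);
    }

    // Mirrors the DIST_01 description: match scores 0, mismatch scores -1.
    static final CharScore DIST_01 = (a, b) -> a == b ? 0.0 : -1.0;
    // Mirrors the DIST_21 description: match scores +2, mismatch scores -1.
    static final CharScore DIST_21 = (a, b) -> a == b ? 2.0 : -1.0;

    // Global alignment score with a constant per-character gap cost.
    static double alignmentScore(String s, String t, CharScore score, double gapCost) {
        double[][] best = new double[s.length() + 1][t.length() + 1];
        // Aligning a prefix against the empty string costs one gap per character.
        for (int i = 1; i <= s.length(); i++) best[i][0] = -gapCost * i;
        for (int j = 1; j <= t.length(); j++) best[0][j] = -gapCost * j;
        for (int i = 1; i <= s.length(); i++) {
            for (int j = 1; j <= t.length(); j++) {
                double align = best[i - 1][j - 1]
                        + score.match(s.charAt(i - 1), t.charAt(j - 1));
                double skipS = best[i - 1][j] - gapCost; // gap in t
                double skipT = best[i][j - 1] - gapCost; // gap in s
                best[i][j] = Math.max(align, Math.max(skipS, skipT));
            }
        }
        return best[s.length()][t.length()];
    }
}
```

With a gap cost of 1, aligning "abc" against "abd" gives two matches and one mismatch, so the DIST_21 scheme scores 2 + 2 - 1 = 3 and the DIST_01 scheme scores 0 + 0 - 1 = -1.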

E

echo(String) - Method in class com.wcohen.ss.expt.MatchExptScript
Turn echoing of commands on/off.
eq(Matrix3D) - Method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
 
equals(Object) - Method in class com.wcohen.ss.abbvGapsHmm.Acronym
 
equals(Object) - Method in class com.wcohen.ss.tokens.BasicSourcedToken
 
equals(Object) - Method in class com.wcohen.ss.tokens.BasicToken
 
EvalParam(String) - Constructor for class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
 
evaluate(Acronym) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
This function enumerates the possible transitions and emissions in a hard-coded way to save running time.
expectationEvaluate(Acronym, List<Double>, List<Double>, List<Double>, List<Double>, Matrix3D, Matrix3D) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmExpectationEvaluator
 
expectationStep(Acronym, String) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
 
explainCombination(MultiStringWrapper, MultiStringWrapper) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
Explain how the primitive scores were combined.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.AbbreviationAlignment
 
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.AbstractStringDistance
This method needs to be implemented by subclasses.
explainScore(String, String) - Method in class com.wcohen.ss.AbstractStringDistance
Scores are explained by converting Strings to StringWrappers with the prepare function.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.AffineGap
 
explainScore(StringWrapper, StringWrapper) - Method in interface com.wcohen.ss.api.StringDistance
Explain how the distance was computed.
explainScore(String, String) - Method in interface com.wcohen.ss.api.StringDistance
Explain how the distance was computed.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.ApproxNeedlemanWunsch
 
explainScore(String, String) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
 
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
 
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Jaccard
Explain how the distance was computed.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Jaro
 
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.JensenShannonDistance
 
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Level2
Explain how the distance was computed.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Mixture
Explain how the distance was computed.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.MongeElkan
Version in which the distance is possibly scaled to [0,1].
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.MultiStringDistance
 
explainScore(String, String) - Method in class com.wcohen.ss.MultiStringDistance
Scores are explained by converting Strings to StringWrappers with the prepare function.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.NeedlemanWunsch
 
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SmithWaterman
 
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SoftTFIDF
Explain how the distance was computed.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SoftTokenFelligiSunter
Explain how the distance was computed.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SourcedSoftTFIDF
Explain how the distance was computed.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SourcedTFIDF
Explain how the distance was computed.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.TagLink
explainStringMetric gives a brief explanation of how the stringMetric was computed.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.TFIDF
Explain how the distance was computed.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.TokenFelligiSunter
Explain how the distance was computed.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.tokens.TagLinkToken
explainScore returns an explanation of how the string distance was computed.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.WinklerRescorer
 
explainScoreCombination(double[]) - Method in class com.wcohen.ss.MultiStringAvgDistance
Explain how to combine the scores for each primitive distance function on each field.
explainScoreCombination(double[]) - Method in class com.wcohen.ss.MultiStringDistance
Explain how to combine the scores for each primitive distance function on each field.
ExtractAbbreviations - Class in com.wcohen.ss.expt
Extracts abbreviation pairs (<short-form, long-form>) from text using an 'abbreviation distance metric' which evaluates the probability of a short-form string being an abbreviation/acronym of another long-form string.
ExtractAbbreviations(String, String, String, String) - Constructor for class com.wcohen.ss.expt.ExtractAbbreviations
 
ExtractAbbreviations.Stats - Class in com.wcohen.ss.expt
 
extractCandidatePairs(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
Candidates Extraction
extractHeadNounPattern_2Parts(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
extractHeadNounPattern_3Parts(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
extractPatternAcronyms(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
extractSingleAcronyms(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
extractTrailingNounPattern_2Parts(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
extractTrailingNounPattern_3Parts(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 

F

F1 - Variable in class com.wcohen.ss.expt.ExtractAbbreviations.Stats
 
FastLookup - Interface in com.wcohen.ss.lookup
Interface for SoftTFIDFDictionary and the rescoring variant of it.
finalizeEvalMat() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
 
finalizeEvalMat() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
 
finalizeEvalMat() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
 
findLastNonLetterOrDigit(String, int) - Static method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
 
findMostProbablePathBackwards(Acronym) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsViterbiEvaluator
 
findNextNonLetterOrDigit(String, int) - Static method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
 
FN - Variable in class com.wcohen.ss.expt.ExtractAbbreviations.Stats
 
ForwardEvalParam(String) - Constructor for class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator.ForwardEvalParam
 
forwardEvaluate(Acronym, List<Double>, List<Double>) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
 
FP - Variable in class com.wcohen.ss.expt.ExtractAbbreviations.Stats
 
freeze() - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Make it impossible to add new values, but possible to perform lookups.

G

get(int) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyMultiDistanceInstance
 
get(int) - Method in class com.wcohen.ss.MultiStringWrapper
Return the i-th field.
getA() - Method in interface com.wcohen.ss.api.DistanceInstance
 
getA() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
 
getA(int) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyMultiDistanceInstance
 
getA() - Method in class com.wcohen.ss.expt.Blocker.Pair
 
getAcronym() - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
 
getAcronymScore() - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
 
getAlignedChar(int, boolean) - Method in class com.wcohen.ss.ApproxNeedlemanWunsch
Find a character in the first string, s, that can be aligned with the i-th character in the second string, t.
getB() - Method in interface com.wcohen.ss.api.DistanceInstance
 
getB() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
 
getB(int) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyMultiDistanceInstance
 
getB() - Method in class com.wcohen.ss.expt.Blocker.Pair
 
getCollectionSize() - Method in class com.wcohen.ss.SourcedTFIDF
 
getCollectionSize() - Method in class com.wcohen.ss.TFIDF
 
getCurrentChars(String, int, int) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
 
getCurrentChars(String, int, int) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
Get characters from the string going backwards or forwards (depending on the implementing class), according to the specified length.
getCurrentChars(String, int, int) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
 
getCurrentPartialWordLen() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator.BackwardEvalParam
 
getCurrentPartialWordLen() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
 
getCurrentPartialWordLen() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator.ForwardEvalParam
 
getCurrentPartialWordMatchPosition() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator.BackwardEvalParam
 
getCurrentPartialWordMatchPosition() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
 
getCurrentPartialWordMatchPosition() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator.ForwardEvalParam
 
getCurrentWord(String, int) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
 
getCurrentWord(String, int) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
Get the current long form word going backwards or forwards (depending on the implementing class).
getCurrentWord(String, int) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
 
getDF(String) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
 
getDistance() - Method in class com.wcohen.ss.AbstractStringDistance
Implements the StringDistanceLearner api by returning a StringDistance.
getDistance() - Method in interface com.wcohen.ss.api.DistanceInstance
 
getDistance() - Method in interface com.wcohen.ss.api.StringDistanceLearner
Return the learned distance.
getDistance() - Method in class com.wcohen.ss.AveragedStringDistanceLearner
 
getDistance() - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Get the final string distance, which is based on the distances learned by the inner learners and on the combination scheme learned by comboSetAnswer, comboTrain, etc.
getDistance() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
 
getDistance() - Method in class com.wcohen.ss.expt.Blocker.Pair
 
getDistance(int) - Method in class com.wcohen.ss.MultiStringAvgDistance
 
getDistance(int) - Method in class com.wcohen.ss.MultiStringDistance
Get the distance used for the i-th pair of fields
getDistanceLearnerIndex(int) - Method in class com.wcohen.ss.MultiStringWrapper
Return the index of the learner that will process internal field i
getDocumentFrequency(Token) - Method in class com.wcohen.ss.AbstractSourcedStatisticalTokenDistance
 
getDocumentFrequency(Token) - Method in class com.wcohen.ss.AbstractStatisticalTokenDistance
 
getDocumentFrequency(Token) - Method in class com.wcohen.ss.SourcedTFIDF
Get the document frequency of the token.
getDocumentFrequency(Token) - Method in class com.wcohen.ss.TFIDF
Get the document frequency of the token.
getEmissionCounters() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmExpectationEvaluator
 
getEmmisionParams() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
 
getEmmisions() - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
getEndedWord(String, int) - Static method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
If pos is ending a word in str: returns this word.
getEvalMatrix() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
 
getEvalMatrixSize() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
 
getEvalStringPos() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
 
getFieldIndex(int) - Method in class com.wcohen.ss.MultiStringWrapper
Return the field i of the original string associated with internal field i
getFilename() - Method in class com.wcohen.ss.expt.MatchData
 
getFilename() - Method in class com.wcohen.ss.expt.SourcedMatchData
 
getId() - Method in interface com.wcohen.ss.api.IdentifiedStringWrapper
 
getId() - Method in class com.wcohen.ss.expt.MatchData.Instance
 
getId() - Method in class com.wcohen.ss.expt.SourcedMatchData.Instance
 
getIndex() - Method in interface com.wcohen.ss.api.Token
 
getIndex() - Method in class com.wcohen.ss.tokens.BasicToken
 
getInnerDistances() - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Get an array of trained inner distances.
getInstance(String, int) - Method in class com.wcohen.ss.expt.MatchData
Get the j-th record for the named source.
getInstance(String, int) - Method in class com.wcohen.ss.expt.SourcedMatchData
Get the j-th record for the named source.
getIterator() - Method in class com.wcohen.ss.expt.MatchData
 
getIterator() - Method in class com.wcohen.ss.expt.SourcedMatchData
 
getLAlign() - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
 
getLambda() - Method in class com.wcohen.ss.JelinekMercerJS
 
getLearnedDistance() - Method in class com.wcohen.ss.expt.SpecialMatchExpt
 
getLegalStringPos(int, String) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsViterbiEvaluator
 
getLookupTime() - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Get the time used in performing the lookup
getMatched() - Method in class com.wcohen.ss.tokens.TagLinkToken
getMatched returns the number of matched characters.
getMaxFraction() - Method in class com.wcohen.ss.expt.AbbreviationsBlocker
 
getMaxFraction() - Method in class com.wcohen.ss.expt.TokenBlocker
 
getMaxInvertedIndexSize() - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
 
getMaxNGramSize() - Method in class com.wcohen.ss.expt.NGramBlocker
 
getMinNGramSize() - Method in class com.wcohen.ss.expt.NGramBlocker
 
getNewStateVal(double, double) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
 
getNumDeletionsInAcronym() - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
 
getPair(int) - Method in class com.wcohen.ss.expt.AbbreviationsBlocker
 
getPair(int) - Method in class com.wcohen.ss.expt.Blocker
Get the i-th candidate pair, as produced from most recently block()-ed data
getPair(int) - Method in class com.wcohen.ss.expt.NullBlocker
 
getPair(int) - Method in class com.wcohen.ss.expt.TokenBlocker
 
getPartialEndedWord(String, int) - Static method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
If pos is ending a word in str: returns this word.
getPartialStartedWord(String, int) - Static method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
If pos is starting a new word in str, returns this word.
getPartialWordParam(String, int) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
 
getPartialWordParam(String, int) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
 
getPartialWordParam(String, int) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
 
getProbability() - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
 
getPseudoCount() - Method in class com.wcohen.ss.DirichletJS
 
getRangeEnd() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
 
getRangeStart() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
 
getResult(int) - Method in interface com.wcohen.ss.lookup.FastLookup
Get the i'th string found by the last lookup
getResult(int) - Method in class com.wcohen.ss.lookup.RescoringSoftTFIDFDictionary
 
getResult(int) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Get the i'th string found by the last lookup
getSAlign() - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
 
getScore(int) - Method in interface com.wcohen.ss.lookup.FastLookup
Get the score of the i'th string found by the last lookup
getScore(int) - Method in class com.wcohen.ss.lookup.RescoringSoftTFIDFDictionary
 
getScore(int) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Get the score of the i'th string found by the last lookup
getScore() - Method in class com.wcohen.ss.TagLink.Candidates
getScore returns the score or distance between strings T and U.
getSource() - Method in interface com.wcohen.ss.api.SourcedStringWrapper
 
getSource() - Method in interface com.wcohen.ss.api.SourcedToken
 
getSource(int) - Method in class com.wcohen.ss.expt.MatchData
Get string identifier for i-th source
getSource() - Method in class com.wcohen.ss.expt.MatchData.Instance
 
getSource(int) - Method in class com.wcohen.ss.expt.SourcedMatchData
Get string identifier for i-th source
getSource() - Method in class com.wcohen.ss.expt.SourcedMatchData.Instance
 
getSource() - Method in class com.wcohen.ss.tokens.BasicSourcedToken
 
getStartedWord(String, int) - Static method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
If pos is starting a new word in str, returns this word.
getStrByEmission(String, T) - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
 
getTeacher() - Method in class com.wcohen.ss.lookup.SoftDictionary
Return a teacher that can 'train' a distance metric from the information in the dictionary.
getTokenMatchThreshold() - Method in class com.wcohen.ss.SoftTFIDF
 
getTokenMatchThreshold() - Method in class com.wcohen.ss.SourcedSoftTFIDF
 
getTokens() - Method in class com.wcohen.ss.SourcedTFIDF
Access the tokens of the last prepare()-ed string.
getTokens() - Method in class com.wcohen.ss.TFIDF
Access the tokens of the last prepare()-ed string.
getTPos() - Method in class com.wcohen.ss.TagLink.Candidates
getTPos returns the position of string T.
getTr() - Method in class com.wcohen.ss.tokens.TagLinkToken
getTr returns the constant value Gamma.
getTransitionCounters() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmExpectationEvaluator
 
getTransitionParams() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
 
getTransitions() - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
getUPos() - Method in class com.wcohen.ss.TagLink.Candidates
getUPos returns the position of string U.
getValue() - Method in interface com.wcohen.ss.api.Token
 
getValue(int) - Method in interface com.wcohen.ss.lookup.FastLookup
Get the value of the i'th string found by the last lookup
getValue(int) - Method in class com.wcohen.ss.lookup.RescoringSoftTFIDFDictionary
 
getValue(int) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Get the value of the i'th string found by the last lookup
getValue() - Method in class com.wcohen.ss.tokens.BasicToken
 
getVocabularySize() - Method in class com.wcohen.ss.TFIDF
 
getWeight(Token) - Method in class com.wcohen.ss.SourcedTFIDF
Access the weight of a token in the vector created for the last prepare()-ed string.
getWeight(Token) - Method in class com.wcohen.ss.TFIDF
Access the weight of a token in the vector created for the last prepare()-ed string.
getWindowSize(int) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
 
graphPrecisionRecall(PrintStream) - Method in class com.wcohen.ss.expt.MatchExpt
Graph interpolated precision vs recall
graphPrecisionRecall(PrintStream) - Method in class com.wcohen.ss.expt.SpecialMatchExpt
Graph interpolated precision vs recall
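As a rough illustration of what "interpolated precision vs recall" means in the graphPrecisionRecall entries above, the following self-contained sketch computes 11-point interpolated precision from a ranked list of candidate pairs. It does not use the com.wcohen.ss classes: the class InterpolatedPrecisionSketch and its method are hypothetical, and it assumes the usual information-retrieval definition (interpolated precision at recall level r is the highest precision observed at any recall of at least r), which may differ in detail from what MatchExpt computes.

```java
// Hypothetical sketch; not part of the com.wcohen.ss API.
public class InterpolatedPrecisionSketch {

    /**
     * ranked[i] is true if the i-th highest-scoring candidate pair is a correct match.
     * totalCorrect is the number of correct pairs that exist overall.
     * Returns interpolated precision at recall levels 0.0, 0.1, ..., 1.0.
     */
    public static double[] interpolated11Point(boolean[] ranked, int totalCorrect) {
        double[] interpolated = new double[11];
        if (totalCorrect <= 0) {
            return interpolated; // nothing to recall; all levels stay 0
        }
        int correctSoFar = 0;
        for (int i = 0; i < ranked.length; i++) {
            if (!ranked[i]) {
                continue; // precision only peaks at ranks holding a correct pair
            }
            correctSoFar++;
            double precision = correctSoFar / (double) (i + 1);
            // Recall here is correctSoFar / totalCorrect. This precision is a
            // candidate maximum for every level with level/10 <= recall,
            // tested in integer arithmetic to avoid rounding issues.
            for (int level = 0; level <= 10; level++) {
                if (level * totalCorrect <= 10 * correctSoFar
                        && precision > interpolated[level]) {
                    interpolated[level] = precision;
                }
            }
        }
        return interpolated;
    }

    public static void main(String[] args) {
        boolean[] ranked = {true, false, true, false};
        double[] p = interpolated11Point(ranked, 2);
        for (int level = 0; level <= 10; level++) {
            System.out.printf("recall %.1f -> precision %.3f%n", level / 10.0, p[level]);
        }
    }
}
```

Taking the maximum over all higher recall values makes the resulting curve non-increasing, which is why interpolated curves are easier to compare across distance functions than raw precision/recall points.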

H

hasAnswers() - Method in class com.wcohen.ss.api.StringDistanceTeacher
Return true if this teacher can answer more queries.
hasAnswers() - Method in class com.wcohen.ss.expt.BasicTeacher
 
hasAnswers() - Method in class com.wcohen.ss.expt.MatchDataTeacher
 
hashCode() - Method in class com.wcohen.ss.abbvGapsHmm.Acronym
 
hashCode() - Method in class com.wcohen.ss.BasicStringWrapper
 
hashCode() - Method in class com.wcohen.ss.tokens.BasicSourcedToken
 
hashCode() - Method in class com.wcohen.ss.tokens.BasicToken
 
hasNext() - Method in interface com.wcohen.ss.api.DistanceInstanceIterator
 
hasNext() - Method in interface com.wcohen.ss.api.StringWrapperIterator
 
hasNext() - Method in class com.wcohen.ss.BasicDistanceInstanceIterator
 
hasNext() - Method in class com.wcohen.ss.BasicSourcedStringWrapperIterator
 
hasNext() - Method in class com.wcohen.ss.BasicStringWrapperIterator
 
hasNext() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.JthDistanceInstanceIterator
 
hasNext() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.JthStringWrapperValueIterator
 
hasNext() - Method in class com.wcohen.ss.expt.MatchData.MatchIterator
 
hasNext() - Method in class com.wcohen.ss.expt.SourcedMatchData.MatchIterator
 
hasNextQuery() - Method in class com.wcohen.ss.AbstractStringDistance
Implements StringDistanceLearner api by informing a teacher if the learner has DistanceInstance queries.
hasNextQuery() - Method in interface com.wcohen.ss.api.StringDistanceLearner
Returns true if the learner has more queries to answer.
hasNextQuery() - Method in class com.wcohen.ss.CombinedStringDistanceLearner
See if someone has a query

I

IdentifiedStringWrapper - Interface in com.wcohen.ss.api
 
identity(int) - Static method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
 
increaseCounter(AbbvGapsHMM.Emissions, double) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmExpectationEvaluator
 
increaseCounter(AbbvGapsHMM.Transitions, double) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmExpectationEvaluator
 
INCREASING_DISTANCE - Static variable in interface com.wcohen.ss.api.DistanceInstance
 
index - Variable in class com.wcohen.ss.tokens.BasicToken
 
init(String[]) - Static method in class com.wcohen.ss.AbbreviationAlignment
 
initEvalMat() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
 
initEvalMat() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsViterbiEvaluator
 
initEvalMat() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
 
initEvalMat() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
 
initEvalParams() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
 
initEvalParams() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
 
initEvalParams() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
 
initEvalRange() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator.BackwardEvalParam
 
initEvalRange() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
Init evaluation range
initEvalRange() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator.ForwardEvalParam
 
initModelParamsAndCounters() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
 
initStartProbs() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
Initialize the starting probabilities for each state (hard coded).
innerDistances - Variable in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
 
innerDistanceString() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
Helper for 'toString()' that produces a description of the distances being combined.
innerLearners - Variable in class com.wcohen.ss.CombinedStringDistanceLearner
 
InputFormatException(String, int, String) - Constructor for exception com.wcohen.ss.expt.MatchData.InputFormatException
 
InputFormatException(String, int, String) - Constructor for exception com.wcohen.ss.expt.SourcedMatchData.InputFormatException
 
InsertSMatrix(StringWrapper, StringWrapper) - Constructor for class com.wcohen.ss.AffineGap.MatrixTrio.InsertSMatrix
 
InsertTMatrix(StringWrapper, StringWrapper) - Constructor for class com.wcohen.ss.AffineGap.MatrixTrio.InsertTMatrix
 
Instance(String, String, String) - Constructor for class com.wcohen.ss.expt.MatchData.Instance
 
Instance(String, String, String) - Constructor for class com.wcohen.ss.expt.SourcedMatchData.Instance
 
intern(String) - Method in interface com.wcohen.ss.api.Tokenizer
Convert a given string into a token.
intern(String) - Method in class com.wcohen.ss.tokens.CharacterTokenizer
 
intern(String) - Method in class com.wcohen.ss.tokens.NGramTokenizer
 
intern(String) - Method in class com.wcohen.ss.tokens.SimpleTokenizer
 
interpolated11PointPrecision() - Method in class com.wcohen.ss.expt.MatchExpt
Return an array of interpolated precision at various different recall levels.
interpolated11PointPrecision() - Method in class com.wcohen.ss.expt.SpecialMatchExpt
Return an array of interpolated precision at various different recall levels.
interpolated11PointRecallLevels() - Static method in class com.wcohen.ss.expt.MatchExpt
Return recall levels associated with the precision levels returned by interpolated11PointPrecision.
interpolated11PointRecallLevels() - Static method in class com.wcohen.ss.expt.SpecialMatchExpt
Return recall levels associated with the precision levels returned by interpolated11PointPrecision.
is - Variable in class com.wcohen.ss.AffineGap.MatrixTrio
 
isAtRangeEnd() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
 
isAtRangeStart() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
 
isCorrect() - Method in interface com.wcohen.ss.api.DistanceInstance
 
isCorrect() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
 
isCorrect() - Method in class com.wcohen.ss.expt.Blocker.Pair
 
isCurrentPartialWordMatchPositionAtWordStart() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator.BackwardEvalParam
 
isCurrentPartialWordMatchPositionAtWordStart() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
 
isCurrentPartialWordMatchPositionAtWordStart() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator.ForwardEvalParam
 
isInEvalRange() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator.BackwardEvalParam
 
isInEvalRange() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
Is in evaluation range
isInEvalRange() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator.ForwardEvalParam
 
isInStringMatchingRange() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator.BackwardEvalParam
 
isInStringMatchingRange() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
 
isInStringMatchingRange() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator.ForwardEvalParam
 
isLegalMultiStringWrapperSize(int) - Method in class com.wcohen.ss.MultiStringDistance
Check if a string has a valid number of fields.
isShortForm(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
isValidExpression(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
isValidShortForm(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
it - Variable in class com.wcohen.ss.AffineGap.MatrixTrio
 

J

Jaccard - Class in com.wcohen.ss
Jaccard distance implementation.
Jaccard(Tokenizer) - Constructor for class com.wcohen.ss.Jaccard
 
Jaccard() - Constructor for class com.wcohen.ss.Jaccard
 
Jaro - Class in com.wcohen.ss
Jaro distance metric.
Jaro() - Constructor for class com.wcohen.ss.Jaro
 
JaroTFIDF - Class in com.wcohen.ss
Soft TFIDF-based distance metric, extended to use "soft" token-matching with the Jaro distance metric.
JaroTFIDF() - Constructor for class com.wcohen.ss.JaroTFIDF
 
JaroWinkler - Class in com.wcohen.ss
Jaro distance metric, as extended by Winkler.
JaroWinkler() - Constructor for class com.wcohen.ss.JaroWinkler
 
JaroWinklerTFIDF - Class in com.wcohen.ss
Soft TFIDF-based distance metric, extended to use "soft" token-matching with the JaroWinkler distance metric.
JaroWinklerTFIDF() - Constructor for class com.wcohen.ss.JaroWinklerTFIDF
 
JelinekMercerJS - Class in com.wcohen.ss
Jensen-Shannon distance of two unigram language models, smoothed using Jelinek-Mercer mixture model.
JelinekMercerJS(Tokenizer, double) - Constructor for class com.wcohen.ss.JelinekMercerJS
 
JelinekMercerJS() - Constructor for class com.wcohen.ss.JelinekMercerJS
 
JensenShannonDistance - Class in com.wcohen.ss
Distance metrics based on Jensen-Shannon distance of two smoothed unigram language models.
JensenShannonDistance(Tokenizer) - Constructor for class com.wcohen.ss.JensenShannonDistance
 
JensenShannonDistance() - Constructor for class com.wcohen.ss.JensenShannonDistance
 
JthDistanceInstanceIterator(int, Iterator) - Constructor for class com.wcohen.ss.CombinedStringDistanceLearner.JthDistanceInstanceIterator
 
JthStringWrapperValueIterator(int, Iterator) - Constructor for class com.wcohen.ss.CombinedStringDistanceLearner.JthStringWrapperValueIterator
 

L

labelInstance(DistanceInstance) - Method in class com.wcohen.ss.api.StringDistanceTeacher
Label an instance queried by the learner.
labelInstance(DistanceInstance) - Method in class com.wcohen.ss.expt.BasicTeacher
 
labelInstance(DistanceInstance) - Method in class com.wcohen.ss.expt.MatchDataTeacher
 
learner(String) - Method in class com.wcohen.ss.expt.MatchExptScript
Load a learner.
length() - Method in interface com.wcohen.ss.api.StringWrapper
Return the length of the wrapped string
length() - Method in class com.wcohen.ss.BasicStringWrapper
 
Level2 - Class in com.wcohen.ss
Generic version of Monge & Elkan's "level 2" recursive field matching.
Level2(Tokenizer, StringDistance) - Constructor for class com.wcohen.ss.Level2
 
Level2Jaro - Class in com.wcohen.ss
"Level 2" recursive field matching algorithm, based on Jaro distance.
Level2Jaro() - Constructor for class com.wcohen.ss.Level2Jaro
 
Level2JaroWinkler - Class in com.wcohen.ss
"Level 2" recursive field matching algorithm, based on JaroWinkler distance.
Level2JaroWinkler() - Constructor for class com.wcohen.ss.Level2JaroWinkler
 
Level2Levenstein - Class in com.wcohen.ss
"Level 2" recursive field matching algorithm using Levenshtein distance.
Level2Levenstein() - Constructor for class com.wcohen.ss.Level2Levenstein
 
Level2MongeElkan - Class in com.wcohen.ss
Monge & Elkan's "level 2" recursive field matching algorithm.
Level2MongeElkan() - Constructor for class com.wcohen.ss.Level2MongeElkan
 
Levenstein - Class in com.wcohen.ss
Levenshtein string distance.
Levenstein() - Constructor for class com.wcohen.ss.Levenstein
 
load(File) - Method in class com.wcohen.ss.lookup.SoftDictionary
Insert all lines in a file as items mapping to themselves.
load(File, boolean) - Method in class com.wcohen.ss.lookup.SoftDictionary
Insert all lines in a file as items mapping to themselves.
loadAliases(File) - Method in class com.wcohen.ss.lookup.SoftDictionary
Load a file of identifiers, each of which has multiple aliases.
loadAliases(File) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Load a file of identifiers, each of which has multiple aliases.
loadLabels(String) - Static method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
loadModelParams() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
 
loadPredictor() - Method in class com.wcohen.ss.AbbreviationAlignment
 
loadPredictor() - Method in class com.wcohen.ss.expt.ExtractAbbreviations
 
loadTrainingCorpus(String) - Static method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
lookup(double, String) - Method in interface com.wcohen.ss.lookup.FastLookup
Lookup items similar to 'toFind', and return the number of items found.
lookup(double, String) - Method in class com.wcohen.ss.lookup.RescoringSoftTFIDFDictionary
 
lookup(String, String) - Method in class com.wcohen.ss.lookup.SoftDictionary
Lookup a string in the dictionary.
lookup(String, StringWrapper) - Method in class com.wcohen.ss.lookup.SoftDictionary
Lookup a prepared string in the dictionary.
lookup(String) - Method in class com.wcohen.ss.lookup.SoftDictionary
Lookup a string in the dictionary.
lookup(StringWrapper) - Method in class com.wcohen.ss.lookup.SoftDictionary
Lookup a prepared string in the dictionary.
lookup(double, String) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Lookup items SoftTFIDF-similar to the 'toFind' argument, and return the number of items found.
lookupDistance(String, String) - Method in class com.wcohen.ss.lookup.SoftDictionary
Return the distance to the best match.
lookupDistance(String, StringWrapper) - Method in class com.wcohen.ss.lookup.SoftDictionary
Return the distance to the best match.
lookupDistance(String) - Method in class com.wcohen.ss.lookup.SoftDictionary
Return the distance to the best match.
lookupDistance(StringWrapper) - Method in class com.wcohen.ss.lookup.SoftDictionary
Return the distance to the best match.
lookupTime - Variable in class com.wcohen.ss.lookup.SoftTFIDFDictionary
 

M

m - Variable in class com.wcohen.ss.AffineGap.MatrixTrio
 
main(String[]) - Static method in class com.wcohen.ss.AbbreviationAlignment
Evaluates the probability of the short-form string (string1) being an abbreviation/acronym of the long-form string (string2).
Usage: AbbreviationAlignment short_form_string long_form_string [train_data_dir]
main(String[]) - Static method in class com.wcohen.ss.AffineGap
 
main(String[]) - Static method in class com.wcohen.ss.ApproxNeedlemanWunsch
 
main(String[]) - Static method in class com.wcohen.ss.DirichletJS
 
main(String[]) - Static method in class com.wcohen.ss.DistanceLearnerFactory
Test routine.
main(String[]) - Static method in class com.wcohen.ss.expt.ExtractAbbreviations
Extracts abbreviation pairs from text.

Usage: ExtractAbbreviations input experiment_name [gold-file] [train-dir]
main(String[]) - Static method in class com.wcohen.ss.expt.MatchData
 
main(String[]) - Static method in class com.wcohen.ss.expt.MatchExpt
Command-line interface.
main(String[]) - Static method in class com.wcohen.ss.expt.MatchExptScript
 
main(String[]) - Static method in class com.wcohen.ss.expt.SourcedMatchData
 
main(String[]) - Static method in class com.wcohen.ss.expt.SpecialMatchExpt
Command-line interface.
main(String[]) - Static method in class com.wcohen.ss.Jaccard
 
main(String[]) - Static method in class com.wcohen.ss.Jaro
 
main(String[]) - Static method in class com.wcohen.ss.JaroTFIDF
 
main(String[]) - Static method in class com.wcohen.ss.JaroWinkler
 
main(String[]) - Static method in class com.wcohen.ss.JaroWinklerTFIDF
 
main(String[]) - Static method in class com.wcohen.ss.JelinekMercerJS
 
main(String[]) - Static method in class com.wcohen.ss.Level2
 
main(String[]) - Static method in class com.wcohen.ss.Level2Jaro
 
main(String[]) - Static method in class com.wcohen.ss.Level2JaroWinkler
 
main(String[]) - Static method in class com.wcohen.ss.Level2Levenstein
 
main(String[]) - Static method in class com.wcohen.ss.Level2MongeElkan
 
main(String[]) - Static method in class com.wcohen.ss.Levenstein
 
main(String[]) - Static method in class com.wcohen.ss.lookup.SoftDictionary
Simple main for testing.
main(String[]) - Static method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Simple main for testing and experimentation
main(String[]) - Static method in class com.wcohen.ss.Mixture
 
main(String[]) - Static method in class com.wcohen.ss.MongeElkan
 
main(String[]) - Static method in class com.wcohen.ss.MongeElkanTFIDF
 
main(String[]) - Static method in class com.wcohen.ss.MultiStringAvgDistance
 
main(String[]) - Static method in class com.wcohen.ss.MultiStringWrapper
 
main(String[]) - Static method in class com.wcohen.ss.NeedlemanWunsch
 
main(String[]) - Static method in class com.wcohen.ss.ScaledLevenstein
 
main(String[]) - Static method in class com.wcohen.ss.SmithWaterman
 
main(String[]) - Static method in class com.wcohen.ss.SoftTokenFelligiSunter
 
main(String[]) - Static method in class com.wcohen.ss.SourcedTFIDF
 
main(String[]) - Static method in class com.wcohen.ss.TFIDF
 
main(String[]) - Static method in class com.wcohen.ss.TokenFelligiSunter
 
main(String[]) - Static method in class com.wcohen.ss.tokens.CharacterTokenizer
Test routine
main(String[]) - Static method in class com.wcohen.ss.tokens.NGramTokenizer
Test routine
main(String[]) - Static method in class com.wcohen.ss.tokens.SimpleSourcedTokenizer
Test routine
main(String[]) - Static method in class com.wcohen.ss.tokens.SimpleTokenizer
Test routine
main(String[]) - Static method in class com.wcohen.ss.UnsmoothedJS
 
MatchData - Class in com.wcohen.ss.expt
Holds data for evaluating a distance metric.
MatchData(String) - Constructor for class com.wcohen.ss.expt.MatchData
Read match data from a file.
MatchData() - Constructor for class com.wcohen.ss.expt.MatchData
 
MatchData.InputFormatException - Exception in com.wcohen.ss.expt
Signals an incorrectly formatted MatchData file.
MatchData.Instance - Class in com.wcohen.ss.expt
A single item (aka record, string, etc) to match against others.
MatchData.MatchIterator - Class in com.wcohen.ss.expt
Iterates over all stored StringWrappers
MatchDataTeacher - Class in com.wcohen.ss.expt
Train a StringDistanceLearner using MatchData and a Blocker.
MatchDataTeacher(MatchData, Blocker) - Constructor for class com.wcohen.ss.expt.MatchDataTeacher
 
MatchExpt - Class in com.wcohen.ss.expt
Perform a matching experiment using a data file, distance function and blocker.
MatchExpt(MatchData, StringDistanceLearner, Blocker) - Constructor for class com.wcohen.ss.expt.MatchExpt
 
MatchExpt(MatchData, StringDistanceLearner) - Constructor for class com.wcohen.ss.expt.MatchExpt
 
MatchExptScript - Class in com.wcohen.ss.expt
Perform a series of match experiments, specified by a script in an input file.
MatchExptScript() - Constructor for class com.wcohen.ss.expt.MatchExptScript
 
MatchIterator(MatchData) - Constructor for class com.wcohen.ss.expt.MatchData.MatchIterator
 
MatchIterator(SourcedMatchData) - Constructor for class com.wcohen.ss.expt.SourcedMatchData.MatchIterator
 
matchScore(char, char) - Method in class com.wcohen.ss.CharMatchScore
 
Matrix3D - Class in com.wcohen.ss.abbvGapsHmm
M-by-N-by-K matrix implementation.
Matrix3D(int, int, int) - Constructor for class com.wcohen.ss.abbvGapsHmm.Matrix3D
 
Matrix3D(double[][][]) - Constructor for class com.wcohen.ss.abbvGapsHmm.Matrix3D
 
MatrixTrio(StringWrapper, StringWrapper) - Constructor for class com.wcohen.ss.AffineGap.MatrixTrio
 
max3(double, double, double) - Static method in class com.wcohen.ss.MemoMatrix
Return max of three numbers.
max4(double, double, double, double) - Static method in class com.wcohen.ss.MemoMatrix
Return max of four numbers.
maxF1() - Method in class com.wcohen.ss.expt.MatchExpt
max F1 for any threshold
maxF1() - Method in class com.wcohen.ss.expt.SpecialMatchExpt
max F1 for any threshold
maximizationStep() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
Returns the total change in model parameter values.
maximizationStepForEmissions() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
 
maximizationStepForTransitions() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
 
maxTokenIndex() - Method in interface com.wcohen.ss.api.Tokenizer
Return the highest index of any interned token
maxTokenIndex() - Method in class com.wcohen.ss.tokens.CharacterTokenizer
 
maxTokenIndex() - Method in class com.wcohen.ss.tokens.NGramTokenizer
 
maxTokenIndex() - Method in class com.wcohen.ss.tokens.SimpleTokenizer
 
MemoMatrix - Class in com.wcohen.ss
A matrix of doubles, defined recursively by the compute(i,j) method, that will not be recomputed more than necessary.
minus(Matrix3D) - Method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
 
Mixture - Class in com.wcohen.ss
Mixture-based distance metric.
Mixture(Tokenizer) - Constructor for class com.wcohen.ss.Mixture
 
Mixture() - Constructor for class com.wcohen.ss.Mixture
 
mkdir(String) - Method in class com.wcohen.ss.expt.ExtractAbbreviations
 
MongeElkan - Class in com.wcohen.ss
The match method proposed by Monge and Elkan.
MongeElkan() - Constructor for class com.wcohen.ss.MongeElkan
 
MongeElkanTFIDF - Class in com.wcohen.ss
Soft TFIDF-based distance metric, extended to use "soft" token-matching with the MongeElkan distance metric.
MongeElkanTFIDF() - Constructor for class com.wcohen.ss.MongeElkanTFIDF
 
mul(int, int, int, double) - Method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
 
MultiStringAvgDistance - Class in com.wcohen.ss
StringDistance defined over Strings that are broken into fields, with distance defined as the average distance between any field.
MultiStringAvgDistance(StringDistance, String) - Constructor for class com.wcohen.ss.MultiStringAvgDistance
 
MultiStringDistance - Class in com.wcohen.ss
Abstract StringDistance class defined over Strings that are broken into fields.
MultiStringDistance(String) - Constructor for class com.wcohen.ss.MultiStringDistance
 
MultiStringWrapper - Class in com.wcohen.ss
A StringWrapper that stores a version of the string that has been either (a) split into a number of distinct fields, or (b) duplicated k times, so that k different StringDistances can preprocess it, or (c) both of the above.
MultiStringWrapper(String, String) - Constructor for class com.wcohen.ss.MultiStringWrapper
Create a MultiStringWrapper by splitting s into fields based on the given delimiter.
MultiStringWrapper(String, int) - Constructor for class com.wcohen.ss.MultiStringWrapper
Create a MultiStringWrapper by making k copies of s.
MultiStringWrapper(String, int, String) - Constructor for class com.wcohen.ss.MultiStringWrapper
Create a MultiStringWrapper by making k copies of each field of s.
MyDistanceInstance(StringWrapper, StringWrapper, boolean, double) - Constructor for class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
 
MyMultiDistanceInstance(StringWrapper, StringWrapper, boolean, double) - Constructor for class com.wcohen.ss.CombinedStringDistanceLearner.MyMultiDistanceInstance
 

N

NeedlemanWunsch - Class in com.wcohen.ss
Needleman-Wunsch string distance, following Durbin et al.
NeedlemanWunsch() - Constructor for class com.wcohen.ss.NeedlemanWunsch
 
NeedlemanWunsch(CharMatchScore, double) - Constructor for class com.wcohen.ss.NeedlemanWunsch
 
next() - Method in interface com.wcohen.ss.api.DistanceInstanceIterator
 
next() - Method in interface com.wcohen.ss.api.StringWrapperIterator
 
next() - Method in class com.wcohen.ss.BasicDistanceInstanceIterator
 
next() - Method in class com.wcohen.ss.BasicSourcedStringWrapperIterator
 
next() - Method in class com.wcohen.ss.BasicStringWrapperIterator
 
next() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.JthDistanceInstanceIterator
 
next() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.JthStringWrapperValueIterator
 
next() - Method in class com.wcohen.ss.expt.MatchData.MatchIterator
Returns the next StringWrapper as an object.
next() - Method in class com.wcohen.ss.expt.SourcedMatchData.MatchIterator
Returns the next StringWrapper as an object.
nextDistanceInstance() - Method in interface com.wcohen.ss.api.DistanceInstanceIterator
 
nextDistanceInstance() - Method in class com.wcohen.ss.BasicDistanceInstanceIterator
 
nextDistanceInstance() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.JthDistanceInstanceIterator
 
nextQuery() - Method in class com.wcohen.ss.AbstractStringDistance
Implements StringDistanceLearner api by querying for DistanceInstance labels.
nextQuery() - Method in interface com.wcohen.ss.api.StringDistanceLearner
Returns a DistanceInstance for which the learner would like a label.
nextQuery() - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Get the next query from one of the sublearners.
nextSourcedStringWrapper() - Method in interface com.wcohen.ss.api.SourcedStringWrapperIterator
 
nextSourcedStringWrapper() - Method in class com.wcohen.ss.BasicSourcedStringWrapperIterator
 
nextSourcedStringWrapper() - Method in class com.wcohen.ss.expt.MatchData.MatchIterator
Return the next StringWrapper.
nextSourcedStringWrapper() - Method in class com.wcohen.ss.expt.SourcedMatchData.MatchIterator
Return the next StringWrapper.
nextStringWrapper() - Method in interface com.wcohen.ss.api.StringWrapperIterator
 
nextStringWrapper() - Method in class com.wcohen.ss.BasicSourcedStringWrapperIterator
 
nextStringWrapper() - Method in class com.wcohen.ss.BasicStringWrapperIterator
 
nextStringWrapper() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.JthStringWrapperValueIterator
 
nextStringWrapper() - Method in class com.wcohen.ss.expt.MatchData.MatchIterator
Return the next StringWrapper.
nextStringWrapper() - Method in class com.wcohen.ss.expt.SourcedMatchData.MatchIterator
Return the next StringWrapper.
NGramBlocker - Class in com.wcohen.ss.expt
Finds all pairs that share a not-too-common character n-gram.
NGramBlocker() - Constructor for class com.wcohen.ss.expt.NGramBlocker
 
NGramTokenizer - Class in com.wcohen.ss.tokens
Wraps another tokenizer, and computes all n-grams of characters from each single token produced by the inner tokenizer.
NGramTokenizer(int, int, boolean, Tokenizer) - Constructor for class com.wcohen.ss.tokens.NGramTokenizer
 
NullBlocker - Class in com.wcohen.ss.expt
Inefficient exhaustive blocking algorithm.
NullBlocker() - Constructor for class com.wcohen.ss.expt.NullBlocker
 
numCorrectPairs() - Method in class com.wcohen.ss.expt.AbbreviationsBlocker
 
numCorrectPairs() - Method in class com.wcohen.ss.expt.Blocker
Return total number of correct pairs in the dataset.
numCorrectPairs() - Method in class com.wcohen.ss.expt.NullBlocker
 
numCorrectPairs() - Method in class com.wcohen.ss.expt.TokenBlocker
 
numInstances(String) - Method in class com.wcohen.ss.expt.MatchData
Number of records for source with given string id
numInstances(String) - Method in class com.wcohen.ss.expt.SourcedMatchData
Number of records for source with given string id
numSources() - Method in class com.wcohen.ss.expt.MatchData
Number of sources in data set
numSources() - Method in class com.wcohen.ss.expt.SourcedMatchData
Number of sources in data set

O

offset(int) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator.BackwardEvalParam
 
offset(int) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
 
offset(int) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator.ForwardEvalParam
 
outOfRange(int, int) - Method in class com.wcohen.ss.ApproxMemoMatrix
 
outputAbbvs(Map<String, Acronym>) - Method in class com.wcohen.ss.expt.ExtractAbbreviations
 
outputPairs(BufferedWriter) - Method in class com.wcohen.ss.expt.ExtractAbbreviations
 

P

Pair(MatchData.Instance, MatchData.Instance, boolean) - Constructor for class com.wcohen.ss.expt.Blocker.Pair
 
pairsPerSecond() - Method in class com.wcohen.ss.expt.MatchExpt
Return total time to process data, divided by the number of pairs
pairsPerSecond() - Method in class com.wcohen.ss.expt.SpecialMatchExpt
Return total time to process data, divided by the number of pairs
parseCandidate(String, String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
plus(Matrix3D) - Method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
 
posIsAtWord(String, int) - Static method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
 
posIsAtWordEnd(String, int) - Static method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
 
posIsAtWordStart(String, int) - Static method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
 
precision - Variable in class com.wcohen.ss.expt.ExtractAbbreviations.Stats
 
precisionRecall() - Method in class com.wcohen.ss.expt.MatchExptScript
Show interpolated 11-pt precision curves for each blocker/distance/dataset
predict(String, String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
predict(Acronym) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
predict(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
predictAlignment(Acronym) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
predictAndTest(List<String>, List<Map<String, String>>) - Method in class com.wcohen.ss.expt.ExtractAbbreviations
 
predictAndTest(int, List<String>, List<Map<String, String>>, BufferedWriter) - Method in class com.wcohen.ss.expt.ExtractAbbreviations
 
prepare(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractSourcedTokenizedStringDistance
 
prepare(String) - Method in class com.wcohen.ss.AbstractStringDistance
Default way to preprocess a string for distance computation.
prepare(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractStringDistance
Implements StringDistanceLearner api by providing a way to prep a StringWrapperIterator for training.
prepare(DistanceInstanceIterator) - Method in class com.wcohen.ss.AbstractStringDistance
Implements StringDistanceLearner api by providing a way to prep a DistanceInstanceIterator for training.
prepare(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractTokenizedStringDistance
 
prepare(String) - Method in interface com.wcohen.ss.api.StringDistance
Preprocess a string for distance computation
prepare(StringWrapperIterator) - Method in interface com.wcohen.ss.api.StringDistanceLearner
Preprocess a StringWrapperIterator for unsupervised training.
prepare(DistanceInstanceIterator) - Method in interface com.wcohen.ss.api.StringDistanceLearner
Preprocess a DistanceInstanceIterator for supervised training.
prepare(String) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
 
prepare(StringWrapperIterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Prepare data for the sublearners.
prepare(DistanceInstanceIterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Prepare data for the learners.
prepare(String) - Method in class com.wcohen.ss.Jaccard
Preprocess a string by finding tokens.
prepare(String) - Method in class com.wcohen.ss.Jaro
 
prepare(String) - Method in class com.wcohen.ss.JensenShannonDistance
Preprocess a string by finding tokens and giving them weights W such that W is the smoothed probability of the token appearing in the document.
prepare(String) - Method in class com.wcohen.ss.lookup.SoftDictionary
Prepare a string for quicker lookup.
prepare(String) - Method in class com.wcohen.ss.MultiStringDistance
Prepare a string.
prepare(StringDistance[]) - Method in class com.wcohen.ss.MultiStringWrapper
Prepare each field with the appropriate distance
prepare(String) - Method in class com.wcohen.ss.SoftTokenFelligiSunter
Preprocess a string by finding tokens
prepare(String) - Method in class com.wcohen.ss.SourcedTFIDF
Preprocess a string by finding tokens and giving them TFIDF weights
prepare(String) - Method in class com.wcohen.ss.TagLink
Preprocess a string by finding tokens and giving them TFIDF weights
prepare(String) - Method in class com.wcohen.ss.TFIDF
Preprocess a string by finding tokens and giving them TFIDF weights
prepare(String) - Method in class com.wcohen.ss.TokenFelligiSunter
Preprocess a string by finding tokens and giving them appropriate weights
prepare(String) - Method in class com.wcohen.ss.WinklerRescorer
 
PrintfFormat - Class in com.wcohen.ss
PrintfFormat allows the formatting of an array of objects embedded within a string.
PrintfFormat(String) - Constructor for class com.wcohen.ss.PrintfFormat
Constructs an array of control specifications possibly preceded, separated, or followed by ordinary strings.
PrintfFormat(Locale, String) - Constructor for class com.wcohen.ss.PrintfFormat
Constructs an array of control specifications possibly preceded, separated, or followed by ordinary strings.
printNegativeValues - Variable in class com.wcohen.ss.MemoMatrix
 
prototype - Variable in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
 
put(String, String, Object) - Method in class com.wcohen.ss.lookup.SoftDictionary
Insert a string into the dictionary.
put(String, Object) - Method in class com.wcohen.ss.lookup.SoftDictionary
Insert a string into the dictionary.
put(String, StringWrapper, Object) - Method in class com.wcohen.ss.lookup.SoftDictionary
Insert a prepared string into the dictionary.
put(String, Object) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Insert a string into the dictionary, and associate it with the given value.

R

random(int, int, int) - Static method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
 
recall - Variable in class com.wcohen.ss.expt.ExtractAbbreviations.Stats
 
refreeze() - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
 
remove() - Method in class com.wcohen.ss.BasicDistanceInstanceIterator
 
remove() - Method in class com.wcohen.ss.BasicSourcedStringWrapperIterator
 
remove() - Method in class com.wcohen.ss.BasicStringWrapperIterator
 
remove() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.JthDistanceInstanceIterator
 
remove() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.JthStringWrapperValueIterator
 
remove() - Method in class com.wcohen.ss.expt.MatchData.MatchIterator
Not implemented.
remove() - Method in class com.wcohen.ss.expt.SourcedMatchData.MatchIterator
Not implemented.
RescoringSoftTFIDFDictionary - Class in com.wcohen.ss.lookup
Wrapper around a SoftTFIDFDictionary that allows you to 'rescore' the result using an arbitrary StringDistance.
RescoringSoftTFIDFDictionary(FastLookup, double, StringDistance) - Constructor for class com.wcohen.ss.lookup.RescoringSoftTFIDFDictionary
 
restore(String) - Method in class com.wcohen.ss.expt.MatchExptScript
Restore experimental data previously 'saved' to a file.
restore(File) - Static method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
 
retrunBestStates() - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
 
run() - Method in class com.wcohen.ss.expt.ExtractAbbreviations
 
runScript(String) - Method in class com.wcohen.ss.expt.MatchExptScript
Load commands from a file and execute them.

S

s - Variable in class com.wcohen.ss.MemoMatrix
 
sameId(MatchData.Instance) - Method in class com.wcohen.ss.expt.MatchData.Instance
 
sameId(SourcedMatchData.Instance) - Method in class com.wcohen.ss.expt.SourcedMatchData.Instance
 
sAt(int) - Method in class com.wcohen.ss.MemoMatrix
Get i-th char of s, indexing s from 1..n
save(String) - Method in class com.wcohen.ss.expt.MatchExptScript
Save current experimental data to a file
saveAs(File) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
 
saveModelParams() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
 
ScaledLevenstein - Class in com.wcohen.ss
Levenshtein string distance.
ScaledLevenstein() - Constructor for class com.wcohen.ss.ScaledLevenstein
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.AbbreviationAlignment
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.AbstractStringDistance
This method needs to be implemented by subclasses.
score(String, String) - Method in class com.wcohen.ss.AbstractStringDistance
Strings are scored by converting them to StringWrappers with the prepare function.
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.AffineGap
 
score(StringWrapper, StringWrapper) - Method in interface com.wcohen.ss.api.StringDistance
Find the distance between s and t.
score(String, String) - Method in interface com.wcohen.ss.api.StringDistance
Find the distance between s and t
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.ApproxNeedlemanWunsch
 
score(String, String) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Jaccard
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Jaro
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.JensenShannonDistance
Jensen-Shannon distance between distributions.
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Level2
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Mixture
Distance is argmax_lambda prod_{w in s} lambda Pr(w|t) * (1-lambda) Pr(w|background).
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.MongeElkan
Version of distance which is possibly scaled to [0,1].
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.MultiStringDistance
 
score(String, String) - Method in class com.wcohen.ss.MultiStringDistance
Strings are scored by converting them to StringWrappers with the prepare function.
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.NeedlemanWunsch
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.ScaledLevenstein
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SmithWaterman
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SoftTFIDF
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SoftTokenFelligiSunter
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SourcedSoftTFIDF
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SourcedTFIDF
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.TagLink
getStringMetric computes the similarity between a pair of strings T and U.
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.TFIDF
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.TokenFelligiSunter
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.tokens.TagLinkToken
score returns a string distance value between 0 and 1 for a pair of tokens.
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.WinklerRescorer
 
scoreCombination(double[]) - Method in class com.wcohen.ss.MultiStringAvgDistance
Combine the scores for each primitive distance function on each field.
scoreCombination(double[]) - Method in class com.wcohen.ss.MultiStringDistance
Combine the scores for each primitive distance function on each field.
SEPARATOR - Static variable in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
SEPARATOR - Static variable in class com.wcohen.ss.expt.ExtractAbbreviations
 
set(int, int, int, double) - Method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
 
set(int, StringWrapper) - Method in class com.wcohen.ss.MultiStringWrapper
Set the i-th field.
setClusterMode(boolean) - Method in class com.wcohen.ss.expt.Blocker
In clusterMode, consider pairings between instances from the same source.
setClusterMode(Boolean) - Method in class com.wcohen.ss.expt.Blocker
 
setCollectionSize(int) - Method in class com.wcohen.ss.SourcedTFIDF
Set the size of the collection that this TFIDF measure was trained on to some value.
setCollectionSize(int) - Method in class com.wcohen.ss.TFIDF
Set the size of the collection that this TFIDF measure was trained on to some value.
setDistance(double) - Method in interface com.wcohen.ss.api.DistanceInstance
 
setDistance(double) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
 
setDistance(double) - Method in class com.wcohen.ss.expt.Blocker.Pair
 
setDistanceInstancePool(DistanceInstanceIterator) - Method in class com.wcohen.ss.AbstractStringDistance
Implements StringDistanceLearner api by providing a way to accept a pool of unlabeled DistanceInstance's.
setDistanceInstancePool(DistanceInstanceIterator) - Method in interface com.wcohen.ss.api.StringDistanceLearner
Accept a set of unlabeled DistanceInstance, to use in making distance instance queries.
setDistanceInstancePool(DistanceInstanceIterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Pass the training data along to the inner learners.
setDocumentFrequency(Token, int) - Method in class com.wcohen.ss.SourcedTFIDF
Set the document frequency of the token to some value.
setDocumentFrequency(Token, int) - Method in class com.wcohen.ss.TFIDF
Set the document frequency of the token to some value.
setIgnoreCase(boolean) - Method in class com.wcohen.ss.tokens.CharacterTokenizer
 
setIgnoreCase(boolean) - Method in class com.wcohen.ss.tokens.SimpleTokenizer
 
setIgnorePunctuation(boolean) - Method in class com.wcohen.ss.tokens.CharacterTokenizer
 
setIgnorePunctuation(boolean) - Method in class com.wcohen.ss.tokens.SimpleTokenizer
 
setLambda(double) - Method in class com.wcohen.ss.JelinekMercerJS
 
setLambda(Double) - Method in class com.wcohen.ss.JelinekMercerJS
 
setMaxFraction(double) - Method in class com.wcohen.ss.expt.AbbreviationsBlocker
 
setMaxFraction(double) - Method in class com.wcohen.ss.expt.TokenBlocker
 
setMaxInvertedIndexSize(int) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Set the maximum size of an inverted index that will be followed.
setMaxNGramSize(int) - Method in class com.wcohen.ss.expt.NGramBlocker
 
setMinNGramSize(int) - Method in class com.wcohen.ss.expt.NGramBlocker
 
setMismatchFactor(double) - Method in class com.wcohen.ss.SoftTokenFelligiSunter
 
setMismatchFactor(Double) - Method in class com.wcohen.ss.SoftTokenFelligiSunter
 
setMismatchFactor(double) - Method in class com.wcohen.ss.TokenFelligiSunter
 
setMismatchFactor(Double) - Method in class com.wcohen.ss.TokenFelligiSunter
 
setModelParamsFile(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
setModelParamsFile() - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
setParamFile(String) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
 
setPartialWord(String, boolean) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator.BackwardEvalParam
 
setPartialWord(String, boolean) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
 
setPartialWord(String, boolean) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator.ForwardEvalParam
 
setPseudoCount(double) - Method in class com.wcohen.ss.DirichletJS
 
setPseudoCount(Double) - Method in class com.wcohen.ss.DirichletJS
 
setScaling(boolean) - Method in class com.wcohen.ss.MongeElkan
If scaling is true, then distances are scaled to 0-1
setScaling(Double) - Method in class com.wcohen.ss.MongeElkan
For interfacing with reflection in MatchExptScript.
setStartingParams(List<Double>, List<Double>) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
 
setStartingParams(List<Double>, List<Double>) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
setStringWrapperPool(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractSourcedTokenizedStringDistance
 
setStringWrapperPool(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractStringDistance
Implements the StringDistanceLearner api, by providing a way to accumulate statistics for a set of related strings.
setStringWrapperPool(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractTokenizedStringDistance
 
setStringWrapperPool(StringWrapperIterator) - Method in interface com.wcohen.ss.api.StringDistanceLearner
Unsupervised learning method that observes strings for which distance will be computed.
setStringWrapperPool(StringWrapperIterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Pass the training data along to the inner learners.
setTfIdfData(String) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
 
setTfIdfData(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
setTokenCount(int) - Method in class com.wcohen.ss.TFIDF
 
setTokenMatchThreshold(double) - Method in class com.wcohen.ss.SoftTFIDF
 
setTokenMatchThreshold(Double) - Method in class com.wcohen.ss.SoftTFIDF
 
setTokenMatchThreshold(double) - Method in class com.wcohen.ss.SoftTokenFelligiSunter
 
setTokenMatchThreshold(Double) - Method in class com.wcohen.ss.SoftTokenFelligiSunter
 
setTokenMatchThreshold(double) - Method in class com.wcohen.ss.SourcedSoftTFIDF
 
setTokenMatchThreshold(Double) - Method in class com.wcohen.ss.SourcedSoftTFIDF
 
setTrainDir(String) - Method in class com.wcohen.ss.AbbreviationAlignment
 
setTrainDir(String) - Method in class com.wcohen.ss.expt.ExtractAbbreviations
 
setTrainingDataDir(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
setTreshold(double) - Method in class com.wcohen.ss.tokens.TagLinkToken
setTreshold sets a new value for the constant Gamma.
setWidth(int) - Method in class com.wcohen.ss.ApproxNeedlemanWunsch
 
setWindowSize(int) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Set the 'windowSize' used for finding similar tokens.
show(String) - Method in class com.wcohen.ss.expt.MatchExptScript
Show datasets, blockers, or learners.
SimpleSourcedTokenizer - Class in com.wcohen.ss.tokens
Simple implementation of a Tokenizer.
SimpleSourcedTokenizer(boolean, boolean) - Constructor for class com.wcohen.ss.tokens.SimpleSourcedTokenizer
 
SimpleTokenizer - Class in com.wcohen.ss.tokens
Simple implementation of a Tokenizer.
SimpleTokenizer(boolean, boolean) - Constructor for class com.wcohen.ss.tokens.SimpleTokenizer
 
size() - Method in class com.wcohen.ss.expt.AbbreviationsBlocker
 
size() - Method in class com.wcohen.ss.expt.Blocker
Return number of candidate pairs, as produced from most recently block()-ed data
size() - Method in class com.wcohen.ss.expt.NullBlocker
 
size() - Method in class com.wcohen.ss.expt.TokenBlocker
 
size() - Method in class com.wcohen.ss.lookup.SoftDictionary
Return the number of entries in the dictionary.
size() - Method in class com.wcohen.ss.MultiStringWrapper
Return number of fields.
slowLookup(double, String) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Exactly like lookup, but works by exhaustively checking every stored string.
SmithWaterman - Class in com.wcohen.ss
Smith-Waterman string distance, following Durbin et al.
SmithWaterman() - Constructor for class com.wcohen.ss.SmithWaterman
 
SmithWaterman(CharMatchScore, double) - Constructor for class com.wcohen.ss.SmithWaterman
 
smoothCounter(int, List<Double>, List<Double>) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
Dirichlet smoothing. Without a prior: P(data | theta) = theta(i)^beta(i) = counters(i). With a Dirichlet prior: P(data | theta) * p(theta) = theta(i)^(beta(i) + alpha(i)) = theta(i)^beta(i) * theta(i)^alpha(i); counters(i) + params(i)^alpha(i).
smoothedProbability(Token, double, double) - Method in class com.wcohen.ss.DirichletJS
smoothed probability of the token
smoothedProbability(Token, double, double) - Method in class com.wcohen.ss.JelinekMercerJS
smoothed probability of the token
smoothedProbability(Token, double, double) - Method in class com.wcohen.ss.JensenShannonDistance
Smoothed probability of the token with frequency freq in a bag with the given totalWeight
smoothedProbability(Token, double, double) - Method in class com.wcohen.ss.UnsmoothedJS
Unsmoothed probability of the token
SoftDictionary - Class in com.wcohen.ss.lookup
Looks up nearly-matching strings in a dictionary, using a string distance.
SoftDictionary() - Constructor for class com.wcohen.ss.lookup.SoftDictionary
 
SoftDictionary(StringDistanceLearner) - Constructor for class com.wcohen.ss.lookup.SoftDictionary
 
SoftDictionary(Tokenizer) - Constructor for class com.wcohen.ss.lookup.SoftDictionary
 
SoftDictionary(StringDistanceLearner, Tokenizer) - Constructor for class com.wcohen.ss.lookup.SoftDictionary
 
SoftTFIDF - Class in com.wcohen.ss
TFIDF-based distance metric, extended to use "soft" token-matching.
SoftTFIDF(Tokenizer, StringDistance, double) - Constructor for class com.wcohen.ss.SoftTFIDF
 
SoftTFIDF(StringDistance, double) - Constructor for class com.wcohen.ss.SoftTFIDF
 
SoftTFIDF(StringDistance) - Constructor for class com.wcohen.ss.SoftTFIDF
 
SoftTFIDF() - Constructor for class com.wcohen.ss.SoftTFIDF
 
SoftTFIDFDictionary - Class in com.wcohen.ss.lookup
Looks up nearly-matching strings in a dictionary, using SoftTFIDF distance.
SoftTFIDFDictionary() - Constructor for class com.wcohen.ss.lookup.SoftTFIDFDictionary
 
SoftTFIDFDictionary(Tokenizer) - Constructor for class com.wcohen.ss.lookup.SoftTFIDFDictionary
 
SoftTFIDFDictionary(Tokenizer, double) - Constructor for class com.wcohen.ss.lookup.SoftTFIDFDictionary
 
SoftTFIDFDictionary(Tokenizer, double, int, int) - Constructor for class com.wcohen.ss.lookup.SoftTFIDFDictionary
Create a new SoftTFIDFDictionary.
SoftTokenFelligiSunter - Class in com.wcohen.ss
Highly simplified model of Fellegi-Sunter's method 1, applied to tokens.
SoftTokenFelligiSunter(Tokenizer, StringDistance, double, double) - Constructor for class com.wcohen.ss.SoftTokenFelligiSunter
 
SoftTokenFelligiSunter() - Constructor for class com.wcohen.ss.SoftTokenFelligiSunter
 
SourcedMatchData - Class in com.wcohen.ss.expt
Holds data for evaluating a distance metric.
SourcedMatchData(String) - Constructor for class com.wcohen.ss.expt.SourcedMatchData
Read match data from a file.
SourcedMatchData() - Constructor for class com.wcohen.ss.expt.SourcedMatchData
 
SourcedMatchData.InputFormatException - Exception in com.wcohen.ss.expt
Signals an incorrectly formatted MatchData file.
SourcedMatchData.Instance - Class in com.wcohen.ss.expt
A single item (aka record, string, etc) to match against others.
SourcedMatchData.MatchIterator - Class in com.wcohen.ss.expt
Iterates over all stored StringWrappers
SourcedSoftTFIDF - Class in com.wcohen.ss
TFIDF-based distance metric, extended to use "soft" token-matching.
SourcedSoftTFIDF(SourcedTokenizer, StringDistance, double) - Constructor for class com.wcohen.ss.SourcedSoftTFIDF
 
SourcedSoftTFIDF(StringDistance, double) - Constructor for class com.wcohen.ss.SourcedSoftTFIDF
 
SourcedSoftTFIDF(StringDistance) - Constructor for class com.wcohen.ss.SourcedSoftTFIDF
 
SourcedSoftTFIDF() - Constructor for class com.wcohen.ss.SourcedSoftTFIDF
 
SourcedStringWrapper - Interface in com.wcohen.ss.api
 
SourcedStringWrapperIterator - Interface in com.wcohen.ss.api
An iterator over StringWrapper objects.
SourcedTFIDF - Class in com.wcohen.ss
Sourced-based distance metric.
SourcedTFIDF(SourcedTokenizer) - Constructor for class com.wcohen.ss.SourcedTFIDF
 
SourcedTFIDF() - Constructor for class com.wcohen.ss.SourcedTFIDF
 
SourcedTFIDF.UnitVector - Class in com.wcohen.ss
Marker class extending BagOfTokens
SourcedToken - Interface in com.wcohen.ss.api
An interned version of a string, with provenance information.
sourcedTokenize(String, String) - Method in interface com.wcohen.ss.api.SourcedTokenizer
Return tokenized version of a string, as an array of SourcedToken objects.
sourcedTokenize(String, String) - Method in class com.wcohen.ss.tokens.SimpleSourcedTokenizer
Return tokenized version of a string.
SourcedTokenizer - Interface in com.wcohen.ss.api
Split a string into tokens, retaining provenance.
SpecialMatchExpt - Class in com.wcohen.ss.expt
Perform a matching experiment using a vocabulary stats file, data file, distance function and blocker.
SpecialMatchExpt(MatchData, StringDistanceLearner, Blocker, boolean, String, String, boolean) - Constructor for class com.wcohen.ss.expt.SpecialMatchExpt
 
sprintf(Object[]) - Method in class com.wcohen.ss.PrintfFormat
Format an array of objects.
sprintf() - Method in class com.wcohen.ss.PrintfFormat
Format nothing.
sprintf(int) - Method in class com.wcohen.ss.PrintfFormat
Format an int.
sprintf(long) - Method in class com.wcohen.ss.PrintfFormat
Format a long.
sprintf(double) - Method in class com.wcohen.ss.PrintfFormat
Format a double.
sprintf(String) - Method in class com.wcohen.ss.PrintfFormat
Format a String.
sprintf(Object) - Method in class com.wcohen.ss.PrintfFormat
Format an Object.
Stats() - Constructor for class com.wcohen.ss.expt.ExtractAbbreviations.Stats
 
StringDistance - Interface in com.wcohen.ss.api
Compute the difference between pairs of strings.
StringDistanceLearner - Interface in com.wcohen.ss.api
Learn a StringDistance.
StringDistanceTeacher - Class in com.wcohen.ss.api
Train a StringDistanceLearner and return the learned StringDistance, using some unspecified source of information to train the learner.
StringDistanceTeacher() - Constructor for class com.wcohen.ss.api.StringDistanceTeacher
 
StringWrapper - Interface in com.wcohen.ss.api
Something that implements some of the functionality of Java's String class, but which is a non-final class, and hence can also cache additional information to facilitate later processing.
stringWrapperIterator() - Method in class com.wcohen.ss.api.StringDistanceTeacher
Strings over which distances will be computed.
StringWrapperIterator - Interface in com.wcohen.ss.api
An iterator over StringWrapper objects.
stringWrapperIterator() - Method in class com.wcohen.ss.expt.BasicTeacher
 
stringWrapperIterator() - Method in class com.wcohen.ss.expt.MatchDataTeacher
 

T

t - Variable in class com.wcohen.ss.MemoMatrix
 
table(String) - Method in class com.wcohen.ss.expt.MatchExptScript
Show a table of some expt-wide numeric measurement.
TagLink - Class in com.wcohen.ss
 
TagLink() - Constructor for class com.wcohen.ss.TagLink
TagLink default constructor.
TagLink(AbstractStringDistance) - Constructor for class com.wcohen.ss.TagLink
TagLink constructor requires a character based string metric.
TagLink(Tokenizer, AbstractStringDistance) - Constructor for class com.wcohen.ss.TagLink
TagLink constructor requires a tokenizer and a tokenDistance metric.
TagLink(String[]) - Constructor for class com.wcohen.ss.TagLink
TagLink constructor requires dataset string array in order to compute the IDF weights.
TagLink(String[], AbstractStringDistance) - Constructor for class com.wcohen.ss.TagLink
TagLink constructor requires dataset string array in order to compute the IDF weights and a tokenDistance metric.
TagLink.Candidates - Class in com.wcohen.ss
 
TagLink.UnitVector - Class in com.wcohen.ss
Marker class extending BagOfTokens.
TagLinkToken - Class in com.wcohen.ss.tokens
 
TagLinkToken() - Constructor for class com.wcohen.ss.tokens.TagLinkToken
TagLinkToken default constructor.
TagLinkToken(double) - Constructor for class com.wcohen.ss.tokens.TagLinkToken
TagLinkToken constructor.
tAt(int) - Method in class com.wcohen.ss.MemoMatrix
Get the i-th char of t, indexing t from 1.
TFIDF - Class in com.wcohen.ss
TFIDF-based distance metric.
TFIDF(Tokenizer) - Constructor for class com.wcohen.ss.TFIDF
 
TFIDF() - Constructor for class com.wcohen.ss.TFIDF
 
TFIDF.UnitVector - Class in com.wcohen.ss
Marker class extending BagOfTokens.
time() - Method in class com.wcohen.ss.expt.MatchExpt
Return total time to process data.
time() - Method in class com.wcohen.ss.expt.SpecialMatchExpt
Return total time to process data.
times(double) - Method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
 
TN - Variable in class com.wcohen.ss.expt.ExtractAbbreviations.Stats
 
Token - Interface in com.wcohen.ss.api
An interned version of a string.
TokenBlocker - Class in com.wcohen.ss.expt
Finds all pairs that share a not-too-common token.
TokenBlocker(Tokenizer, double) - Constructor for class com.wcohen.ss.expt.TokenBlocker
 
TokenBlocker() - Constructor for class com.wcohen.ss.expt.TokenBlocker
 
TokenFelligiSunter - Class in com.wcohen.ss
Highly simplified model of Fellegi-Sunter's method 1, applied to tokens.
TokenFelligiSunter(Tokenizer, double) - Constructor for class com.wcohen.ss.TokenFelligiSunter
 
TokenFelligiSunter() - Constructor for class com.wcohen.ss.TokenFelligiSunter
 
tokenIterator() - Method in class com.wcohen.ss.AbstractStatisticalTokenDistance
 
tokenIterator() - Method in interface com.wcohen.ss.api.Tokenizer
Return an iterator over interned tokens.
tokenIterator() - Method in class com.wcohen.ss.tokens.CharacterTokenizer
 
tokenIterator() - Method in class com.wcohen.ss.tokens.NGramTokenizer
 
tokenIterator() - Method in class com.wcohen.ss.tokens.SimpleTokenizer
 
tokenize(String) - Method in interface com.wcohen.ss.api.Tokenizer
Return tokenized version of a string.
tokenize(String) - Method in class com.wcohen.ss.tokens.CharacterTokenizer
Return tokenized version of a string.
tokenize(String) - Method in class com.wcohen.ss.tokens.NGramTokenizer
Return tokenized version of a string.
tokenize(String) - Method in class com.wcohen.ss.tokens.SimpleTokenizer
Return tokenized version of a string.
tokenizer - Variable in class com.wcohen.ss.AbstractSourcedTokenizedStringDistance
 
tokenizer - Variable in class com.wcohen.ss.AbstractTokenizedStringDistance
 
Tokenizer - Interface in com.wcohen.ss.api
Split a string into tokens.
tokenizer - Variable in class com.wcohen.ss.expt.AbbreviationsBlocker
 
tokenizer - Variable in class com.wcohen.ss.expt.TokenBlocker
 
toString() - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
 
toString() - Method in class com.wcohen.ss.abbvGapsHmm.Acronym
 
toString() - Method in class com.wcohen.ss.ApproxMemoMatrix
Print the matrix, for debugging and/or explanation.
toString() - Method in class com.wcohen.ss.BasicStringWrapper
 
toString() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyMultiDistanceInstance
 
toString() - Method in class com.wcohen.ss.DirichletJS
 
toString() - Method in class com.wcohen.ss.expt.AbbreviationsBlocker
 
toString() - Method in class com.wcohen.ss.expt.Blocker.Pair
 
toString() - Method in class com.wcohen.ss.expt.ClusterNGramBlocker
 
toString() - Method in class com.wcohen.ss.expt.ClusterTokenBlocker
 
toString() - Method in class com.wcohen.ss.expt.MatchData.Instance
 
toString() - Method in class com.wcohen.ss.expt.MatchData
 
toString() - Method in class com.wcohen.ss.expt.MatchExpt
 
toString() - Method in class com.wcohen.ss.expt.NGramBlocker
 
toString() - Method in class com.wcohen.ss.expt.NullBlocker
 
toString() - Method in class com.wcohen.ss.expt.SourcedMatchData.Instance
 
toString() - Method in class com.wcohen.ss.expt.SourcedMatchData
 
toString() - Method in class com.wcohen.ss.expt.SpecialMatchExpt
 
toString() - Method in class com.wcohen.ss.expt.TokenBlocker
 
toString() - Method in class com.wcohen.ss.Jaccard
 
toString() - Method in class com.wcohen.ss.Jaro
 
toString() - Method in class com.wcohen.ss.JaroTFIDF
 
toString() - Method in class com.wcohen.ss.JaroWinklerTFIDF
 
toString() - Method in class com.wcohen.ss.JelinekMercerJS
 
toString() - Method in class com.wcohen.ss.Level2
 
toString() - Method in class com.wcohen.ss.Level2Jaro
 
toString() - Method in class com.wcohen.ss.Level2JaroWinkler
 
toString() - Method in class com.wcohen.ss.Level2Levenstein
 
toString() - Method in class com.wcohen.ss.Level2MongeElkan
 
toString() - Method in class com.wcohen.ss.Levenstein
 
toString() - Method in class com.wcohen.ss.MemoMatrix
Print the matrix, for debugging and/or explanation.
toString() - Method in class com.wcohen.ss.Mixture
 
toString() - Method in class com.wcohen.ss.MongeElkan
 
toString() - Method in class com.wcohen.ss.MongeElkanTFIDF
 
toString() - Method in class com.wcohen.ss.MultiStringWrapper
 
toString() - Method in class com.wcohen.ss.ScaledLevenstein
 
toString() - Method in class com.wcohen.ss.SmithWaterman
 
toString() - Method in class com.wcohen.ss.SoftTFIDF
 
toString() - Method in class com.wcohen.ss.SoftTokenFelligiSunter
 
toString() - Method in class com.wcohen.ss.SourcedSoftTFIDF
 
toString() - Method in class com.wcohen.ss.SourcedTFIDF
 
toString() - Method in class com.wcohen.ss.TagLink
toString returns the name and parameters of this string metric.
toString() - Method in class com.wcohen.ss.TFIDF
 
toString() - Method in class com.wcohen.ss.TokenFelligiSunter
 
toString() - Method in class com.wcohen.ss.tokens.BasicSourcedToken
 
toString() - Method in class com.wcohen.ss.tokens.BasicToken
 
toString() - Method in class com.wcohen.ss.tokens.CharacterTokenizer
 
toString() - Method in class com.wcohen.ss.tokens.SimpleTokenizer
 
toString() - Method in class com.wcohen.ss.tokens.TagLinkToken
toString returns the name of the string metric.
toString() - Method in class com.wcohen.ss.UnsmoothedJS
 
toString() - Method in class com.wcohen.ss.WinklerRescorer
 
toStringNoStates() - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
 
toStringWithEmissions() - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
 
toStringWithFrequency() - Method in class com.wcohen.ss.abbvGapsHmm.Acronym
 
toStringWithStates() - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
 
totalTokenCount - Variable in class com.wcohen.ss.AbstractSourcedStatisticalTokenDistance
 
totalTokenCount - Variable in class com.wcohen.ss.AbstractStatisticalTokenDistance
 
TP - Variable in class com.wcohen.ss.expt.ExtractAbbreviations.Stats
 
train(List<List<Acronym>>, List<Map<String, String>>) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
 
train(List<List<Acronym>>, List<Map<String, String>>, boolean) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
 
train(List<String>, List<Integer>, List<Map<String, String>>) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
train(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractSourcedStatisticalTokenDistance
Accumulate statistics on how often each token value occurs.
train(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractSourcedTokenizedStringDistance
 
train(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractStatisticalTokenDistance
Accumulate statistics on how often each token value occurs.
train(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractTokenizedStringDistance
 
train(StringDistanceLearner) - Method in class com.wcohen.ss.api.StringDistanceTeacher
 
train(StringWrapperIterator) - Method in class com.wcohen.ss.JensenShannonDistance
Accumulate statistics on how often each token occurs.
train(StringWrapperIterator) - Method in class com.wcohen.ss.Level2
Training is a null operation.
trainCorpus(List<List<Acronym>>, List<Map<String, String>>) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
 
trainIfNeeded() - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
trainOnAll() - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
trainOnCandidates() - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
transitionIsLegal(int, int, AbbvGapsHMM.States, int, int, AbbvGapsHMM.States, AbbvGapsHMM.Transitions, AbbvGapsHMM.Emissions) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
 
transitionIsLegal(int, int, AbbvGapsHMM.States, int, int, AbbvGapsHMM.States, AbbvGapsHMM.Transitions, AbbvGapsHMM.Emissions) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
 
transitionIsLegal(int, int, AbbvGapsHMM.States, int, int, AbbvGapsHMM.States, AbbvGapsHMM.Transitions, AbbvGapsHMM.Emissions) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
 

U

UnitVector(String, SourcedToken[]) - Constructor for class com.wcohen.ss.SourcedTFIDF.UnitVector
 
UnitVector(BagOfSourcedTokens) - Constructor for class com.wcohen.ss.SourcedTFIDF.UnitVector
 
UnitVector(String, Token[]) - Constructor for class com.wcohen.ss.TagLink.UnitVector
 
UnitVector(BagOfTokens) - Constructor for class com.wcohen.ss.TagLink.UnitVector
 
UnitVector(String, Token[]) - Constructor for class com.wcohen.ss.TFIDF.UnitVector
 
UnitVector(BagOfTokens) - Constructor for class com.wcohen.ss.TFIDF.UnitVector
 
UnsmoothedJS - Class in com.wcohen.ss
Jensen-Shannon distance of two unsmoothed unigram language models.
UnsmoothedJS() - Constructor for class com.wcohen.ss.UnsmoothedJS
 
unwrap() - Method in interface com.wcohen.ss.api.StringWrapper
Return the string that is wrapped.
unwrap() - Method in class com.wcohen.ss.BasicStringWrapper
 
updateLegalOutgoingEdges(int, int, AbbvGapsHMM.States, int, int, AbbvGapsHMM.States, AbbvGapsHMM.Transitions, AbbvGapsHMM.Emissions) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
 
updateLegalOutgoingEdges(int, int, AbbvGapsHMM.States, int, int, AbbvGapsHMM.States, AbbvGapsHMM.Transitions, AbbvGapsHMM.Emissions) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsViterbiEvaluator
 
updateLegalOutgoingEdges(int, int, AbbvGapsHMM.States, int, int, AbbvGapsHMM.States, AbbvGapsHMM.Transitions, AbbvGapsHMM.Emissions) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
 
updateLegalOutgoingEdges(int, int, AbbvGapsHMM.States, int, int, AbbvGapsHMM.States, AbbvGapsHMM.Transitions, AbbvGapsHMM.Emissions) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmExpectationEvaluator
 
updateLegalOutgoingEdges(int, int, AbbvGapsHMM.States, int, int, AbbvGapsHMM.States, AbbvGapsHMM.Transitions, AbbvGapsHMM.Emissions) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
 
updateOutgoingEdges(int, int, AbbvGapsHMM.States, int, int, AbbvGapsHMM.States, AbbvGapsHMM.Transitions, AbbvGapsHMM.Emissions) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
 
updateOutgoingEdgesStateD(int, int, int, int, AbbvGapsHMM.Emissions) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
Enumerates the possible transitions into state D.
updateOutgoingEdgesStateDL(int, int, int, int, AbbvGapsHMM.Emissions) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
Enumerates the possible transitions into state DL.
updateOutgoingEdgesStateEND(int, int, int, int, AbbvGapsHMM.Emissions) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
Enumerates the possible transitions into state END.
updateOutgoingEdgesStateM(int, int, int, int, AbbvGapsHMM.Emissions) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
Enumerates the possible transitions into state M.
updateState(AbbvGapsHMM.States, int, int, int, int, AbbvGapsHMM.Emissions) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
 
useTDIDF() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
 

V

val() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
 
value - Variable in class com.wcohen.ss.tokens.BasicToken
 
valueOf(String) - Static method in enum com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM.Emissions
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM.States
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM.Transitions
Returns the enum constant of this type with the specified name.
values() - Static method in enum com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM.Emissions
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM.States
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM.Transitions
Returns an array containing the constants of this enum type, in the order they are declared.
viterbi(Acronym) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
 

W

WinklerRescorer - Class in com.wcohen.ss
Winkler's reweighting scheme for distance metrics.
WinklerRescorer(StringDistance) - Constructor for class com.wcohen.ss.WinklerRescorer
Rescore the innerDistance's scores, to account for the subjectively greater importance of the first few characters.
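
To illustrate the kind of rescoring described for WinklerRescorer, here is a minimal, self-contained sketch of the standard Winkler prefix adjustment. It is not taken from the com.wcohen.ss source: the class name WinklerRescoreSketch, the method names, the prefix cap of 4 and the scaling factor of 0.1 are all assumptions based on the commonly published Jaro-Winkler formula, and the library's actual behavior may differ.

```java
// Hypothetical sketch; not the com.wcohen.ss.WinklerRescorer implementation.
public class WinklerRescoreSketch {

    // Length of the shared prefix of s and t, capped at `cap` characters.
    static int commonPrefixLength(String s, String t, int cap) {
        int limit = Math.min(cap, Math.min(s.length(), t.length()));
        int i = 0;
        while (i < limit && s.charAt(i) == t.charAt(i)) {
            i++;
        }
        return i;
    }

    // Boost an inner similarity score in [0,1] according to the shared prefix.
    // Scores outside [0,1] are returned unchanged (an assumption of this sketch).
    static double rescore(String s, String t, double innerScore) {
        if (innerScore < 0.0 || innerScore > 1.0) {
            return innerScore;
        }
        int prefix = commonPrefixLength(s, t, 4);   // assumed cap of 4
        return innerScore + prefix * 0.1 * (1.0 - innerScore); // assumed factor 0.1
    }

    public static void main(String[] args) {
        // "MARTHA" and "MARHTA" share the 3-character prefix "MAR".
        System.out.println(rescore("MARTHA", "MARHTA", 0.9));
    }
}
```

In this sketch the boost grows with the length of the shared prefix and shrinks as the inner score approaches 1, so strings that already match well change little.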

_

_acronym - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
 
_alignment - Variable in class com.wcohen.ss.abbvGapsHmm.Acronym
 
_alpha - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsViterbiEvaluator
 
_alpha - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmExpectationEvaluator
 
_bestProb - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsViterbiEvaluator
 
_beta - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsViterbiEvaluator
 
_beta - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmExpectationEvaluator
 
_current - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
 
_currentStringPos - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
 
_currPartialWord - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
 
_currProb - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsViterbiEvaluator
 
_currWord - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
 
_emissionCounters - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmExpectationEvaluator
 
_emissionParams - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
 
_emissionParams - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
 
_emissions - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsViterbiEvaluator
 
_eval_end - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
 
_eval_mat_size - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
 
_eval_start - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
 
_evalMat - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
 
_frequency - Variable in class com.wcohen.ss.abbvGapsHmm.Acronym
 
_gapsHMM - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
 
_length - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
 
_longForm - Variable in class com.wcohen.ss.abbvGapsHmm.Acronym
 
_lParam - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
 
_partialWord - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
 
_partialWordIsAtStart - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
 
_partialWordParam - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
 
_prevLF - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsViterbiEvaluator
 
_prevLF_stringPos - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsViterbiEvaluator
 
_prevSF - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsViterbiEvaluator
 
_prevSF_stringPos - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsViterbiEvaluator
 
_probability - Variable in class com.wcohen.ss.abbvGapsHmm.Acronym
 
_shortForm - Variable in class com.wcohen.ss.abbvGapsHmm.Acronym
 
_sParam - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
 
_states - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsViterbiEvaluator
 
_trainingCorpusFile - Static variable in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
_trainingDataDir - Static variable in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 
_transitionCounters - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmExpectationEvaluator
 
_transitionParams - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
 
_transitionParams - Variable in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
 
_trueLabelsFile - Static variable in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
 

Copyright © 2016. All rights reserved.