This is the Java implementation of SimString algorithm
http://www.chokkan.org/software/simstring/
SimString is an approximate string matching algorith. Given a query string, it looks for approximately similar string in a dictionary. Supported similarity functions include:
- Jaccard
- Cosine
SimString can handle a dictionary whose size of millions of unique words.