2e59643f1b
Implements an approximate string matching version of R's native 'match' function. Can calculate various string distances based on edits (Damerau-Levenshtein, Hamming, Levenshtein, optimal sting alignment), qgrams (q- gram, cosine, jaccard distance) or heuristic metrics (Jaro, Jaro-Winkler). An implementation of soundex is provided as well. Distances can be computed between character vectors while taking proper care of encoding or between integer vectors representing generic sequences. This package is built for speed and runs in parallel by using 'openMP'. An API for C or C++ is exposed as well.
9 lines
602 B
Text
9 lines
602 B
Text
Implements an approximate string matching version of R's native
|
|
'match' function. Can calculate various string distances based on
|
|
edits (Damerau-Levenshtein, Hamming, Levenshtein, optimal sting
|
|
alignment), qgrams (q- gram, cosine, jaccard distance) or heuristic
|
|
metrics (Jaro, Jaro-Winkler). An implementation of soundex is provided
|
|
as well. Distances can be computed between character vectors while
|
|
taking proper care of encoding or between integer vectors representing
|
|
generic sequences. This package is built for speed and runs in
|
|
parallel by using 'openMP'. An API for C or C++ is exposed as well.
|