Tag Page

Levenshtein distance

1 source across the archive use this tag. The list below groups them by source while keeping the original topic context visible.

Back to all tags

Fuzzy String Matching

Splink: String Comparators

Also listed on Fuzzy String Matching.

This is one of the clearest compact overviews of common string comparators in practical use. It covers Levenshtein, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Jaccard in one place, which makes it especially useful when you need to compare what each metric is actually sensitive to rather than reading isolated algorithm descriptions.

Its main strength is readability. The page is implementation-oriented because it is part of toolkit documentation, but that also makes it pragmatic: it helps you connect the abstract metric to the kinds of matching problems it handles well, such as transpositions, typos, or token overlap. It is a strong first reference when you want a technical overview before deciding which comparator deserves deeper study.