Fuzzy String Matching
Microsoft Learn: Fuzzy Merge in Power Query
Also listed on Fuzzy String Matching.
This is a good reference for token-based fuzzy matching in a real workflow rather than in the abstract. It explains Jaccard similarity, thresholds, and preprocessing controls in the context of fuzzy merge operations, which makes the strengths of set-based matching concrete and easy to reason about.
Its scope is narrower than the Splink material because it is tied to Power Query, but that narrowness is also the benefit. The examples make it clear when token overlap works well, how normalization choices affect outcomes, and why preprocessing can matter as much as the similarity function itself in messy data-cleaning pipelines.