Showing posts with label DEDUPE. Show all posts
Showing posts with label DEDUPE. Show all posts

Monday, 24 April 2017

Basics of Entity Resolution with Python and Dedupe by Kyle Rossetti and Rebecca Bilbro via @DistrictDataLab

Great blog by Kyle Rossetti and Rebecca Bilbro explains how to disambiguate records that correspond to real-world entities across and within datasets using the Python dedupe package.

Contains code and examples so you can really understand it and easily replicate in your own work.