Data CleaningMedium

What are your strategies for deduplicating records?

Seen at: Salesforce · HubSpot · Stripe

You've been given a users table where the same person may appear multiple times under slightly different email addresses or slightly different names. How do you approach deduplication?

Draft your answer

Saved to this browser only. Try it before you peek at the model answer.

Stuck? Peek at a hint

Reveals the model answer and the self-score rubric.

More data cleaning questions