Data Cleaning · Medium

How do you detect and decide what to do with outliers?

Seen at: Stripe · Amazon · Shopify

Your dataset of transaction amounts has a few values 100x the median. Should you remove them, cap them, transform them, or keep them?
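
As a starting point for reasoning about the four options, here is a minimal sketch that flags candidates with Tukey's IQR fences and then applies each treatment so the results can be compared. The sample amounts, the helper names `iqr_outlier_mask` and `winsorize_upper`, the fence multiplier `k=1.5`, and the 80th-percentile cap are all illustrative assumptions, not values taken from the question. Which treatment is appropriate depends on whether the extreme values are errors or genuine large transactions.

```python
import numpy as np


def iqr_outlier_mask(values, k=1.5):
    """Return a boolean mask of values outside [Q1 - k*IQR, Q3 + k*IQR]."""
    values = np.asarray(values, dtype=float)
    q1, q3 = np.percentile(values, [25, 75])
    iqr = q3 - q1
    return (values < q1 - k * iqr) | (values > q3 + k * iqr)


def winsorize_upper(values, upper_pct=99.0):
    """Cap values above the given percentile at that percentile."""
    values = np.asarray(values, dtype=float)
    cap = np.percentile(values, upper_pct)
    return np.minimum(values, cap)


# Hypothetical transaction amounts: two values sit at roughly 100x the median.
amounts = np.array([12.0, 15.0, 18.0, 20.0, 22.0, 25.0, 30.0, 35.0, 2350.0, 2500.0])

mask = iqr_outlier_mask(amounts)                   # detect
removed = amounts[~mask]                           # option 1: remove flagged rows
capped = winsorize_upper(amounts, upper_pct=80.0)  # option 2: cap (winsorize)
logged = np.log1p(amounts)                         # option 3: transform (log scale)
kept = amounts                                     # option 4: keep as-is

print("flagged:", amounts[mask])
print("mean, keep vs remove:", kept.mean(), removed.mean())
print("median, keep vs remove:", np.median(kept), np.median(removed))
```

Comparing the mean and median before and after each treatment shows how sensitive a downstream statistic is to the flagged values, which is often the deciding factor. The flag itself is only a prompt to investigate: a large but legitimate transaction is usually kept (or modeled on a log scale), while a data-entry error is corrected or removed.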

