Data CleaningVery Hard

How do you maintain referential integrity at 100B-row scale?

Seen at: Meta · Stripe · Uber

At your scale, database foreign keys are too expensive. How do you ensure referential integrity anyway?

Draft your answer

Saved to this browser only. Try it before you peek at the model answer.

Reveals the model answer and the self-score rubric.

More data cleaning questions