Practice the SQL patterns clinical research analysts and trial sponsors actually run: enrollment cohort analysis, arm balance checks, MedDRA-coded adverse event signal detection, visit adherence drift, and Kaplan-Meier-style retention curves. Real-world terminology — NCT-format trial IDs, CTCAE v5 severity grades 1-5, ATC drug classes, and MedDRA Preferred Terms — on a fully isolated 7-table dataset modeled on real phase 2 / phase 3 protocols.
25 missions · 7 tables · authentic NCT-format trial IDs, MedDRA, CTCAE v5
Read the briefing
A Slack message from your manager
Explore the schema
5 tables in a star schema
Write your query
Full SQL editor with autocomplete
Get expert feedback
Graduated hints, not just pass/fail
NCT-format trial IDs, MedDRA Preferred Terms for adverse events, CTCAE v5 severity grades, ATC drug classes. The vocabulary every clinical research analyst, biostatistician, and trial sponsor uses daily.
Arm balance checks, screen-fail rates, eligibility strata, and per-arm safety comparisons — the SQL patterns that drive every interim analysis and DSMB report.
Severity-by-arm cross-tabs, drug-vs-placebo AE rate deltas, and SAE cohort intersection — the analytics that flag a signal before the next protocol amendment.
Multi-CTE retention curves, sponsor portfolio scorecards, and Kaplan-Meier-style survival analysis. “Built SQL retention models on phase 3 trial data” lands cleanly with sponsor analytics, CRO, and biotech hiring managers.
Each mission is a real request from someone at the company. Difficulty increases as you go.
Clinical trial dataset modeled on real phase 2 and phase 3 protocols. 12 active trials across cardiology, oncology, neurology, and endocrinology. NCT-format trial IDs, real MedDRA Preferred Terms (Nausea, Neutropenia, Pyrexia, etc.), CTCAE v5 severity grades 1–5, and ATC drug classes. Planted patterns include Trial 1’s severe arm imbalance (M6), Trial 3’s ~37% screen-fail rate (M13), three patients enrolled in two trials each (M11), and three enrollments seeded with multiple SAEs (M17). Patient_id space is fully independent of every other path.
●dimension tables ● fact tables
The patterns trial sponsors, CROs, and biotech analytics teams interview on — on a real-shaped trial dataset.
Looking for something different?