Clinical Trials & Research Analytics Path

The protocol says enroll 200.
You’re at 47. The board meets Tuesday.

Practice the SQL patterns clinical research analysts and trial sponsors actually run: enrollment cohort analysis, arm balance checks, MedDRA-coded adverse event signal detection, visit adherence drift, and Kaplan-Meier-style retention curves. Real-world terminology — NCT-format trial IDs, CTCAE v5 severity grades 1-5, ATC drug classes, and MedDRA Preferred Terms — on a fully isolated 7-table dataset modeled on real phase 2 / phase 3 protocols.

See All Missions

25 missions · 7 tables · authentic NCT-format trial IDs, MedDRA, CTCAE v5

How It Works

1

Read the briefing

A Slack message from your manager

2

Explore the schema

5 tables in a star schema

3

Write your query

Full SQL editor with autocomplete

4

Get expert feedback

Graduated hints, not just pass/fail

Why This Path

Authentic Trial Terminology

NCT-format trial IDs, MedDRA Preferred Terms for adverse events, CTCAE v5 severity grades, ATC drug classes. The vocabulary every clinical research analyst, biostatistician, and trial sponsor uses daily.

Protocol & Cohort Analysis

Arm balance checks, screen-fail rates, eligibility strata, and per-arm safety comparisons — the SQL patterns that drive every interim analysis and DSMB report.

Safety Signal Detection

Severity-by-arm cross-tabs, drug-vs-placebo AE rate deltas, and SAE cohort intersection — the analytics that flag a signal before the next protocol amendment.

Resume-Grade Capstone

Multi-CTE retention curves, sponsor portfolio scorecards, and Kaplan-Meier-style survival analysis. “Built SQL retention models on phase 3 trial data” lands cleanly with sponsor analytics, CRO, and biotech hiring managers.

The Missions

Each mission is a real request from someone at the company. Difficulty increases as you go.

Easy5 missions
Medium5 missions
Hard5 missions
Expert5 missions
Master5 missions

The Database

Clinical trial dataset modeled on real phase 2 and phase 3 protocols. 12 active trials across cardiology, oncology, neurology, and endocrinology. NCT-format trial IDs, real MedDRA Preferred Terms (Nausea, Neutropenia, Pyrexia, etc.), CTCAE v5 severity grades 1–5, and ATC drug classes. Planted patterns include Trial 1’s severe arm imbalance (M6), Trial 3’s ~37% screen-fail rate (M13), three patients enrolled in two trials each (M11), and three enrollments seeded with multiple SAEs (M17). Patient_id space is fully independent of every other path.

dim_trials (12)dim_protocols (27)dim_sites (15)dim_trial_patients (520)fact_enrollments (433)fact_trial_visits (~2,500)fact_adverse_events (~250)

dimension tables   fact tables

Build clinical trial SQL on your resume

The patterns trial sponsors, CROs, and biotech analytics teams interview on — on a real-shaped trial dataset.

Looking for something different?

|