What the job actually is
A quick-commerce analyst turns shopper, merchant, and customer event data into answers for the people running a delivery marketplace. A day might include quantifying how much substitution swaps inflate reported GMV, building the three-sided P&L that splits one customer dollar across shopper take, merchant payout, and platform take, normalizing shopper economics for batched dispatches (where one trip fulfills 2–3 orders), reading surge-pricing volume impact for the pricing team, or running merchant-churn forensics. The data shape is a three-party event ledger: per-order fact (order_id) carrying the full fee waterfall, a substitutions fact at substituted-order grain, and a dispatch fact where batch_size links one shopper trip to several orders.
The role varies by employer
Grocery delivery marketplace (Instacart-shape)
Shopper picks from a partner store, substitutions are central (out-of-stock items get swapped at the shelf). The analyst owns substitution economics, basket composition, and the three-sided fee split. Heavy emphasis on GMV integrity and merchant relationships.
Restaurant / prepared-food delivery (DoorDash / Uber Eats-shape)
Less substitution, more dispatch and batching logistics. The analyst owns courier batching efficiency, surge pricing, delivery-time SLAs, and merchant acquisition / churn. Tip allocation and the dasher / courier P&L dominate.
Convenience / q-commerce (Gopuff, Getir-style dark stores)
Platform-owned micro-fulfillment, so the "merchant" is internal and the model collapses toward two-sided. The analyst focuses on shopper batching, delivery-zone economics, and inventory at the dark store. Faster delivery promise (10–20 min) raises the SLA bar.
Marketplace strategy / finance team
Sits closer to corporate finance. Owns the three-sided P&L, contribution-margin bridges, take-rate analysis, and board reporting. Less event-level SQL, more multi-CTE reconciliation and scenario modeling.
Platform data / experimentation team
Data engineering, the dbt event ledger, and the A/B platform behind surge, substitution policy, and batching algorithms. Adjacent to the business-analyst role but modeling-layer focused — a strong path for analysts who want to own the metric definitions.
Skills that actually get hired
SQL (non-negotiable)
- Postgres or Snowflake or BigQuery — marketplace event ledgers live in dbt-modeled stacks.
- Multi-grain joins (order × substitution × dispatch) without double-counting — the batching grain is the classic trap.
- Window functions: NTILE for shopper quality quintiles, cohort-decay grids by months-since-signup, PARTITION BY for per-dispatch normalization.
- Conditional aggregation: AVG(CASE WHEN ...) for completion / approval / refund rates in a single pass.
- Multi-CTE pipelines: the fee waterfall, the three-sided P&L, and the substitution-policy capstone are 2–4 CTEs deep.
- Ratio hygiene: NULLIF on every denominator, COALESCE for zero-activity entities, aggregate-before-join to avoid fan-out.
- Date arithmetic: DATE_TRUNC for monthly cohorts, EXTRACT year/month for months-since differences, hour-of-day for demand curves.
Marketplace domain fluency
- Three-sided economics: customer pays = merchant payout + shopper take + platform take. Knowing which column carries which flow.
- GMV vs net revenue: why substitution swaps make reported GMV overstate what the business actually nets.
- Take rate, contribution margin, and the difference between GMV, net revenue, and platform take.
- Batching: one dispatch fulfills 2–3 orders, so per-order shopper cost double-counts unless normalized by batch_size.
- Surge mechanics: the multiplier inflates fees mechanically, so volume / basket signals matter more than order-value signals.
Ops / growth fluency
- Shopper completion rate, dispatch SLA, and the difference between order-grain and dispatch-grain metrics.
- Substitution approval and refund rates as customer-experience signals.
- Cohort retention and survival — both customer (LTV by exposure) and shopper (attrition decay).
- Merchant churn signals: tenure, substitution rate, and last-order recency.
BI / tooling
- Looker / Tableau — the dashboard layer most marketplaces run.
- dbt — the event-ledger modeling layer; marketplaces have hundreds of models.
- Snowflake / BigQuery — the warehouse standard for high-volume event data.
- Mode / Hex — common for the ad-hoc analysis notebooks marketplace teams live in.
Stakeholder skills
- Translating a finance question ("is our GMV real?") into the substitution-leakage analysis.
- Defending a shopper-pay model against ops when batching normalization changes the numbers.
- Writing the one-table policy recommendation — the substitution-reason tradeoff is the artifact ops leadership wants.
The interview loop
- 1
Recruiter screen (30 min)
Fit check. Be ready to explain why quick-commerce / marketplace specifically (vs retail, vs DTC). Grocery-delivery shops want substitution and basket fluency; restaurant-delivery wants batching and courier economics.
- 2
Hiring manager (45–60 min)
Behavioral + domain depth. Expect a case: "Substitution rate spiked 3 points last month — what would you check?" They're testing whether you reach for GMV leakage, refund exposure, and merchant out-of-stock patterns fluently, and whether you understand the customer pays original / merchant paid sub asymmetry without prompting.
- 3
Technical SQL screen (60–90 min, usually take-home)
A Postgres / Snowflake schema with a three-party order fact + substitutions + dispatches. Common asks: quantify substitution GMV leakage by merchant, build the three-sided fee waterfall, normalize shopper economics for batching. Edge cases matter — "shopper_payout is per dispatch but you summed it per order; what breaks?"
- 4
Cross-functional round (finance / ops / pricing)
A non-analyst asks you to defend a metric. Finance will probe whether your P&L reconciles (customer_total = the three takes); Ops will challenge your batching normalization; Pricing will press on whether surge volume impact is real demand or a fee artifact.
- 5
Director / VP round
Strategy and prioritization. Common ask: "If you had 8 weeks as our new marketplace analyst, what would you build first?" The strong answer is "the reconciled three-sided P&L with the substitution-leakage adjustment, before any new analysis" — the artifact every board review needs.
Questions you’ll actually be asked
- “How does a substitution inflate GMV, and how do you quantify it?”
- When an item is out of stock, the shopper swaps a cheaper one but the customer still pays the original price; the merchant is only paid for the cheaper picked item. So customer_charged > merchant_paid on the substitution row. GMV leakage = SUM(customer_charged - merchant_paid). Reported GMV (customer-side) overstates the merchant-settled value by exactly this gap. M11 builds it per merchant; M24 carries it into the category P&L.
- “Walk me through the three-sided fee waterfall.”
- Start from SUM(customer_total) on delivered orders, subtract SUM(merchant_payout), subtract SUM(shopper_take), and the remainder is SUM(platform_take). Because the per-order identity customer_total = merchant_payout + shopper_take + platform_take holds by construction, a residual_check row should land at ~0. The classic mistake is adding a separate tip line — tip_amount is already inside shopper_take. M18 is the canonical mission.
- “Why does naive per-order shopper pay double-count, and how do you fix it?”
- shopper_payout is paid once per dispatch, but a dispatch can fulfill 2–3 orders (batch_size). If you join dispatches to orders and SUM shopper_payout per order, a triple batch counts the payout three times. The fix is to keep dispatch grain and divide SUM(shopper_payout) by SUM(batch_size) for true per-order economics. M12 surfaces the bug; M16 normalizes it into the quality score.
- “How do you read surge pricing impact on volume?”
- Split delivered orders into surge (surge_multiplier > 1.0) vs normal and compare order_count, avg_order_value, and avg_units. The trap: avg_order_value is mechanically higher under surge because surge_fee inflates customer_total — that is not evidence surge is harmless. The honest signal is whether avg_units or volume share softens, which reflects real demand suppression. M19 is the mission.
- “How do you build a shopper attrition cohort-decay grid?”
- Derive a DISTINCT (shopper_id, active_month) set from dispatch activity, join each shopper to their signup cohort_month, and compute months_since as (year diff)*12 + (month diff) on the truncated dates. COUNT(DISTINCT shopper_id) per (cohort_month, months_since) yields the survivor curve. The grid is triangular — recent cohorts have fewer months observed. M22 is the canonical mission; subtracting text dates instead of computing the month gap is the common failure.
- “What separates a substitution reason worth tightening from one worth keeping?”
- Score each reason on three levers: GMV leakage (SUM of the price gap), reject rate (customer_approved = 0), and refund rate (order_status = refunded, which requires the join to fact_orders). High-leakage, high-rejection reasons are the policies to tighten; a high-volume but clean like-for-like swap is the one you keep. M25 is the policy capstone.
What it pays
| Level | Range | Notes |
|---|---|---|
| Entry-level (0–2 yr) | $80k–$115k | Marketplace / gig-economy companies pay above generic retail because of the data-volume and experimentation intensity. Hybrid is common in the major hubs (SF, NYC, Seattle, Chicago). |
| Mid-level (2–4 yr) | $110k–$155k | Marketplace Analyst, Operations Analytics, Pricing / Growth Analytics. Equity / RSU is a meaningful component at the public players (DoorDash, Uber, Instacart). |
| Senior Analyst (4–7 yr) | $145k–$200k | Senior Marketplace Analyst, Senior Strategy & Ops. Three-sided P&L ownership and experimentation fluency command the top of the band. |
| Manager / Lead (5–8 yr) | $175k–$240k | Running an analytics pod for a function (shopper economics, merchant growth, pricing). Owns the metric layer and the experiment readouts for that surface. |
| Director Analytics / Strategy & Ops | $210k–$320k | Top-of-house at large marketplaces. Owns the three-sided unit-economics model and the analyst hiring bar. Cash plus substantial equity. |
| VP Analytics / Head of Marketplace | $260k–$450k+ | Public marketplace players (DoorDash, Uber, Instacart). Equity dominates total comp. Cross-functional reach into pricing, ops, and corporate finance. |
Certifications — honest take
dbt Analytics Engineering certification
Gold standardMarketplaces run dbt over enormous event ledgers. The cert is signal, but a public sample dbt project modeling a three-sided order ledger is worth more.
Looker LookML developer certification
Nice to haveUseful at marketplaces on Looker. Lower signal than demonstrable SQL fluency on multi-grain marketplace data.
SnowPro Core (Snowflake)
Nice to haveRelevant at the Snowflake-based marketplaces. Demonstrates warehouse fluency without being mandatory.
A/B testing / experimentation course (e.g. Udacity, Reforge)
Nice to haveMarketplaces are experimentation-heavy (surge, substitution policy, batching algorithms). Causal-inference fluency differentiates senior candidates.
Google Analytics 4 / GA4 certification
SkipMarketplace analytics relies on the company event ledger, not GA4. Skip in favor of dbt or experimentation skills.
How long it takes
E-commerce or BI analyst with basic SQL: 4–8 months prep — author 2–3 portfolio analyses on a fictional delivery marketplace, get fluent with the substitution-leakage and three-sided-P&L walks, apply to mid-stage marketplace roles. Retail / ops analyst transitioning: 2–4 months — your domain transfers; the gap is the three-party fee math and the batching grain. Non-analyst with a quantitative degree: 9–14 months prep — start at a smaller marketplace or a strategy & ops role, learn the event ledger on the job, transition into a senior marketplace analyst role within 18–24 months.
Common mistakes to avoid
- Summing per-order shopper_take to get shopper cost on batched dispatches. shopper_payout is per dispatch — divide by SUM(batch_size), or you triple-count a triple batch. M12 is built around this.
- Reading reported GMV as net revenue. Substitution swaps make customer-side GMV overstate merchant-settled value by SUM(customer_charged - merchant_paid). M11 quantifies the gap.
- Adding a separate tip line to the fee waterfall. tip_amount is already inside shopper_take — a tip step double-counts and the residual stops reconciling. M18 traps this.
- Treating higher surge order value as proof surge is fine. surge_fee inflates customer_total mechanically; the real signal is avg_units / volume. M19 is the mission.
- Computing months-since-signup by subtracting TEXT dates. Always use (year diff)*12 + (month diff) on truncated dates, or the cohort-decay grid is nonsense. M22.
- Joining substitutions before aggregating in the P&L. That fans each order out by its substitution rows and inflates every money column — pre-aggregate leakage to order grain first. M24.
- INNER JOINing orders to substitutions when you want all merchants / customers. Zero-substitution entities vanish; LEFT JOIN + COALESCE keeps them. M7, M23.
- Forgetting NULLIF on rate denominators. A merchant with zero orders errors the whole substitution-rate query. M7.
- Counting substitutions with COUNT(*) over a LEFT JOIN instead of COUNT(substitution_id). COUNT(*) counts the order rows (always 1+) and inflates the rate. M23.
- Over-indexing on Python early. Quick-commerce analytics rewards rock-solid multi-grain SQL + marketplace-economics fluency first; Python is the later lever.
The trajectory
| Stage | Years | Comp |
|---|---|---|
| Marketplace Analyst I | 0–2 yr | $80k–$115k |
| Marketplace / Strategy & Ops Analyst | 2–4 yr | $110k–$155k |
| Senior Marketplace Analyst | 4–7 yr | $145k–$200k |
| Manager / Lead | 5–8 yr | $175k–$240k |
| Director Analytics / Strategy & Ops | 8–12 yr | $210k–$320k |
| VP Analytics / Head of Marketplace | 12+ yr | $260k–$450k+ |
How the caseSQL curriculum maps to this
The caseSQL Quick Commerce path can't hand you Instacart's actual ledger or DoorDash's batching algorithm — but the schema is the right shape, the gotchas are the same gotchas (substitution GMV leakage, the batching grain, the three-party fee identity, surge-fee inflation, cohort decay), and the canonical missions force the SQL the hiring loop screens on. The Hard tier (M11–M15) approximates the live SQL screen; the Expert and Master tiers (M16–M25) are take-home territory. M21 — the reconciled three-sided P&L — is the question you'll be asked in some form on every senior marketplace analyst loop.