What is cross-marketplace cannibalization?

Same product listed on multiple marketplaces (different listing_ids, same product_title). It splits inventory, breaks price coherence, and complicates strategy. Detection is GROUP BY product_title with COUNT(DISTINCT marketplace_id) >= 2 — the listings that appear on more than one platform are the cannibalization cohort. Mission #18 surfaces it; #24 quantifies the per-side revenue split.

How is ROAS computed at listing grain?

Sum ad_fee from fact_settlements WHERE ad_fee > 0 (only ad-attributed orders), grouped by listing. Pair with revenue (sum of gross_amount on the same orders), divide revenue by ad_spend with NULLIF guard for divide-by-zero. Apply HAVING SUM(ad_fee) >= $50 minimum to avoid statistically thin listings dominating the top-ROAS list. Mission #14 walks the canonical version.

Marketplace Seller SQL Interview Questions

Marketplace analytics interviews are not generic SQL screens. They test whether you can reconcile gross GMV against actual seller payout across a stack of fees, surface Buy Box loss windows that destroy daily revenue without a price change, identify cross-marketplace cannibalization where the same SKU is listed twice and split units across platforms, and build a board-pack waterfall that walks orders to fees to refunds to net cash. This guide walks through the SQL skills multi-marketplace sellers and platform-side analytics teams get asked about, with worked examples on an Amazon + Etsy + eBay-shaped schema and a 25-mission path that drills each pattern.

If you are interviewing for a marketplace operations seat at a brand running on Amazon Seller Central / Etsy / eBay, an analytics role at a marketplace platform, or a 3PL / aggregator with multi-marketplace inventory, the patterns below are the ones that show up in take-home assignments and live whiteboards.

Start the Marketplace path →

What marketplace data actually looks like

Marketplace seller warehouses are organized around a fundamental reconciliation problem: GMV (gross merchandise value, the order amount) doesn’t equal payout (the cash the seller actually receives). Between gross and net sit four fee categories — marketplace fee (the platform’s take, 6.5–15% depending on platform), ad fee (sponsored-listing spend), fulfillment fee (FBA / shipping / pick-pack-ship), and refund amount (orders that came back). The job is to write the queries that walk gross to net cleanly, per marketplace, per month, per listing.

The caseSQL Marketplace database models that exact shape: fact_orders (~25K rows, GMV at the order grain), fact_settlements (~25K rows, 1:1 with orders, the four fee categories + net_payout), fact_listing_views (~108K rows, view stream for CTR analytics with organic / sponsored / external sources), and fact_buybox_snapshots (~10K rows, hourly Buy Box win-rate samples for Amazon listings only).

Around the facts sit two dimensions: dim_marketplaces (3 rows: Amazon at 15% take + FBA, Etsy at 6.5% take, eBay at 10% take), and dim_listings (~150 listings across 6 categories — Home Goods / Apparel / Beauty / Kitchen / Electronics / Outdoors). Listings include 20 deliberate cross-marketplace twins (same product_title, different listing_ids, different marketplaces) — the cannibalization signal you’ll detect in M18.

Question patterns you’ll be asked

Across multi-marketplace seller and platform-side interviews, six patterns dominate. Each one has a distinct SQL shape, and once you can recognize the shape you can almost always write the query. The worked examples below are deliberately adjacent to the exercises in the mission path — same skill, different angle — so you can read the example, then go practice the variant.

Settlement reconciliation (GMV ≠ payout)

“Why is your payout $215K when your gross was $312K?” is the most-asked question in marketplace interviews. The answer is the settlement reconciliation walk: gross minus marketplace_fee minus ad_fee minus fulfillment_fee minus refund_amount equals net_payout. The skill is aggregating four fee categories side-by-side and proving the equation balances.

Worked example, all-time settlement summary:

SELECT 'all' AS scope,
       ROUND(SUM(gross_amount)::numeric, 2) AS gross_total,
       ROUND(SUM(marketplace_fee)::numeric, 2) AS marketplace_fee_total,
       ROUND(SUM(ad_fee)::numeric, 2) AS ad_fee_total,
       ROUND(SUM(fulfillment_fee)::numeric, 2) AS fulfillment_fee_total,
       ROUND(SUM(refund_amount)::numeric, 2) AS refund_total,
       ROUND(SUM(net_payout)::numeric, 2) AS payout_total
FROM   fact_settlements;

Practice this in Mission #16 — and Mission #21 extends it to the per-(marketplace, month) three-way reconciliation report the CFO uses for audit-grade evidence.

Buy Box loss-window detection

Amazon-specific: ~80–90% of Amazon sales come through the Buy Box (the default buy button). Losing the Buy Box for several hours destroys daily revenue without you changing your price — a competitor underbid you. Detection is conditional aggregation on hourly snapshots, with a HAVING threshold to surface listings with multi-hour loss runs.

Worked example, listings that lost the Buy Box for at least 5 hours:

SELECT l.listing_id, l.product_title,
       COUNT(*) AS total_hours,
       SUM(CASE WHEN b.won_buybox_pct < 0.50 THEN 1 ELSE 0 END) AS loss_hours,
       ROUND(100.0 * SUM(CASE WHEN b.won_buybox_pct < 0.50 THEN 1 ELSE 0 END) / COUNT(*), 1) AS loss_pct
FROM   fact_buybox_snapshots b
       JOIN dim_listings l ON l.listing_id = b.listing_id
GROUP  BY l.listing_id, l.product_title
HAVING SUM(CASE WHEN b.won_buybox_pct < 0.50 THEN 1 ELSE 0 END) >= 5
ORDER  BY loss_pct DESC, l.listing_id ASC;

Practice this in Mission #17 — paired with Mission #23’s defense playbook ranking listings by loss volume + competitor count to prioritize the repricer-tool rollout.

Cross-marketplace cannibalization

Same product, two marketplaces. The seller listed it twice (different ASINs, same product). Now the inventory pools split, the price coherence breaks, and the channel-strategy team needs to know which products are cannibalized. Detection is a self-join on product_title with a marketplace_id mismatch.

Worked example, finding products listed on multiple marketplaces:

SELECT l.product_title,
       COUNT(DISTINCT l.marketplace_id) AS marketplace_count,
       SUM(o.units) AS total_units
FROM   fact_orders o
       JOIN dim_listings l ON l.listing_id = o.listing_id
GROUP  BY l.product_title
HAVING COUNT(DISTINCT l.marketplace_id) >= 2
ORDER  BY total_units DESC, l.product_title ASC;

Practice this in Mission #18 — and Mission #24 takes it deeper with per-side revenue split, surfacing pairs where one marketplace dominates 80%+ (a delisting candidate).

ROAS at listing grain

Marketplace-side ad spend lives on the settlement record (sponsored listings on Amazon, promoted listings on Etsy, promoted on eBay). ROAS at listing grain answers “which products are advertising profitably?” — pair it with a HAVING threshold to skip listings with statistically thin ad spend.

Worked example, top 20 listings by ROAS with ad spend ≥ $50:

SELECT l.listing_id, l.product_title,
       ROUND(SUM(s.ad_fee)::numeric, 2) AS ad_spend,
       ROUND(SUM(s.gross_amount)::numeric, 2) AS revenue,
       ROUND((SUM(s.gross_amount) / NULLIF(SUM(s.ad_fee), 0))::numeric, 2) AS roas
FROM   fact_settlements s
       JOIN fact_orders o ON o.order_id = s.order_id
       JOIN dim_listings l ON l.listing_id = o.listing_id
WHERE  s.ad_fee > 0
GROUP  BY l.listing_id, l.product_title
HAVING SUM(s.ad_fee) >= 50
ORDER  BY roas DESC, l.listing_id ASC LIMIT 20;

Practice this in Mission #14 — and Mission #19 extends it to the per-marketplace ad-attribution waterfall the marketing team uses for budget reallocation.

Dead-stock detection (anti-join)

Listings with view traffic but zero orders are budget bleeders — especially when the views are sponsored. Detection is a classic anti-join: dim_listings where listing_id NOT IN (SELECT DISTINCT listing_id FROM fact_orders).

Worked example, sponsored-view dead-zone listings:

SELECT l.listing_id, l.product_title, COUNT(*) AS sponsored_views, 0 AS orders
FROM   fact_listing_views v
       JOIN dim_listings l ON l.listing_id = v.listing_id
WHERE  v.source = 'sponsored'
  AND  l.listing_id NOT IN (SELECT DISTINCT listing_id FROM fact_orders)
GROUP  BY l.listing_id, l.product_title
HAVING COUNT(*) >= 50
ORDER  BY sponsored_views DESC, l.listing_id ASC;

Practice this in Mission #15 — for the broader case (any source) — and Mission #20 for the sponsored-only refinement that drives the pause-or-relist decision.

Per-marketplace P&L capstone

The board-pack capstone: per marketplace, last 12 months, the full waterfall — orders, gross, fees, refunds, net payout, margin pct. Six aggregates against a three-table join, scoped by settlement date.

Worked example, year-end capstone:

SELECT m.marketplace_name, COUNT(*) AS orders,
       ROUND(SUM(s.gross_amount)::numeric, 2) AS gross_revenue,
       ROUND(SUM(s.marketplace_fee + s.ad_fee + s.fulfillment_fee)::numeric, 2) AS total_fees,
       ROUND(SUM(s.refund_amount)::numeric, 2) AS refund_total,
       ROUND(SUM(s.net_payout)::numeric, 2) AS net_payout,
       ROUND((100.0 * SUM(s.net_payout) / SUM(s.gross_amount))::numeric, 1) AS margin_pct
FROM   fact_settlements s
       JOIN fact_orders o ON o.order_id = s.order_id
       JOIN dim_marketplaces m ON m.marketplace_id = o.marketplace_id
WHERE  s.settle_ts >= '2025-04-01'
GROUP  BY m.marketplace_name
ORDER  BY gross_revenue DESC, m.marketplace_name ASC;

Mission #25 is the path capstone — it composes M7 + M16 + M19 + M22 patterns into one query the CEO presents quarterly.

Gotchas that cost you the offer

Six mistakes show up reliably on marketplace seller analytics screens. They’re the difference between a candidate who has “done some SQL” and a candidate who has worked on a real multi-marketplace warehouse.

1. Reporting GMV as “revenue”

GMV is what customers paid; revenue (to the seller) is what the marketplace paid out. The four fee categories — marketplace_fee + ad_fee + fulfillment_fee + refund_amount — net out 18-27% of GMV before the seller sees a dollar. Conflating GMV and net revenue inflates margin estimates by exactly that much.

2. Forgetting refund_amount in fee aggregations

Refunds are economically equivalent to fees from the seller’s perspective (revenue you don’t keep). Most analysts remember the three platform fees (marketplace, ad, fulfillment) and forget refund_amount, making the gross-fees-payout reconciliation off by ~5-10%. Always include all four.

3. Date-filtering on order_ts when the question is cash-flow

Orders placed in late March settle in early April; if you filter onorder_ts >= ‘2025-04-01’ you’ll miss those settlements (cash-flow attribution belongs to settle month). The opposite mistake — filtering settle_ts when the question is product velocity — pulls in orders placed before the window. Match the date filter to the question.

4. Treating Buy Box wins as binary instead of percentage

won_buybox_pct is the fraction of an hour you held the Buy Box, not a 0/1 flag. Filteringwon_buybox_pct < 0.50 identifies hours where you lost more than half. Some interview answers treat it as binary (won or lost) — wrong granularity.

5. Identifying cross-marketplace cannibalization by listing_id

Cross-marketplace twins have DIFFERENT listing_ids (each marketplace assigns its own ID) but the SAME product_title or SKU mapping. Grouping by listing_id shows every listing once with marketplace_count = 1 — the cannibalization signal never appears. Group by product_title (or by an external SKU table when the warehouse has one).

6. ROAS computed across ALL orders, not ad-attributed

Only ~5% of orders typically carry sponsored attribution. Computing ROAS as total_revenue / total_ad_spend across all orders is meaningless — most of that revenue isn’t ad-driven. The right metric isolates orders wheread_fee > 0 and divides their revenue by ad spend.

The 25-mission curriculum

The path is structured to walk from listing-catalog and order-shape fundamentals (free tier, M1–M10) through Buy Box and settlement-drift mechanics (Hard tier, M11–M15), the unit-economic constructions every seller-side CFO ships (Expert tier, M16–M20), and the multi-CTE capstones every senior marketplace analyst writes (Master tier, M21–M25). M25 is the per-marketplace P&L capstone — the metric every marketplace operator presents quarterly.

Free tier (M1–M10)

Listings, orders, refunds. SELECT, JOIN, GROUP BY, conditional aggregation, first window function (SUM OVER for percent-of-total in M9).

Hard tier (M11–M15)

Buy Box, settlement drift, ROAS at listing grain, dead-stock detection. Three-table joins; first NOT IN anti-join.

Expert tier (M16–M20)

Settlement reconciliation across 4 fee types (M16), Buy Box loss-window cohort (M17), cross-marketplace cannibalization (M18), ad-attribution waterfall (M19), sponsored-views dead-zone (M20). Walkthroughs on every mission.

Master tier (M21–M25)

Three-way reconciliation per (marketplace, month) (M21), profitable-listing ranking (M22), Buy Box defense playbook (M23), cross-marketplace pair revenue split (M24), and the year-end capstone (M25).

Frequently asked

What is settlement reconciliation in marketplace SQL?

Settlement reconciliation walks gross order amount through the four fee categories (marketplace_fee, ad_fee, fulfillment_fee, refund_amount) to net payout. In SQL it’s parallel SUM aggregates against fact_settlements, often joined to fact_orders for the marketplace dimension. Mission #16 is the all-time version; #21 is the per-(marketplace, month) audit-grade variant.

Why do Amazon listings have Buy Box snapshots but Etsy and eBay don’t?

Buy Box is an Amazon mechanic: when a product has multiple sellers, the platform picks one for the default purchase button. ~80–90% of Amazon sales go through the Buy Box. Etsy and eBay don’t have an equivalent mechanic — Etsy is listing-by-listing handmade-style, eBay is auction or fixed-price per listing. The fact_buybox_snapshots table is Amazon-specific by design.

How do you handle date filtering across order_ts vs settle_ts?

Match the filter to the question. Cash-flow questions (CFO, audit, board) belong on settle_ts. Product-velocity questions (merchandising, listing manager) belong on order_ts. Some questions need both — order velocity in month X reconciled against the cash that landed in month X+1 — and require a three-way join with explicit timing logic.

What companies hire on this SQL profile?

Multi-marketplace seller brands (Anker, Public Goods, Pattern, Thrasio-style aggregators), platform-side analytics teams (Amazon Seller Central / Etsy / eBay Marketplace Insights), 3PL companies that handle multi-marketplace inventory (ShipBob, Deliverr/ShipBob, Quiet Logistics), and any DTC brand with meaningful Amazon presence (most of them, post-2022). The skills also transfer to platform-side roles at Shopify (which now offers a Marketplace product) and Walmart Marketplace.

What does a marketplace seller analytics SQL test look like?

Most are 60–90-minute take-homes against an Amazon Seller Central-shaped schema. Expect three or four tasks: a listings inventory roll-up, a settlement reconciliation with all four fee categories, a Buy Box win-rate analysis, and one stretch question that’s either cross-marketplace cannibalization or per-listing ROAS attribution.

Ready to drill the patterns?

Twenty-five missions, real Postgres schema, instant feedback. Walk from browse the listings tablethrough Buy Box loss-window detection and the capstone year-end P&L every senior marketplace analyst writes.

Start the Marketplace path →

You sold $312K. Amazon paid you $241K.
Where did the other $71K go? Settlement reconciliation by Monday.

How It Works

Why This Path

Buy Box Dynamics in SQL

GMV ≠ Payout

Cross-Marketplace Cannibalization

Listing-Grain Ad Attribution

The Missions

The Database

You sold $312K. Amazon paid you $241K.

Marketplace Seller SQL Interview Questions

What marketplace data actually looks like

Question patterns you’ll be asked

Settlement reconciliation (GMV ≠ payout)

Buy Box loss-window detection

Cross-marketplace cannibalization

ROAS at listing grain

Dead-stock detection (anti-join)

Per-marketplace P&L capstone

Gotchas that cost you the offer

1. Reporting GMV as “revenue”

2. Forgetting refund_amount in fee aggregations

3. Date-filtering on order_ts when the question is cash-flow

4. Treating Buy Box wins as binary instead of percentage

5. Identifying cross-marketplace cannibalization by listing_id

6. ROAS computed across ALL orders, not ad-attributed

The 25-mission curriculum

Free tier (M1–M10)

Hard tier (M11–M15)

Expert tier (M16–M20)

Master tier (M21–M25)

Frequently asked

What is settlement reconciliation in marketplace SQL?

Why do Amazon listings have Buy Box snapshots but Etsy and eBay don’t?

How do you handle date filtering across order_ts vs settle_ts?

What companies hire on this SQL profile?

What does a marketplace seller analytics SQL test look like?

Ready to drill the patterns?

You sold $312K. Amazon paid you $241K.Where did the other $71K go? Settlement reconciliation by Monday.

How It Works

Why This Path

Buy Box Dynamics in SQL

GMV ≠ Payout

Cross-Marketplace Cannibalization

Listing-Grain Ad Attribution

The Missions

The Database

You sold $312K. Amazon paid you $241K.

Marketplace Seller SQL Interview Questions

What marketplace data actually looks like

Question patterns you’ll be asked

Settlement reconciliation (GMV ≠ payout)

Buy Box loss-window detection

Cross-marketplace cannibalization

ROAS at listing grain

Dead-stock detection (anti-join)

Per-marketplace P&L capstone

Gotchas that cost you the offer

1. Reporting GMV as “revenue”

2. Forgetting refund_amount in fee aggregations

3. Date-filtering on order_ts when the question is cash-flow

4. Treating Buy Box wins as binary instead of percentage

5. Identifying cross-marketplace cannibalization by listing_id

6. ROAS computed across ALL orders, not ad-attributed

The 25-mission curriculum

Free tier (M1–M10)

Hard tier (M11–M15)

Expert tier (M16–M20)

Master tier (M21–M25)

Frequently asked

What is settlement reconciliation in marketplace SQL?

Why do Amazon listings have Buy Box snapshots but Etsy and eBay don’t?

How do you handle date filtering across order_ts vs settle_ts?

What companies hire on this SQL profile?

What does a marketplace seller analytics SQL test look like?

Ready to drill the patterns?

You sold $312K. Amazon paid you $241K.
Where did the other $71K go? Settlement reconciliation by Monday.