Design Sprint Test and Score: Storevine Brief Interface Validation
Scenario
Friday 2026-07-17. Trial run passed Thursday 17:00 PT. 5 merchandisers completed the full Five-Act with counterbalanced A/D presentation Friday 09:00-16:30 PT. The team invokes tool-design-sprint-test-and-score to capture observations, score the rumble result, and frame Mei’s call.
Per-Customer Interview Observation Notes
Customer 1: Bayfront-Merch-A (09:00 slot; design partner; A first)
Profile: Outdoor retailer; 14 stores; merchandiser 6 years.
Context: Opens email Mon 07-09 AM; has ~5 min before ops fires. Pilot brief currently takes her ~8 min to digest.
Task A: Opened email, read subject “Monday: 3 actions…” Looked at email body for 12 sec; identified top action immediately (“restock SKU 4421”). Then opened PDF; spent 3 more min reading. Total comprehension: 3:20.
Task D: Opened email, read narrative “What changed” lead for 20 sec; then read action bullets for 30 sec. Total comprehension: 0:50.
Debrief: “D is faster. A’s named-analyst thing is nice but I already know Carlos. The narrative lead in D gave me context I needed to trust the actions.” Pricing: “$1k MRR is reasonable; I’d pay $1.2k if it consistently saved me 20 min on Mondays.”
Customer 2: West-Loop-Merch (10:30 slot; design partner; D first)
Profile: Home retailer; 22 stores; merchandiser 4 years.
Task D: 1:10 comprehension. “What changed paragraph gave me the why; bullets gave me the what.”
Task A: 2:45 comprehension. “Subject line is clever but I don’t trust unfamiliar analyst names.”
Debrief: Strong preference for D. Pricing: “$1k yes; would pay more for sector benchmarks.”
Customer 3: Outdoor-Net-A (12:00 slot; A first)
Profile: Outdoor; 8 stores; merchandiser 3 years.
Task A: 4:10 comprehension. “Took me a while to see the analyst sidebar.”
Task D: 1:30 comprehension. “This one I just got.”
Debrief: D preferred. Pricing $800.
Customer 4: Home-Net-B (14:00 slot; D first)
Profile: Home goods; 32 stores; merchandiser 8 years.
Task D: 0:45 comprehension. Fastest of all customers.
Task A: 2:20 comprehension. “Cluttered for me.”
Debrief: D strongly preferred. Pricing $1.2k.
Customer 5: Specialty-Food-Net (15:30 slot; A first)
Profile: Specialty food; 11 stores; merchandiser 5 years.
Task A: 3:50 comprehension.
Task D: 1:15 comprehension.
Debrief: D preferred. “The ‘what changed’ framing matches how I think.” Pricing $1k.
Best Quotes
- “D is faster. The narrative lead gave me context I needed to trust the actions.” - C1
- “What changed paragraph gave me the why; bullets gave me the what.” - C2
- “This one I just got.” - C3 (re Sketch D)
- “I’d pay more for sector benchmarks.” - C2
- “Cluttered for me.” - C4 (re Sketch A)
- “Would pay $1.2k if it consistently saved me 20 min on Mondays.” - C1
- “The ‘what changed’ framing matches how I think.” - C5
- “I don’t trust unfamiliar analyst names.” - C2 (re Sketch A subject-line)
Scorecard Grid
| C1 | C2 | C3 | C4 | C5 | Day-end | |
|---|---|---|---|---|---|---|
| Q1 Sketch A: Top-3 in <5 min | partial (3:20) | partial (2:45) | partial (4:10) | partial (2:20) | partial (3:50) | Partial-Validated for A (5-of-5 under 5 min but median 3:20 above 30-sec scan goal) |
| Q1 Sketch D: Top-3 in <5 min | Y (0:50) | Y (1:10) | Y (1:30) | Y (0:45) | Y (1:15) | Validated 5-of-5 for D (median 1:10) |
| Q2 Sketch A: analyst credibility | Y | N (unfamiliar names distrusted) | Y | Y | partial | Mixed for A (3-of-5 Y) |
| Q2 Sketch D: analyst credibility | Y (narrative carries trust) | Y | Y | Y | Y | Validated 5-of-5 for D |
| Q3: 2+ actions actionable (D preferred) | Y | Y | Y | Y | Y | Validated 5-of-5 |
| Q4: web companion adds value vs fragments | PDF-primary; web drill-down 2-of-5 | (same) | (same) | (same) | (same) | PDF-primary validated; web marginal |
| Q5: top-priority in first 30 sec | Y D / partial A | (same) | (same) | (same) | (same) | Validated for D 5-of-5; partial for A 3-of-5 |
Decider override: None needed; D wins decisively.
Observed Patterns
Worked (D)
- “What changed” narrative-lead (5-of-5): customers consistently described the lead paragraph as the context that made the actions trustable.
- Sub-90-sec comprehension (5-of-5; median 1:10): well below 5-min target.
- Action bullets with SKU + store + qty specificity (5-of-5): self-described all 3 as actionable.
Hesitated (A)
- Named-analyst subject-line (3-of-5): customers either didn’t know Carlos personally (C2, C3) or found the subject-line cleverness manipulative (C4).
- Email-body-as-deck (4-of-5): A’s email body tries to do too much; customers tap to PDF anyway.
Broke trust
- None observed.
Unexpected
- Sector benchmark request (3-of-5 unprompted; C1, C2, C5): customers self-described wanting “how my numbers compare to similar retailers.”
- D’s narrative lead was the credibility carrier, not analyst attribution. Trust comes from narrative voice, not named person.
Hot Takes
Mei
D wins. Rumble was the right call; we got 5-on-5 validation on D that we wouldn’t have if we’d shipped A. Going paid GA with D as launch format. Sector benchmark ask is v0.2 pull and changes long-term roadmap.
Tasha
Narrative lead is the design win. We should make narrative-writing a core analyst skill. PDF-primary + web-drill-down split is also confirmed.
Devon
Data pipeline can produce “what changed” delta calculations reliably; this is buildable. Sector benchmarks need cross-customer data which raises privacy/aggregation questions for v0.2.
Carlos
The merchandisers got D in under 90 seconds. That’s not just better than A; that’s better than anything I built at REI. This is the right format for the segment.
Decider Summary
The call: Scale to paid GA with Sketch D as the launch format.
Rationale: D validated 5-of-5 on Sprint Question 1 (median comprehension 1:10 vs A’s 3:20), 5-of-5 on Q2 trust signals, 5-of-5 on Q3 actionability, and 5-of-5 on Q5 first-30-sec scan. A’s named-analyst attribution did not scale across customers (3-of-5). Pilot’s 78% open-rate + 3.6 actionability gap is closed by D; pre-test commits met. Sector-benchmark ask is next-version, not blocking v0.1 launch.
Highest-confidence learning: Narrative-led briefs with specific SKU + store + qty action bullets are read in under 90 seconds and produce actionable Monday decisions for specialty-retail merchandisers.
Most important revision: Scope sector benchmarks into v0.2 roadmap (currently not planned).
Next artifact: PRD for paid-GA launch using Sketch D format; produced via deliver-prd within 5 business days (target: 2026-07-24).
Decider Checkpoint
- Mei confirms scorecard
- Mei commits to scale-call with Sketch D
- Mei names highest-confidence learning
- Mei names revision: scope sector benchmarks into v0.2
- Mei names next artifact: PRD by 2026-07-24
- Mei acknowledges Monday handoff: pilot retention conversations resume 2026-07-21
Signed: Mei (founder, PM/CEO), 2026-07-17 17:25 PT.
Sprint closed. Foundation Sprint + 4-week pilot + Design Sprint arc complete; Scale authorized; PRD work begins Monday 2026-07-20.