Skip to content

Survey Analysis

Quick facts

Phase: Measure | Version: 1.0.0 | Category: research | License: Apache-2.0

Try it: /survey-analysis "Your context here"

You analyze survey results into actionable PM insights. Your job is to (a) honestly characterize what the data shows, (b) flag what it does NOT show, (c) identify themes in open-text responses, (d) connect findings to hypotheses, and (e) produce prioritized recommendations.

How to Use

Use the /survey-analysis slash command:

/survey-analysis "Your context here"

Or reference the skill file directly: skills/measure-survey-analysis/SKILL.md

Output Template

Survey Analysis: [Survey Name]

Executive Summary

[Summary]

Survey Methodology Summary

  • Sample size (N): [N] (response rate: [X%] if known)
  • Recruitment method: [Panel / customer email / in-product / social]
  • Who responded vs. who was invited: [Distribution]
  • Selection bias risks: [Who is over/under-represented and why]
  • Question-design risks: [Leading, double-barreled, response-option bias]

Per-Question Analysis

Q#QuestionDistribution (counts / %)ConfidenceWhat it showsWhat it does NOT show
Q1[Question][Counts][Direction-only / Medium / High][Reading][Caveat]

Persona / Segment Breakdown

SegmentnKey difference from overallConfidence
[Segment][n][Difference][Flag if n<30]

Open-Text Thematic Clustering

ThemeApprox. mentionsRepresentative quotes (from provided excerpts)ValenceContradicts quant pattern?
[Theme 1][~N]“[quote]”[+/-/mixed][Yes/No]

Hypothesis Validation

HypothesisStatusEvidenceConfidence
[H1][SUPPORTED / CONTRADICTED / INCONCLUSIVE / NOT-TESTED][Question / theme][High/Medium/Low]

What the Data Does NOT Show

  • Population not represented: [Who]
  • Questions not answered: [What]
  • Confounds: [What could distort the reading]
  • Follow-up that would close the biggest gap: [Research]

Prioritized Recommendations

#RecommendationEvidenceConfidenceCounter-evidenceResearch that would strengthen it
1[Recommendation][Q/theme][H/M/L][If any][What]

Next Steps

  • [Next artifact: update PRD / trigger follow-up survey / commission interviews]
  • [Decisions this can inform; decisions it cannot]

Example Output

Survey Analysis: AI Notes-to-Tasks Adoption Survey

Survey Analysis: AI Notes-to-Tasks Adoption Survey

This is an illustrative survey analysis. All response counts, percentages, and open-text quotes are fictional [fictional] stand-ins for what real survey data would look like.

Executive Summary

We surveyed users to test the hypothesis that they would adopt an AI feature converting meeting notes into tasks (N=240, in-product prompt). Stated interest is high (78% said they would use it), but two things temper that: the key question is mildly leading, and the open-text reveals a strong accuracy/trust concern that the quantitative number hides. The honest verdict is INCONCLUSIVE leaning supported: there is real demand signal, but stated intent from a power-user-biased sample is not proof of adoption. Confidence: Medium. The most important caveat: this measures what users say, not what they will do.

Survey Methodology Summary

  • Sample size (N): 240 (response rate ~6% from ~4,000 in-app prompts)
  • Recruitment method: In-product banner shown to users who opened a project in the last 7 days
  • Who responded vs. who was invited: Active users only; dormant and churned users had no chance to respond
  • Selection bias risks: Active/power users are over-represented; people who do not take meeting notes self-selected out, inflating interest
  • Question-design risks: Q2 (“Would you use an AI feature that automatically turns your messy meeting notes into organized tasks?”) is mildly leading - it pairs a pain (“messy”) with a benefit (“organized”)

These methodology choices affect what conclusions can be drawn: this is a directional read from engaged users, not a representative adoption forecast.

Per-Question Analysis

Q#QuestionDistributionConfidenceWhat it showsWhat it does NOT show
Q1How often do you take meeting notes in the product?Weekly 41% / Sometimes 38% / Never 21%Medium (N=240)A majority take notes at least sometimesWhether note-takers are the buyers
Q2Would you use an AI notes-to-tasks feature?Yes 78% / Maybe 16% / No 6%Medium, flagged BiasedStrong stated interestReal adoption; the wording is leading
Q3What would stop you from using it? (open text)142 responsesMediumAccuracy and trust concerns dominateMagnitude of the concern at scale
Q4Plan tier (segmentation)Free 90 / Pro 110 / Enterprise 40-Enables segment cuts-

Q2 is reported but flagged Biased. Based on instrument-bias patterns, leading questions of this kind typically overstate intent; treat the 78% as an optimistic ceiling, not a forecast.

Persona / Segment Breakdown

SegmentnKey difference from overallConfidence
Free9071% “yes” on Q2; most accuracy-skeptical in open textMedium
Pro11082% “yes”; highest note-taking frequencyMedium
Enterprise4080% “yes” but raised data-privacy concernsLow (n=40)
Enterprise admins (sub-segment)12Privacy concern concentrated hereToo small (n<30) - directional only

The Enterprise admin sub-segment (n=12) is below the threshold for a defensible claim; the privacy signal there is a flag to investigate, not a finding.

Open-Text Thematic Clustering

AI-assisted clustering of the 142 Q3 responses; quotes are drawn from the provided open-text excerpts. Mention counts are approximate.

ThemeApprox. mentionsRepresentative quotesValenceContradicts quant pattern?
Accuracy / trust~64”I would not trust it to capture action items correctly”; “if it misses a task that is worse than no feature”NegativeYes - tempers the 78% yes
Editing control~38”I would want to review and edit before it creates anything”ConditionalPartially
Privacy / data handling~22”where do my meeting notes get sent?”NegativeConcentrated in Enterprise
Time saved~26”this would save me 20 minutes after every standup”PositiveReinforces

The accuracy/trust theme is the most valuable signal: it contradicts the upbeat Q2 number and predicts that adoption hinges on perceived reliability, not on interest.

Hypothesis Validation

HypothesisStatusEvidenceConfidence
Users would adopt an AI notes-to-tasks featureINCONCLUSIVE (leaning supported)Q2 stated interest high (but leading + biased sample); open-text shows adoption is gated on accuracy/trustMedium
Users will pay more for itNOT TESTED BY THIS SURVEYNo pricing or willingness-to-pay question was asked-

What the Data Does NOT Show

  • Population not represented: Dormant and churned users (only active users were prompted); non-note-takers self-selected out
  • Questions not answered: Willingness to pay; whether stated intent converts to actual usage
  • Confounds: Q2 wording inflates intent; in-product recruitment inflates the engaged-user signal
  • Follow-up that would close the biggest gap: A prototype with real usage measurement (does stated 78% interest convert to actual use?), and a neutrally-worded re-ask of Q2

Prioritized Recommendations

#RecommendationEvidenceConfidenceCounter-evidenceResearch that would strengthen it
1Prototype and measure actual usage before full buildStated intent is high but unproven; trust themeMediumThe 78% could be real demandA behavioral pilot with usage telemetry
2Make accuracy and edit-before-commit the headline design constraintAccuracy/trust is the top open-text themeHighNoneUsability test of an editable draft flow
3Address Enterprise data handling explicitlyPrivacy theme concentrated in EnterpriseLow (small n)n=40, sub-segment n=12Targeted Enterprise-admin interviews
4Re-ask the adoption question with neutral wordingQ2 is leadingMedium-A/B the question wording in the next pulse

Next Steps

  • Build a prototype and instrument actual usage; do not commit the full feature on stated intent
  • Commission 5-8 interviews to deepen the accuracy/trust theme (skill: discover-interview-synthesis)
  • This analysis can inform whether to prototype; it cannot, on its own, justify a full build or a pricing decision

Real-World Examples

See this skill applied to three different product contexts:

Storevine (B2B): Storevine B2B forecasting platform - feature-prioritization survey of 180 customer admins, segmented by company size

Prompt:

/survey-analysis
analyze our storevine feature-prioritization survey. 180 customer admins
responded. we asked them to rate 5 candidate features by importance and pick
their #1.
our hypotheses going in:
- H1: multi-warehouse support is the top ask
- H2: seasonal-adjustment accuracy is a close second
segment by company size (we captured it). tell us what to build next.

Output:

Survey Analysis: Storevine Next-Feature Validation

Brainshelf (Consumer): Brainshelf consumer subscription - quarterly NPS survey (N=1200) with an open-text follow-up

Prompt:

/survey-analysis
analyze our Q2 brainshelf NPS survey. 1200 subscribers responded. standard
NPS question (0-10) plus an open text "what's the one thing you'd change?"
last quarter's NPS was 18. mine the open text for what we should build next.

Output:

Survey Analysis: Brainshelf Q2 NPS Pulse

Workbench (Enterprise): Workbench internal dev-experience platform - exploratory pulse survey of 65 engineers, sample too small for strong inference

Prompt:

/survey-analysis
analyze our dev-experience pulse survey. 65 engineers responded out of ~280.
mix of likert questions (rate your dev experience 1-5 across a few areas)
plus an open text "biggest friction in your day?". tell us what to prioritize.

Output:

Survey Analysis: Workbench Dev-Experience Pulse

Read this first: N=65 (of ~280 engineers, ~23% response). This sample is large enough to spot directional themes but too small for statistically reliable conclusions or capital-allocation decisions. Everything below is direction-only. Treat it as a signal of where to look, not as a mandate of what to fund.

Quality Checklist

Before finalizing, verify:

  • Methodology summary audits sample size, recruitment, and question-design risks
  • Every confidence label is qualitative and tied to sample size (no implied computed precision)
  • Segment claims with n < 30 are flagged as too small
  • Open-text quotes are drawn only from provided excerpts, never invented
  • Each hypothesis gets a status, including “Not tested by this survey” where applicable
  • A “what the data does NOT show” section is present and specific
  • No causal claim is made from cross-sectional data
  • Recommendations carry confidence labels and counter-evidence