PM Critic (Dispatch Skill)

Try it: /pm-skills:utility-pm-critic "Your context here"

This skill is a cross-client dispatch wrapper for the pm-critic sub-agent. It exists so that users on non-Claude clients can run adversarial review with the same intent as Claude Code users, without depending on native plugin sub-agent infrastructure.

Per master plan D11 (amended) + D30, sub-agents are a Claude Code plugin feature. Non-Claude clients (Codex CLI, Cursor, Windsurf, Copilot, Gemini CLI) cannot natively load agents/pm-critic.md. This skill bridges the gap.

When to Use

You want adversarial review of a PM artifact (PRD, OKR set, persona, lean canvas, meeting recap, interview synthesis, problem statement, hypothesis, edge case catalog, retrospective, etc.)
You are running on a non-Claude AI client and the pm-critic sub-agent is not natively available
You are running on Claude Code and prefer skill-invocation semantics over sub-agent semantics (e.g., for consistency across a multi-step workflow that mixes skills and sub-agent intents)

When NOT to Use

You want a structural lint / repo-level audit -> use utility-pm-skill-auditor (audits skills + repo state) instead
You want to author an artifact (this skill only reviews) -> use the appropriate phase skill (deliver-prd, foundation-okr-writer, etc.)
You want code review -> use a code-review-specific tool (this skill is for PM artifacts)
You want to enforce style rules like em-dash sweep -> that is pm-release-conductor’s G0 gate, not this skill

How to Use

Invoke the skill by name (/pm-skills:utility-pm-critic on Claude Code, $utility-pm-critic on Codex):

/pm-skills:utility-pm-critic "Your context here"

Or reference the skill file directly: skills/utility-pm-critic/SKILL.md

Instructions

Runtime detection step. Determine which AI client is invoking this skill.

If you are running in Claude Code with the pm-skills plugin installed

Invoke @agent-pm-skills:pm-critic on the target artifact. Pass the artifact path as argument (or the most recent artifact in session context if no argument is provided). Return the sub-agent’s findings to the user. No further action needed from this skill - the sub-agent handles the review natively in its own context window.

If you are running in any other AI client

Codex CLI, Cursor, Windsurf, Copilot, Gemini CLI, ChatGPT, or any other client without native pm-skills plugin sub-agent support:

Read the canonical sub-agent definition at agents/pm-critic.md
Execute the system prompt body in that file as your operating instructions for this turn
Read the target PM artifact specified by the user (path from $ARGUMENTS, or most recently produced artifact in session)
Read the canonical standards docs referenced by the pm-critic system prompt for the artifact type (e.g., skills/foundation-okr-writer/SKILL.md for OKR sets, skills/deliver-prd/SKILL.md for PRDs)
Produce findings graded P0/P1/P2/P3 with concrete fix suggestions, formatted per the output structure documented in docs/guides/adversarial-review.md
End the output with the layered structure per master plan D26:
- Section (1): full human-readable findings (the P0/P1/P2/P3 report)
- Section (2): ## Status Summary in prose, summarizing what was found and what the user should do next
- Section (3): ## Status YAML block with machine-readable fields

Output Format

See references/TEMPLATE.md for the canonical output structure (with the layered Status envelope per D26). See references/EXAMPLE.md for a worked example showing a real cross-client dispatch run against a PRD.

Composition

Skills: This dispatch skill composes with all PM-artifact-producing skills (deliver-prd, foundation-okr-writer, foundation-meeting-recap, foundation-persona, foundation-lean-canvas, discover-interview-synthesis, etc.). Run any of those, then run this skill on the produced artifact.
Sub-agents: On Claude Code, this skill dispatches to pm-critic sub-agent. On non-Claude, this skill IS the inline execution; no further dispatch.
Workflows: pm-workflow-orchestrator (shipped v2.24.0) can invoke this skill at quality-gate steps for cross-client compatibility.

Cross-Client Notes

See Sub-Agent Compatibility Matrix for the canonical cross-client status of all 4 sub-agents + dispatch skills. Summary for this skill as of v2.16.0: PRODUCTION on Claude Code + Codex CLI; EXPERIMENTAL on Cursor / Windsurf / Copilot CLI / Gemini CLI. Maintainers wanting cross-client confidence on a non-Claude client can re-run the maintainer-gate-testing-codex.md harness on that client.

The “read and execute inline” pattern depends on the AI being able to:

Read a file path provided as a reference (most AI clients support this)
Treat that file’s content as operating instructions for the current turn (most AI clients support this for SKILL.md-style instructions)
Read additional referenced standards docs at invocation time (all major AI clients support this)

If any of these are unreliable on a given client, that client cannot use this dispatch skill effectively.

Reference Files

Canonical sub-agent definition: agents/pm-critic.md
Behavioral spec: docs/internal/release-plans/v2.16.0/spec_pm-critic.md
User-facing guide: docs/guides/adversarial-review.md
Runtime components catalog: docs/reference/runtime-components.md
Output template: references/TEMPLATE.md
Worked example: references/EXAMPLE.md

Output Template

When utility-pm-critic produces output (either via native sub-agent dispatch on Claude Code or via inline execution on non-Claude clients), the output follows this three-section layered structure per master plan D26.

Section 1: Full Findings Report

# pm-critic findings: {artifact name}

**Artifact reviewed:** {file path or session-context reference}
**Standards consulted:** {comma-separated list of contract docs read at invocation}
**Findings:** N (P0: X, P1: Y, P2: Z, P3: W)

## P0 findings

### F-01: {short finding title (one line)}

**Location:** {line range or section heading in the artifact}
**Issue:** {what is wrong, written precisely}
**Why it matters:** {consequence if shipped as-is}
**Fix:** {concrete change to make, including specific text or restructuring}

### F-02: {next finding}
...

## P1 findings

(same per-finding structure as P0)

## P2 findings

(same per-finding structure as P0; can be more abbreviated)

## P3 findings

(same per-finding structure as P0; one-line findings acceptable)

If no findings at a severity, omit that section heading. If zero findings total, output the canonical “passes adversarial review” line per agents/pm-critic.md.

Section 2: Status Summary (Prose, for Human Readers)

## Status Summary

The {artifact type} review returned N findings: {P0: X, P1: Y, P2: Z, P3: W}.

{One paragraph summarizing the top 1-3 issues by severity. Plain English. No bullets.}

**Recommended next action:** {explicit recommendation - revise with producing skill, defer to maintainer judgment, accept all P2/P3 and ship, etc.}

**Refusal triggered:** {yes/no. If yes, explain why. Refusal protocols are in agents/pm-critic.md.}

This section is for the human reader (PM or maintainer). It captures the “what does this mean” in prose so the reader does not have to interpret the findings table to make a decision.

Section 3: Status YAML (Machine-Parseable, for Parent Sub-Agent or Tooling)

## Status

```yaml
artifact:
  path: {file path of reviewed artifact}
  type: {prd | okr-set | persona | lean-canvas | meeting-recap | etc.}
  thread: {brainshelf | storevine | workbench | other | none}

findings:
  total: {integer}
  p0: {integer}
  p1: {integer}
  p2: {integer}
  p3: {integer}

refusal:
  triggered: {true | false}
  reason: {null OR one-line explanation if triggered}

standards_consulted:
  - {path to first standards doc read}
  - {path to second}
  - ...

recommended_action: {revise | accept-and-ship | escalate | refused}

dispatch:
  mode: {native-subagent | inline-execution}
  client: {claude-code | codex-cli | cursor | windsurf | copilot | gemini-cli | other}
```

The YAML block enables programmatic consumption: a parent sub-agent (e.g., pm-release-conductor at gate G0) can parse the YAML to decide whether to advance the release flow. A CI workflow or automation can parse it to integrate findings into a tracking system.

Format Notes

All three sections are present in every output, even if Section 1 is the “zero findings” line or Section 2 reports a clean review
The Status YAML uses the four-space indentation pattern shown above; YAML parsers tolerate it but the indentation MUST be consistent within a single output
The dispatch.mode field distinguishes native sub-agent runs (sub-agent context isolation, no inline execution) from inline execution runs (sub-agent system prompt executed in the dispatch skill’s context). Useful for telemetry and debugging.
If the dispatch skill is invoked on Claude Code and the native sub-agent succeeds, the sub-agent itself produces all three sections; the dispatch skill simply relays them. If the dispatch skill is invoked on a non-Claude client, the dispatch skill produces all three sections itself by executing the pm-critic system prompt inline.

Canonical sub-agent: agents/pm-critic.md
Worked example: EXAMPLE.md
User guide: docs/guides/adversarial-review.md

Example Output

Example: pm-critic Dispatch on Codex CLI

This example shows what utility-pm-critic produces when invoked on a non-Claude client (here, Codex CLI). The dispatch skill reads agents/pm-critic.md as its operating instructions for the turn and executes the review inline.

Invocation

codex> /utility-pm-critic library/skill-output-samples/deliver-prd/sample_deliver-prd_brainshelf_resurface.md

Skill behavior

Runtime detection. The skill detects it is running in Codex CLI (not Claude Code with pm-skills plugin), so it takes the non-Claude branch.
Reads canonical sub-agent definition. Loads agents/pm-critic.md as the operating system prompt for this turn.
Reads target artifact. Loads the Brainshelf Resurface PRD at the supplied path.
Reads standards docs. Per the pm-critic system prompt’s referential discipline, reads skills/deliver-prd/SKILL.md (PRD contract), skills/deliver-edge-cases/SKILL.md (edge case completeness), and skills/define-hypothesis/SKILL.md (hypothesis-PRD coherence).
Produces findings. Returns the three-section layered output below.

Output (verbatim from dispatch skill execution)

pm-critic findings: Brainshelf Resurface PRD

Artifact reviewed: library/skill-output-samples/deliver-prd/sample_deliver-prd_brainshelf_resurface.md Standards consulted: skills/deliver-prd/SKILL.md, skills/deliver-edge-cases/SKILL.md, skills/define-hypothesis/SKILL.md Findings: 7 (P0: 1, P1: 3, P2: 2, P3: 1)

P0 findings

F-01: User-story acceptance criteria deferred to external document; PRD ships without verifiable behavior

Location: User Stories section, line 106 (“See the detailed user stories document for full acceptance criteria”) Issue: The PRD lists 5 user stories (US-001 through US-005) with priorities but defers full acceptance criteria to an external “user stories document” that is referenced but not linked, not versioned, and not committed alongside this PRD. Why it matters: Per skills/deliver-prd/SKILL.md, a PRD must contain testable acceptance criteria for the engineering team to build verifiable behavior. External references create drift risk and ambiguity. Fix: Embed full acceptance criteria for each user story inline in this PRD using Given/When/Then structure, or replace the User Stories section with a single link to the external doc + explicit commitment that this PRD is abstract scope and the linked doc is the testable spec.

P1 findings

F-02: Threshold inconsistency between opt-in goal and mitigation trigger creates a 5-percentage-point dead zone

Location: Success Metrics table line 88 (“Opt-in rate >= 10%”) vs Risks table line 213 (“if below 5% at day 7”) Issue: The PRD states a >=10% opt-in goal but the mitigation triggers at <5% at day 7. The 5%-9.99% range has no playbook. Fix: Pick one resolution: raise the mitigation trigger to <10%, lower the goal to >=5%, or split into “minimum acceptable” + “target” bands.

F-03: 14-day exclusion window interacts pathologically with small-library users

Location: FR-4 (line 144) and Open Question 3 (line 235) Issue: Users with 10-20 saved items exhaust their resurface pool in 3-4 days. From day 5 onward they receive no email, biasing the experiment toward null. Fix: Shorten the exclusion window proportionally to library size, OR exclude small-library users from the experiment and run as separate cohort.

F-04: A/B test on opt-in self-selected sample vs no-email control; treatment effect not generalizable

Location: Solution Summary (line 67) and Goals & Success Metrics (line 77) Issue: Opt-in users are systematically more engaged than non-opt-in. The 7-day return-rate uplift is confounded with “kind of user who opts in.” Fix: Compare opt-in users in treatment vs opt-in users in control (both opted in; only treatment receives email), OR add an opt-in + no email washout arm.

P2 findings

F-05: TF-IDF relevance threshold (0.15) lacks rationale

Location: FR-7 (line 151) Issue: Threshold value asserted without justification or sensitivity analysis. Fix: Cite the spike summary’s threshold curve, or add a Week 1 setup observation to measure threshold behavior.

F-06: “Active in last 30 days” eligibility uses low bar (one save event)

Location: Target Users (line 71) Issue: Single save event in 30 days qualifies a user; this includes users least likely to engage with email and inflates the eligible-user denominator. Fix: Tighten to “more than one save event in 30 days” or “at least one app-open in 14 days.”

P3 findings

F-07: “Sprint 8” terminology used without team-calendar definition

Location: Timeline & Milestones (line 222) Issue: Sprint references depend on team-internal calendar; readers without context cannot date-anchor. Fix: Replace with date range on first mention or add a one-line “Sprint 8 = Feb 3-16, 2026” footnote.

Status Summary

The Brainshelf Resurface PRD review returned 7 findings: 1 P0, 3 P1, 2 P2, 1 P3.

The single P0 finding (F-01) is a real ship-blocker: user stories ship without inline acceptance criteria, leaving engineering to build against ambiguous specs. The 3 P1 findings cluster around experiment validity (F-03, F-04 directly affect causal inference; F-02 is a decision-criteria inconsistency). P2 and P3 findings are quality improvements that the author can defer if time-constrained.

Recommended next action: Revise with /deliver-prd before engineering kickoff on Feb 3. Inline the user-story acceptance criteria (F-01), resolve the opt-in-rate threshold conflict (F-02), and decide between the two F-03/F-04 fixes for experiment validity before the Sprint 8 build starts. P2/P3 are author judgment.

Refusal triggered: no. The PRD is complete, in scope, above review threshold, and the artifact type is recognized via frontmatter (artifact: prd).

Status

artifact:
  path: library/skill-output-samples/deliver-prd/sample_deliver-prd_brainshelf_resurface.md
  type: prd
  thread: brainshelf

findings:
  total: 7
  p0: 1
  p1: 3
  p2: 2
  p3: 1

refusal:
  triggered: false
  reason: null

standards_consulted:
  - skills/deliver-prd/SKILL.md
  - skills/deliver-edge-cases/SKILL.md
  - skills/define-hypothesis/SKILL.md

recommended_action: revise

dispatch:
  mode: inline-execution
  client: codex-cli

Notes on This Example

This example demonstrates the dispatch skill’s “read pm-critic.md and execute inline” pattern on Codex CLI. The findings reproduce the canonical Brainshelf PRD review (also shipped as library/sub-agent-samples/pm-critic/sample_pm-critic_brainshelf_prd-review.md) because the inline execution should produce findings consistent with the native sub-agent output on Claude Code.

The cross-client consistency is the validation criterion for Phase 2 GATE B. If a non-Claude client running this dispatch skill produces findings that diverge significantly from the Claude Code native sub-agent output on the same artifact, the dispatch mechanism is unreliable and the fallback (Option F: clean Claude-Code-only labeling) applies.

Canonical sub-agent: agents/pm-critic.md
Skill manifest: SKILL.md
Output template: TEMPLATE.md
Native sub-agent sample: library/sub-agent-samples/pm-critic/sample_pm-critic_brainshelf_prd-review.md

PM Critic (Dispatch Skill)

When to Use

When NOT to Use

How to Use

Instructions

If you are running in Claude Code with the pm-skills plugin installed

If you are running in any other AI client

Output Format

Composition

Cross-Client Notes

Reference Files

Output Template

Output Template

Section 1: Full Findings Report

Section 2: Status Summary (Prose, for Human Readers)

Section 3: Status YAML (Machine-Parseable, for Parent Sub-Agent or Tooling)

Format Notes

Related Files

Example Output

Example: pm-critic Dispatch on Codex CLI

Invocation

Skill behavior

Output (verbatim from dispatch skill execution)

pm-critic findings: Brainshelf Resurface PRD

P0 findings

F-01: User-story acceptance criteria deferred to external document; PRD ships without verifiable behavior

P1 findings

F-02: Threshold inconsistency between opt-in goal and mitigation trigger creates a 5-percentage-point dead zone

F-03: 14-day exclusion window interacts pathologically with small-library users

F-04: A/B test on opt-in self-selected sample vs no-email control; treatment effect not generalizable

P2 findings

F-05: TF-IDF relevance threshold (0.15) lacks rationale

F-06: “Active in last 30 days” eligibility uses low bar (one save event)

P3 findings

F-07: “Sprint 8” terminology used without team-calendar definition

Status Summary

Status

Notes on This Example

Related Files