Skip to content

Tuckman group-development stages

Status: Documented, not shipped · Evidence: P · Family: Self and team awareness · Verdict: reject (2026-06-11)

Tuckman’s model says a small group passes through a sequence of named phases as it matures: forming (orientation, polite dependence on a leader), storming (conflict over roles, power, and approach), norming (settling into shared norms and cohesion), performing (working effectively on the task), and - added later - adjourning (disbanding). The shippable “thinking move” people draw from it is a two-step diagnostic: read which phase a team is currently in from its observable behaviour, then select the facilitation or leadership intervention the model prescribes for that phase (for example, give structure while forming, mediate conflict while storming, step back while performing).

The durable cognitive move, stated plainly, is fixed-typology classification of a team’s current state against a four-box developmental sequence, followed by reading off the box’s prescribed action. It is a descriptive map offered as a diagnosis-then-prescription, not a generative reasoning procedure.

It helps as a shared vocabulary. Naming “we are storming” can de-personalise conflict (“this is a normal phase, not a sign the team is broken”) and give a facilitator a quick, communicable frame for why a young team is struggling. As a coaching heuristic for someone with no model at all, it is better than nothing.

It misleads in several specific ways. First, the strict linear sequence is the weakest part of the model and the part the diagnosis leans on: real teams rarely march forming -> storming -> norming -> performing in order. They loop back, skip, re-storm after a membership change, or never visibly storm. A facilitator who has decided a team “should” be norming can misread a healthy team or force a phase that is not there. Second, the typology is coarse: most real team trouble (unclear goals, bad incentives, a missing skill, an absent decision) is not a “phase” and is not fixed by a phase-appropriate intervention. Third, the prescriptive half (“now do the storming intervention”) is generic facilitation advice that the stage label adds little to. The when-NOT boundary: do not use it to explain away a structural or task problem as a developmental phase, do not use it on a stable long-running team where the sequence has no purchase, and do not treat the box you assign as a measurement.

Honest grade: P (practitioner). Widely taught, genuinely useful as vocabulary, but the specific claims it makes are descriptive and contested, and the effectiveness of the diagnose-then-act move itself is untested.

Tuckman’s original 1965 paper, “Developmental Sequence in Small Groups,” was a literature review of about 50 published articles on therapy, training, and laboratory groups - not an original empirical study of work teams. The famous labels appear only in the paper’s summary. Tuckman and Jensen’s 1977 follow-up, “Stages of Small-Group Development Revisited,” reviewed the intervening decade of studies and reported that of the studies examined only one had set out to directly test the four-stage hypothesis; they added “adjourning” and called for more empirical work. The strict linear sequence has been directly challenged: Gersick (1988) found teams advance by punctuated equilibrium (long stable periods broken by an abrupt midpoint transition) rather than gradual staged progress, and Bonebright’s (2010) “40 Years of Storming” historical review traces the model’s popularity to HRD practice rather than to a strong empirical record. A frequently cited software-engineering observation reports that only a small minority of studied teams (on the order of a few percent) actually followed the canonical forming-to-performing order; treat that specific percentage as illustrative of the direction of the evidence rather than as a precise, primary-sourced effect.

Crucially, the stronger empirical record in this area belongs to a DIFFERENT model and instrument: Susan Wheelan’s Integrated Model of Group Development and its Group Development Questionnaire (GDQ). Wheelan and colleagues (Wheelan and Tilin 1999; Wheelan, Davidson and Tilin 2003; Wheelan 2009) report acceptable GDQ reliability and correlational links between a group’s developmental stage and outcomes - more productive groups score lower on the early dependency and conflict scales and higher on the later work scale, and faculty groups at higher developmental stages were associated with better student standardized-test results. That is real evidence, but it is evidence for an instrument-scored sibling model, and it is correlational (stage-and-productivity association), not a test of whether an agent or facilitator diagnosing Tuckman’s four stages and acting on the diagnosis improves outcomes. Per this library’s grading rule, that adjacent, transferred, instrument-validity evidence does NOT raise the grade of the Tuckman thinking move: the move’s own effectiveness is unmeasured, and the model it names is descriptive with a contested core. Honest grade therefore caps at P; it is not laundered up to M on the strength of Wheelan’s GDQ studies.

Verdict: Reject (documented as a deliberate exclusion), tier P, status excl. This overturns the preliminary cand / build placement, and here is the stated reason.

Overlap is low against every shipped skill: think-after-action-review is event-retrospective (expected vs actual on a finished event), not a team-state diagnosis; the perspective skills (parallel-perspectives-review, role-storming) take viewpoints, they do not classify a team’s developmental stage. So this is not a fold into a shipped skill, and it is not a recipe (no chain of existing moves reconstructs “diagnose the team’s developmental stage”). It clears the overlap ceiling.

It fails the selection bar on the merits, which is the decisive test. (1) The move is fixed-typology classification - fit the team into one of four named boxes, then read off the prescribed intervention. This library has consistently declined to ship fixed-typology lenses whose structure is empirically weak (MBTI is excluded at X, DISC at V, Belbin and Enneagram at C); Tuckman is the team-level analogue of that pattern, and the box-structure it asks the agent to apply (the strict linear sequence) is exactly the contested part. (2) There is no move-effectiveness evidence: nothing shows that diagnosing a team’s Tuckman stage and acting on it improves outcomes, and the supporting empirical record (Wheelan’s GDQ) measures a different instrument and is correlational. (3) The prescriptive half (“pick the fitting facilitation intervention”) lands in the facilitation-and-group-structures genre, where this library ships nothing and has judged every candidate fold, excl, or flag - the standing facilitation wall. An agent-executable artifact would amount to “label the team’s box and restate generic stage advice,” which is closer to instrument administration than to a durable, generative thinking move - the exact line the self-and-team-awareness cluster was set up to police.

The model is famous enough that the honest, sourced exclusion dossier is itself the product: it lets the advisor and the catalog say, truthfully, why a household-name framework is not shipped here, rather than leaving its absence to look like an oversight. A maintainer who wanted the diagnostic vocabulary anyway could keep it at cand and revisit, but on the evidence and the selection bar the conservative call is excl.

  • Bruce W. Tuckman (1965), “Developmental Sequence in Small Groups,” Psychological Bulletin 63(6): 384-399 - the origin; a literature review that coined forming/storming/norming/performing.
  • Bruce W. Tuckman and Mary Ann C. Jensen (1977), “Stages of Small-Group Development Revisited,” Group and Organization Studies 2(4): 419-427 - adds adjourning; notes how little of the intervening research directly tested the hypothesis.
  • Connie J. G. Gersick (1988), “Time and Transition in Work Teams: Toward a New Model of Group Development,” Academy of Management Journal 31(1): 9-41 - the punctuated-equilibrium challenge to linear staging.
  • Denise A. Bonebright (2010), “40 Years of Storming: A Historical Review of Tuckman’s Model of Small Group Development,” Human Resource Development International 13(1): 111-120 - the practice-driven history and the honest read on the empirical record.
  • Susan A. Wheelan, Barbara Davidson and Felice Tilin (2003), “Group Development Across Time: Reality or Illusion?,” Small Group Research 34(2): 223-245; and Susan A. Wheelan (2009), “Group Size, Group Development, and Group Productivity,” Small Group Research 40(2): 247-262 - the GDQ / Integrated Model strand, the stronger but separate empirical record for stage-and-productivity, cited here precisely to mark why it does not transfer to the Tuckman move.
Was this page helpful?
Thinking Framework Skills v0.8.0 · 56 frameworks