Architecture Decision Record

A short structured document that captures a significant architectural decision, its context, and its consequences.

Architecture Decision Record

An ADR is a lightweight record of a decision that was hard to make and will be hard to change. Its value is not in the decision itself but in the reasoning: future engineers who encounter the system and wonder “why is it built this way?” have an answer that does not depend on institutional memory. An ADR is a gift to your future self and to everyone who inherits the codebase.

The canonical ADR structure is three sections: Context (what forces were at play), Decision (what was chosen and why), and Consequences (what the decision costs and what it enables). Context is not history - it is the live forces at the time of the decision that made this the right call. Consequences are not just the good ones - a good ADR honestly names the tradeoffs.

Canonical template

# [ADR-NNNN] [Decision Title]

## Status
[Proposed | Accepted | Deprecated | Superseded by ADR-NNNN]

## Context
[What forces were at play. The problem, the constraints, the alternatives considered.]

## Decision
[What was decided. Be specific. Name the choice, not just the category.]

## Consequences
### Positive
- [benefit]
### Negative
- [tradeoff or cost]
### Neutral
- [consequence that is neither good nor bad]

When to use

Recording architectural decisions, capturing technology choices, documenting tradeoffs for future maintainers, team alignment on significant choices.

When not to use

Operational documentation, explaining how a system works, consumer-facing content.

Pairs well with

pragmatic-architect, operator, candid, matter-of-fact, problem-solution, comparison-contrast

Often confused with

prd: A PRD defines what should be built and why. An ADR records a decision already made about how to build it.

Tells

Three canonical sections: Context, Decision, Consequences
A Status line (Proposed, Accepted, Deprecated, or Superseded by ADR-NNNN)
Context names the live forces and constraints at decision time, not project history
Decision names the specific choice, not just the category
Consequences honestly includes a Negative subsection, not only benefits
Short and focused, roughly 300-600 words, readable in a few minutes

Anti-patterns

Writing Context as a history narrative instead of the live forces at decision time - Context exists to show why this was the right call under the constraints then; history does not let a future reader reconstruct the reasoning.
Listing only positive consequences and omitting the tradeoffs - The Consequences section is where an ADR earns its value; hiding the negatives turns a decision record into marketing.
Reserving ADRs only for rare, major decisions - ADRs should be small and numerous; the compounding smaller decisions go unrecorded if the bar is set too high.
Describing what to build rather than recording a decision already made about how - That collides with the confusable prd format; an ADR records a made decision and its reasoning, not a product specification.

Failure modes

Over-formalizes - a short, clear call balloons into a multi-section treatise that buries the decision it exists to record - Keep it to roughly 300-600 words; if it needs a 20-minute read, the decision is buried, so split it or move detail to a linked doc.
Fills out the full Context/Decision/Consequences ceremony for a non-decision, so the record is all scaffold and no actual choice - Write an ADR only when there is a real, hard-to-reverse decision; if naming the Decision section feels forced, there is no ADR to write.

Instruction

Write as an Architecture Decision Record (ADR). Use the canonical three-section structure:
Context, Decision, Consequences. In Context: name the live forces at the time of decision - not
history, but the constraints, options, and pressures that made this choice necessary. In
Decision: name the specific choice, not just the category. In Consequences: be honest about the
tradeoffs - name the negative consequences alongside the positive. The Consequences section is
where ADRs earn their value. Do not omit the hard truths. Keep the document focused and short -
an ADR that requires a 20-minute read is too long.

Template

See the Architecture Decision Record template.

Examples

ADR-0014: Adopt Async-First Standup Format

Status

Accepted

Context

The engineering team has grown from 6 to 11 engineers over 18 months. The team now spans four timezones: US Pacific (3 engineers), US Eastern (3 engineers), UK (2 engineers), India (3 engineers). The current synchronous standup is scheduled at 9am Pacific, which is 9:30pm India Standard Time.

Three forces pushed this decision:

Timezone asymmetry. The India-based engineers are disproportionately burdened by the meeting schedule. Attendance data from Q1 shows the three India engineers averaged 3.2 standup appearances per week out of 5, compared to 4.6 for US-based engineers. The shortfall is not disengagement - it is 9:30pm.

Information loss. Status shared verbally in the meeting does not persist. We have documented three incidents in the past quarter where an engineer spent more than an hour on a problem that had already been solved and discussed in a previous standup. There is no searchable record.

Meeting-to-value ratio. The standup averages 14 minutes. Analysis of the past month shows an average of 4.2 minutes of content that changed someone’s behavior - a blocker raised, a dependency flagged, a context shared. The remaining 10 minutes is status that required no response from anyone.

Alternatives considered: rotating the meeting time (solves equity but adds overhead and still does not create persistence), eliminating standup entirely (loses coordination value), and adopting an async tool like Geekbot (rejected on cost and added tooling complexity - Slack templates serve the same function).

Decision

Replace the synchronous daily standup with an async standup update in #team-standup. Engineers post by 10am their local time using a pinned template:

Shipped: what completed in the last 24 hours
In progress: current focus
Blocked / at risk: anything that needs attention, with @mention of the person who can resolve it

The on-call engineer reads the channel by 9am Pacific and responds to blocked items within 30 minutes during business hours. The synchronous standup slot is replaced with a 60-minute Thursday working session - not a status meeting, reserved for discussion requiring real-time exchange.

Consequences

Positive

All engineers participate on a schedule that fits their timezone
Status information is persistent and searchable
Blocked items route directly to the person who can resolve them via @mention
Engineers recover 70 minutes per week previously spent in synchronous status reporting

Negative

Social cohesion that comes from shared daily presence is reduced; the Thursday session is a partial substitute but not equivalent
The format depends on consistent participation - if engineers stop posting, the channel’s value drops for everyone
Blockers that require nuance are harder to surface in a structured three-field template than in a live conversation

Neutral

On-call rotation adds a daily channel-reading responsibility, but this replaces the meeting facilitation responsibility previously on the same rotation
The Thursday working session is new overhead for some engineers who previously skipped standup

ADR-0023: Use Postgres for the Notification Service

Status

Accepted

Context

Lattice Notify is launching a real-time notification system that needs a new persistent data store. The system will handle 500K notification events per day at launch, with a 10x growth scenario in 12 months if the pending Slack-partnership deal closes. The decision sits between two candidates:

Option A: Postgres. Extend the existing Postgres footprint with a new schema, add a job queue, and absorb the resulting scaling work. The team has operated Postgres at this scale before. Cross-database queries against the existing monolith data stay simple.
Option B: DynamoDB. Adopt a new datastore that fits the notification access pattern (write-heavy, point-lookups by user) and scales without operator intervention. The team has no production DynamoDB experience. Ops surface area doubles. There is no rollback plan if it goes wrong.

Three forces pushed this decision:

Team operational capacity. We have 8 backend engineers and a 4-person on-call rotation. Adding a second database adds a second runbook, a second monitoring surface, a second backup story, and a second debugging skillset on every page. We have measured this cost before in a separate workstream and it is non-trivial.

Growth uncertainty. The 10x growth scenario depends on a deal that has not closed. Designing the system for the larger scenario, when the smaller one is the certain one, optimizes for the case that may not arrive.

Reversibility cost. If we choose Postgres and outgrow it, we incur 3-6 weeks of rework to migrate. If we choose DynamoDB and find we need cross-database joins for product features, we incur similar rework plus a team that has learned the wrong tool. The asymmetry is small; both choices are recoverable.

Marcus made a strong case for DynamoDB’s access-pattern fit. Ana raised the operational capacity concern. The architecture meeting on Wednesday confirmed that the operational concern is the load-bearing one.

Decision

Build the notification service on Postgres, using a new schema (notifications) in the existing primary cluster and a job queue backed by pg_notify plus a notification_jobs table. Provision read replicas to absorb fanout reads. Add a documented threshold (5M events/day sustained) at which we revisit DynamoDB before scaling the Postgres path further.

Priya has the decision recorded for the Friday sprint planning.

Consequences

Positive

Single operational surface for the 4-person on-call rotation. No new runbooks, no new monitoring, no new debugging skillset on call.
Cross-database queries (joining notifications to users, accounts, workspaces) remain simple SQL.
The team ships the launch scope on familiar ground. Estimated 3 weeks faster to first production traffic than the DynamoDB path.
The decision is reversible: if we cross the 5M events/day threshold, we have the data and the operational margin to plan a migration.

Negative

We will likely need to do non-trivial Postgres tuning at the 10x growth point: partitioning the notifications table, tuning the job queue, possibly sharding. This work is real and is on the roadmap, not avoided.
Marcus’s argument about access-pattern fit is correct in isolation; we are accepting a worse fit for the access pattern in exchange for a better fit for the team’s operational reality.
If the Slack deal closes and growth arrives faster than 12 months, we hit the rework window earlier than planned.

Neutral

The notification_jobs table becomes a new operational concern: queue depth, dead-letter handling, retry policy. These are familiar problems on a familiar platform.
The 5M events/day revisit threshold becomes a tracked metric. The on-call rotation owns the dashboard.

Appears in diff-pairs

adr vs meeting-notes (varies format)
adr vs prd (varies format)
adr vs whitepaper (varies format)

Architecture Decision Record