The three-year reference architecture for an agentic professional services firm.
The closing piece of the SALT insight series. A working blueprint — workflow surface, agent inventory, auth layer, governance gates, talent overlay, contracting model — that operators can mark themselves against. Every prior thread compounds here.
This is the closing piece of the SALT insight series. The previous twenty-three pieces argued positions — Constrained Agency as the operating discipline, the auth layer as the moat, the 75% rule on operating-model spend, the Microsoft estate as the default stack, the manufacturing 2030 causal chain, the hybrid-by-regulation healthcare ceiling, the billable hour as a deflating commodity, agents as systems rather than teammates. This piece pulls those threads into a single working blueprint for one operator class — the professional services firm in the 100–2,000 person band — across a three-year horizon. The architecture is opinionated, dated, and explicitly falsifiable.
The thesis is one sentence. An agentic professional services firm is built in three sequenced years: Year 1 lays the platform foundation, Year 2 builds the constraint stack, Year 3 migrates the contracting model and captures the margin lift. Firms that compress the sequence fail; firms that follow it compound. The reference architecture below is what each year actually contains, what milestones the firm should hit, what failure modes to guard against, and what falsifiable indicators grade the result.
Seven layers of an agentic firm.
Before walking the three-year sequence, name the seven layers. Every agentic firm runs on these, whether the firm has built them deliberately or accumulated them by accident. The reference architecture builds them in deliberate order.
Foundation.
Year 1 is the foundation year. The firm picks the substrate, the partner, and the first three pilots. The temptation is to skip Year 1 and start with the high-margin work in Year 3; firms that try this consistently fail because the substrate gaps surface six months in and force a rework. Year 1 is unglamorous, predictable, and load-bearing.
Platform decision. If the firm is Microsoft-aligned (which the under-500 band almost always is), default to Microsoft 365 + Copilot Studio + AI Foundry + Entra Agent ID. End the build-vs-buy debate. Reinvest the saved cycles into the operating-model layer.
MSP partnership. Pick a Microsoft-aligned MSP that ships outcome-priced delivery. Sign a 12-month engagement with three named pilots. The MSP brings the constraint engineering bench the firm cannot hire internally.
Three bounded pilots. Each pilot ships in <90 days, with a named partner owner, <$200k all-in, and a defined utilization or margin target. No 18-month roadmaps.
- Milestone · Month 4: Microsoft estate baseline complete (Entra ID for human IAM, Purview lineage, Microsoft 365 + Dynamics deployed at firm scale).
- Milestone · Month 6: First pilot in production. Named partner owner. Telemetry visible in real time.
- Milestone · Month 9: Second pilot live. Cross-pilot pattern recognition starting (what works, what doesn’t).
- Milestone · Month 12: Third pilot live. Constraint engineering hire (the first internal one) onboarded.
The constraint stack.
Year 2 is where most firms lose the plot. The Year 1 pilots produced wins, the partner team is excited, and the temptation is to scale aggressively and skip the unsexy infrastructure work. Skipping the constraint stack is the dominant Year 2 failure mode. Firms that scale before they build the auth, governance, and talent layers produce a wider footprint of agents that all share the same brittle foundation. When a single agent misbehaves, the blast radius is the entire footprint.
Auth layer (L3). Migrate every agent in production onto Entra Agent ID with the four primitives — ephemeral, intent-bounded, delegation-chained, machine-introspectable. Retire over-scoped service accounts. The audit chain has to point to a named human delegator for every agent action by end of Year 2.
Governance & gating (L4). Stand up the non-delegable decision register. Define what agents are structurally prohibited from doing without human signoff. Build the machine-readable policy that agents query before acting. The governance gate is not a slide; it is software running in production.
Talent overlay (L5). Hire 2–3 constraint engineers (or develop internally from your strongest senior associates). Hire 1–2 agent managers per practice area. Train every partner on the agent management discipline — agents as systems with delegated authority, not teammates.
- Milestone · Month 15: Entra Agent ID production rollout complete.
- Milestone · Month 18: Governance gate live. First non-delegable decision register published. First agent action successfully blocked by the gate.
- Milestone · Month 21: Constraint engineering team at five FTEs. Agent managers named in every practice area.
- Milestone · Month 24: Twenty agents in production, all on the constraint stack. Cross-practice pattern compounding.
Outcome scale.
Year 3 is where the strategic payoff arrives. The substrate is laid, the constraint stack is built, the talent is in place. The firm migrates the contracting model from hourly to outcome-based on the service lines where the constraint stack supports it, captures the margin lift the agentic decade offers to outcome-priced firms, and consolidates the firm’s competitive position for the next decade. Firms that arrive at Year 3 having built Years 1–2 properly compound. Firms that arrive having skipped layers find the contracting model migration produces revenue volatility and partner churn.
Outcome-based pricing migration (L7). Pick three service lines for the migration. Move from hourly to fixed-fee on the pilot service lines. Measure margin lift quarterly. By end of Year 3, target >30% of revenue from outcome-priced engagements.
Operating-model lock-in (L6). Tie partner compensation to outcome-priced revenue, not hourly billings. Restructure the project margin reporting in the PSA to support real-time outcome-priced delivery. The compensation system is the contracting model’s real enforcer.
Industry-specific compounding. The constraint stack and the talent overlay now compound across client engagements. Agents built once for one client’s use case get redeployed at the next client at a fraction of the original build cost. The firm’s margin profile begins to look more like a platform business than a services business.
- Milestone · Month 30: First three service lines migrated to outcome-based pricing. Margin lift measurable.
- Milestone · Month 33: Partner compensation tied to outcome-priced revenue across the firm.
- Milestone · Month 36: >30% revenue outcome-priced. Per-engagement margin >1.5× the hourly baseline. Firm-specific agent library compounding across client deliveries.
The strongest argument against this architecture.
The strongest counter is that three years is too long — the agentic landscape will shift materially over the architecture’s horizon, and a firm that commits today to a 36-month plan locks in choices that will be wrong by Month 24. This is partially right. The architecture is not a 36-month commitment to specific products; it is a 36-month commitment to a sequence — substrate, constraint stack, contracting model — that is structurally robust to product churn underneath. If Microsoft Entra Agent ID is replaced by something better in 2027, the firm migrates the auth layer without redesigning the architecture. If the dominant agent runtime changes, L2 swaps without affecting L3–L7. The architecture is opinionated about sequence, not about specific vendors. The vendor decisions can be revisited as the market evolves; the sequence cannot be skipped.
A second counter is that three years is too short — the operating-model layer requires deeper change than 36 months allows for the typical firm’s political dynamics. This is right for some firms. For those firms, the architecture is a five-year program rather than a three-year program, with Year 3 stretched into Years 3–5. The sequence still holds; the calendar accommodates the firm’s actual change capacity.
Three things to do this quarter.
01 · Place yourself on the architecture today. Walk the seven layers. Mark which are built, which are partial, which don’t exist. Most firms in the band will find L1 partial, L2 piloting, L3–L7 absent. That is the honest baseline. Stop comparing to vendor reference architectures; compare to the seven layers above.
02 · Commit to the three-year sequence explicitly, with the partner group, in writing. Year 1 is the substrate. Year 2 is the constraint stack. Year 3 is the contracting model. The commitment matters because compression is the dominant failure mode, and partner-level discipline is what prevents compression.
03 · Hire the first constraint engineer in the next 90 days. Whatever the partner group decides about platform, MSP, pilots — none of it works without the constraint engineer. Make this hire before any agentic deployment goes live, not after.
SALT’s position-review rhythm grades published positions against subsequent reality. This piece carries two predictions on the architecture itself.
The series ends here.
This is the twenty-fourth piece of the SALT insight series. Across the four pillars — Foundation, Manufacturing, Healthcare, Professional Services — the throughline has been the same: Autonomous Business is the destination, Constrained Agency is the operating discipline that gets you there, the Microsoft estate is the default substrate, and the operating-model layer is where the actual work compounds. The reference architecture in this piece is the working version of the thesis for the firm class SALT serves directly.
The shelf is now twenty-four pieces. Each carries a falsifiable position. Each will be graded against subsequent reality on the standing position-review rhythm. Where positions falsify, SALT publishes the correction explicitly. The intellectual estate is built; the work of operating against it begins.
Now go build.