Category

12-Month AI Roadmap for Mid-Market Operators

June 5, 2026

12-month AI roadmap timeline showing four quarters Assess Pilot Deploy Scale with gate questions

Last updated: June 4, 2026

If you are three weeks out from a board meeting and somebody on the slate expects a 12-month AI roadmap that holds up to a CFO line-by-line review, this guide is the template. Generic seven-swim-lane Gantt charts and three-horizons decks do not survive the first finance question; what survives is four sequenced quarters, one gate question per quarter, and one board metric per quarter that says whether the quarter actually shipped. In this guide, you'll get the Assess-Pilot-Deploy-Scale calendar, the gate questions that halt the next quarter when they fail, the budget shape across all four, and the one-page board deliverable that ships at each quarter end.

The PwC AI Agent Survey of 300 senior US executives found 88 percent of executives raising AI budgets in the next twelve months; boards know the money is moving, and what they want from the operator is a calendar of decisions, not a calendar of meetings. Before you put dates next to the four quarters, pressure-test the assumptions with a free AI Assessment that names the one workflow worth taking to production first.

Quick Answer
What it is: A 12-month AI roadmap is four sequenced quarters (Assess, Pilot, Deploy, Scale) with one gate question and one board metric per quarter.
Cost shape: $50K to $200K in year one for a mid-market business who finishes all four quarters, weighted toward Q2 build and Q4 expansion.
Why it matters: A roadmap that does not ship a gate question every quarter becomes a wish list. Board approval depends on the gates, not the Gantt.

Why do generic Gantt charts fail as AI roadmaps?

A generic Gantt chart is a calendar of tasks; an AI roadmap is a calendar of decisions. The Gantt promises that work will happen on certain dates. The roadmap promises that, by the end of each quarter, one specific question will be answered in writing and one specific board metric will move. That is the difference between a plan that survives the first finance review and a plan that becomes a wish list by mid-Q2.

The pressure is real. The Stanford HAI 2025 AI Index reports 78 percent of organizations used AI in 2024, up from 55 percent the year before, and US private AI investment reached $109.1 billion. The capital is in motion. What boards want from the operator is a calendar of decisions, not a calendar of meetings.

The cluster on enterprise AI strategy covers the methodology that sits behind the roadmap; the dedicated 90-day AI implementation plan walks the first quarter at day-level resolution; sequencing your AI implementation roadmap covers the order-of-operations logic for which workflow goes first. This piece is the twelve-month calendar that wraps them.

What are the four quarters of a 12-month AI roadmap?

The four quarters are Assess, Pilot, Deploy, and Scale. Each quarter has a deliverable, a gate question that has to be answered in writing before the next quarter starts, and a single board metric that tells you whether the quarter shipped. Skip the gate, and the next quarter inherits a problem the budget cannot solve.

THE ROADMAP

The four quarters

Each quarter ships a deliverable, a gate question, and a board metric. Skip the gate, and the next quarter inherits a problem.

Q1

ASSESS

Current-state diagnostic, a shortlist of three to five candidate workflows, and a data sovereignty decision (private versus hosted) signed off by legal and security.

Gate question: Which one workflow do we take to pilot, and where does its data live?

Board metric: Workflow chosen, named owner, data path approved in writing.

Q2

PILOT

First agent built in a controlled environment against a real document set, with an integration plan into the production system already drafted.

Gate question: Does the agent beat the manual baseline on the two metrics that matter, on real data?

Board metric: Measured time or error delta versus the baseline, plus a go or no-go decision.

Q3

DEPLOY

Agent in production with human-in-the-loop review on the high-risk steps, governance live, on-call rotation defined, monitoring against the gate metrics.

Gate question: Who runs this on a Monday morning when it drifts, and what is the escalation path?

Board metric: Volume of work executed by the agent and human override rate.

Q4

SCALE

Workflows two and three built on the same operating pattern, same data path, same governance, reusing the security review from the first deployment.

Gate question: Did the second and third workflows ship faster than the first, because the pattern held?

Board metric: Number of workflows in production and average time-to-deploy per workflow.

Each quarter ships a gate question, not just a project.

The Q1 work is the most easily under-budgeted and the most consequential. Picture a 200-person regional logistics operator running on a legacy TMS, three Excel-based ops dashboards, and a customer-service team that lives inside a shared inbox. The Q1 deliverable is not a slide deck. It is a one-page inventory of the candidate workflows (claims triage, invoice exception handling, driver-debrief summarization), a documented shortlist of three, and a data sovereignty decision that says "the driver-debrief audio and the claims documents stay inside our infrastructure, processed by a private model." That decision drives every contract, every security review, and every architecture choice in Q2 and Q3. Made up front, it costs a quarter; made in week 30, it costs a year.

The Q2 pilot is built in a controlled environment against real documents, not synthetic data. In Arkeo's build experience, a scoped single-workflow agent runs about $15,000 to $40,000 and reaches production in 6 to 10 weeks, or 8 to 12 weeks when the deployment is private or enterprise-grade. Off-the-shelf copilots come in at roughly $20 to $30 per user per month and go live in days. The first measurable quick win lands in 30 to 90 days. The PwC AI Agent Survey found 66 percent of agent adopters reporting measurable productivity gains; the gain is real, but it is conditional on the Q1 gate having been answered honestly.

What halts progression from one quarter to the next?

The roadmap is gated by evidence, not by calendar. A quarter does not graduate because the date arrived; it graduates because the gate question has a written answer that the board can read and the operating team can act on.

Q1 will not graduate to Q2 if the data sovereignty decision is still open. Wiring a pilot to a public model "just for the pilot" while the contract review is in flight buys a security re-platform later and loses a full quarter. BCG's Where's the Value in AI? (October 2024) found 74 percent of companies struggle to capture value from AI, and the most common quiet reason is a Q1 gate that was waved through to keep the calendar moving.

Q2 will not graduate to Q3 if the pilot did not beat the manual baseline on real data. The honest move is to halt the pilot under the kill criteria, return to the shortlist, and pick the second-best candidate. Sunk cost is not a reason to deploy a workflow that did not clear the gate; it is the reason most pilots end up in pilot purgatory. The Deloitte State of Generative AI Wave 4 study of 2,773 C-suite respondents found more than two-thirds expect 30 percent or fewer of their generative AI experiments to scale within three to six months. Treating Q2 as a portfolio that prunes, not a project that ships, inverts that ratio.

Q3 will not graduate to Q4 if there is no named operator on a Monday morning. Picture a regional professional services firm that ran a strong Q2 pilot on a contract-summarization agent, then handed Q3 to the same project manager who shepherded the build. The agent went live, then drifted three weeks in, and nobody had a Monday-morning protocol for catching it. The override rate climbed quietly and the metric did not surface until the next board meeting. Production without an on-call rotation is a demo that has been left running. The IBM IBV CEO Study of 2,000 CEOs across 33 countries found 31 percent of the workforce will need retraining or reskilling in three years, and 54 percent of CEOs are already hiring for AI roles; the operator is part of that workforce, named ahead of go-live, not after.

Populate Q1 with a real diagnostic, not a slide template

The free AI Assessment names the one workflow worth piloting first, the data sovereignty decision behind it, and the operating model that will carry it from Q2 build into Q3 production.

Book Your Free AI Assessment →

What is the budget shape across the 12 months?

The budget shape across the four quarters is not flat. Q1 is the smallest line and the highest leverage; Q2 carries the build cost; Q3 is operations and integration; Q4 funds two more workflows on the same pattern. Misreading the shape (front-loading Q1 with $200K of "strategy work" or back-loading Q4 because the executive sponsor expects pricing to fall) is the single most common reason year-one totals come in at twice the credible estimate without producing more deployed workflows.

THE BUDGET SHAPE

Spend pattern across the year

Year-one totals range from $50K to $200K for a mid-market business who finishes all four quarters.

Q1

INVEST

A scoped assessment that names workflows, data path, and owner, plus a small off-the-shelf copilot rollout to give the team an honest feel for the technology.

Typical spend: Diagnostic plus $20 to $30 per user per month for a copilot pilot.

Q2

BUILD

First custom workflow agent built against real data, on the data path Q1 approved, with the integration plan drafted before the build kicks off.

Typical spend: $15K to $40K per scoped workflow agent, 6 to 10 weeks (8 to 12 if private).

Q3

OPERATE

Production deployment with human-in-the-loop, monitoring, on-call rotation, plus the integration work into the system the agent feeds or is fed by.

Typical spend: Run-rate operations cost plus integration work scoped against the existing stack.

Q4

EXPAND

Workflows two and three built on the same pattern, reusing the data path and the security review, so the per-workflow cost trends down across the year.

Typical spend: Two more workflow builds at the Q2 unit cost, with reduced security and integration overhead.

$50K to $200K year one for a mid-market business who finishes all four quarters.

The shape implies a tactical move that most operators miss in Q1: keep the assessment spend honest and use the savings to fund a small copilot deployment in the same quarter. The copilot is not the destination, but it gives the executive team a felt sense of the technology before the Q2 build kicks off, which dramatically improves the quality of the Q1 gate decision. Arkeo's own operating pattern, "we use what we sell," is built on this principle: the team forms judgment by running production agents, not by reading product brochures.

What does a board-reporting deliverable look like at each quarter end?

The board does not want a sixty-slide deck four times a year. The board wants the gate question and the gate answer, on one page, with the metric that supports it. Q1 ships a workflow choice, an owner, and a data path. Q2 ships a baseline-versus-pilot comparison and a go or no-go. Q3 ships a production-volume number and a human-override rate. Q4 ships the count of workflows in production and the trend in time-to-deploy.

The Arkeo Assess, Deploy, Manage model maps to the back three quarters cleanly: Q1 and the workflow-choice work sit inside Assess; Q2 and Q3 are Deploy; Q4 and steady-state operations are Manage. The build pattern is the same across all three. Data never leaves the building when the data sovereignty decision was made for private, which is the common path for regulated industries, professional services with sensitive client data, and any operator who already runs on-premise core systems. The current-state inventory work that feeds Q1 belongs to the AI readiness assessment workstream; the workflow-sequencing logic that determines the order of Q2, Q3, and Q4 candidates is treated in detail in the spoke on sequencing the implementation roadmap referenced above.

Walk into the board meeting with the four quarters answered

The free AI Assessment populates Q1 with a real current-state diagnostic, a shortlisted workflow, and the gate questions for Q2 through Q4, ready to put on the board document.

Book Your Free AI Assessment →

Frequently Asked Questions

What is an AI roadmap?

An AI roadmap is a sequenced calendar of decisions and deliverables that takes an organization from scattered AI interest to deployed AI workflows in production, with a gate question and a board metric at each milestone. For a mid-market business, the practical form is four quarters: Assess (current state and workflow choice), Pilot (first agent built and tested on real data), Deploy (production with human-in-the-loop and governance), and Scale (workflows two and three on the same pattern). The roadmap differs from a project Gantt because it commits to which questions will be answered by which dates, not to which tasks will be in flight on which dates.

How does a mid-market business build a 12-month AI roadmap?

A mid-market business builds a 12-month AI roadmap by committing to four quarters, each with one gate question and one board metric. Q1 selects the first workflow and resolves data sovereignty. Q2 builds the agent in a controlled environment against real data. Q3 takes it into production with human-in-the-loop review and a named operator. Q4 ships workflows two and three on the same pattern. The work that does not belong in this roadmap (current-state audits at depth, financial ROI math, multi-year horizon planning) is treated in dedicated workstreams so the calendar stays focused on what ships in twelve months.

What are the phases of an AI roadmap?

For a 12-month AI roadmap aimed at a mid-market operator, the four phases are Assess, Pilot, Deploy, and Scale. Assess produces the workflow shortlist and the data sovereignty decision. Pilot builds the first agent in a controlled environment and tests it against the manual baseline on real data. Deploy moves the agent into production under human-in-the-loop review, with a named operator and an on-call escalation path. Scale extends the same operating pattern to workflows two and three, reusing the security review and the data path so each new workflow ships faster than the one before it.

What should be in a quarterly AI roadmap update to the board?

A quarterly AI roadmap update to the board is one page. It states the gate question for the quarter, the written answer to that question, the single board metric that supports the answer, and the gate question for the next quarter. For Q1 that is workflow choice, named owner, and data path. For Q2 it is the baseline-versus-pilot comparison and a go or no-go decision. For Q3 it is production volume and the human-override rate. For Q4 it is workflows in production and the trend in time-to-deploy. A sixty-slide deck is not a roadmap update; it is a substitute for one.

What is the difference between an AI roadmap and an AI strategy?

An AI strategy is the methodology and the positioning that explain why a business is investing in AI and where the value is expected to come from. An AI roadmap is the calendar that turns that strategy into deployed workflows. Strategy answers the question "why this, why now?"; the roadmap answers the question "what ships this quarter, and how will the board know?" A strategy without a roadmap is a slide deck; a roadmap without a strategy is a Gantt chart. Mid-market operators need both, but the roadmap is what gets approved and funded at the board.

Category

Ready to Own Your AI?

Apply for the free AI Assessment. In 60 minutes you walk away with a 12-month plan tailored to your business. No software demo. No obligation.

Free Planning Session →