Category

Last updated: June 2026
If you run a $10M to $200M company and your board has asked for an AI audit before the next budget cycle, the cost of getting this wrong is the entire AI line item: a pilot that stalls at integration, a six-figure platform license burning monthly with no agent in production, and a board next quarter that quietly cuts the AI envelope by half. The IBM 2025 Cost of a Data Breach report found that organizations with high shadow-AI usage carry an extra $670,000 per breach, and 97% of organizations that suffered a breach of an AI model or application lacked proper AI access controls. The audit is what surfaces those gaps before they become a number on a board slide. In this guide, you will get the five-step enterprise AI audit methodology Arkeo runs on mid-market engagements, the deliverables each step produces, who owns what, and the timeline to a go, fix-first, or no-go decision per workflow, so you can answer the board with a credible plan instead of a pitch deck.
According to the Stanford HAI 2025 AI Index, 78% of organizations reported using AI in 2024, up from 55% in 2023, the largest year-over-year jump in the Index's history.
of organizations used AI in 2024, up from 55% in 2023 — the largest year-over-year jump on record.
Source: Stanford HAI 2025 AI Index
Adoption is no longer the question. Whether your specific operation can put a custom agent into production is. Arkeo has spent three years deploying AI agents on its own operations and on mid-market client engagements, and the failure mode that repeats is not model quality. It is an unaudited environment: data the agent needs is locked in three systems that do not talk to each other, the approvals were never designed, and the post-launch owner was never named. The audit is the cheapest way to find that out before you sign a build contract. If you want this run against your own workflows, book a free AI Assessment; Arkeo will audit one workflow end-to-end so you can see if you are ready for custom agents.
Quick Answer
• What it is: A structured five-step diagnostic of one or more candidate workflows, scored across data, integration, governance, and ownership, that ends with a go, fix-first, or no-go decision per workflow.
• Who does it: A senior internal operator (former PMO, COO chief of staff, fractional CTO) with 30 to 50 hours over two to three weeks, or an external firm with production AI deployment experience.
• Timeline: 5 working days for a single-workflow audit; 3 to 4 weeks for a company-wide audit.
• Cost: Internal time only if DIY; mid-market external fees commonly $15K to $50K single-workflow, $40K to $150K company-wide. Arkeo's lead-magnet AI Assessment audits one workflow free.
• Why it matters: The audit is the prerequisite for any custom agent build. Skip it and you pay the breach tax instead.
• Next step: Book a free AI Assessment; Arkeo will run step one against your business in 60 minutes.

An enterprise AI audit is a structured, workload-by-workload inventory of every data source, system integration, workflow rule, governance control, and operating owner that a candidate AI agent depends on, concluded with a go, fix-first, or no-go decision for each workload. It is not a database review. It is an operating review with a data lens, executed by someone who can sit in the same room as the CFO, the COO, and the VP of Engineering and finish a defensible diagnosis they all sign off on.
The audit answers four questions per workflow: can the agent reach the data it needs, is the data clean enough to act on, where does a human have to sign, and who owns this on the Monday after it ships? A workflow that cannot answer all four with operating-grade specifics is not ready for a custom agent, regardless of how the team feels about ChatGPT.
Most mid-market operators conflate the audit with the build. They think hiring a firm to "do an AI audit" means hiring someone to build the agent. That is the mistake the BCG October 2024 research is measuring when it reports that 74% of companies struggle to scale value from AI and only 4% have built leading capabilities. The audit and the build are separate engagements with separate deliverables. The audit produces a decision. The build executes on the decision. Conflating them is how companies end up with a 40-slide deck and no production agent.
Two viable owners, one common failure mode. The audit is performed either by a senior internal operator who can carve out 30 to 50 hours over two to three weeks, or by an external firm that operates AI in production itself. The failure mode is asking a generalist consulting firm with no agent-build experience to audit your operation; the output is a slide deck, not a deployable plan.
The honest filter for an external firm: ask whether they have built the kind of agent the audit would recommend. If they cannot describe, in operating language a CFO understands, the agent they would build for your top workflow, the audit is going to end at the deck. Arkeo performs the audit as the first step of its Assess, Deploy, Manage methodology, which means the audit output is a deployable plan, not a stand-alone report.
INTERNAL OWNER
Former PMO lead, COO chief of staff, or fractional CTO who can carve out 30 to 50 hours over two to three weeks and actually finish. Best when an existing senior person already has political coverage with the CFO and the COO.
EXTERNAL OWNER
An outside firm that operates AI agents in production for itself and for clients. Best when you want a known timeline, an outside voice the board will trust, and an audit that hands the build team a deployable plan instead of a deck.
HYBRID (PREFERRED)
External operator leads the audit, internal operator pairs through every interview and sign-off. The audit lands on schedule and the internal owner ends the engagement able to run the same exercise on the next workflow.
Whichever owner runs the audit, the deliverable is the same: a one-page workflow scorecard the CFO and COO both sign, plus the source inventory, integration map, approval design, and named operator behind it. The audit is the front end of the broader ai readiness assessment work; the assessment is the company-wide rollup of per-workflow audits.
Run step one against your own workflow in 60 minutesArkeo's free AI Assessment scopes one of your workflows, scores its top data sources, and tells you whether the audit will end go, fix-first, or no-go. Let's audit your workflows to see if you're ready for custom agents.
Book Your Free AI Assessment →
Run the audit in five sequential steps. Each step has a clear deliverable, a named owner, and a stop-the-line rule that prevents the team from moving to the next step before the current one is real.
Pick one candidate workflow, not a department and not the company. Name the workflow in operating language: "quote generation for inbound RFQs over $10K," not "sales AI." The scoping output is a one-page workflow brief that names the trigger event, the steps a human takes today, the systems touched, the decision points, the dollar thresholds, and the operator who owns the workflow now. Stop-the-line rule: if the team cannot agree on the workflow boundary in 60 minutes, the workflow is not coherent enough to audit; pick a narrower one.
List every system, document store, inbox, spreadsheet, and human notebook that contains data the workflow uses today. For each digital source, pull a 100-row sample export and score it for completeness, correctness, deduplication, and timeliness. PDFs in a shared drive score zero until they are parsed and structured. Inspection notes in a tradesperson's notebook, customer commitments captured in Slack DMs, pricing exceptions agreed on the phone and never reconciled, those score zero on availability and have to be either captured or accepted as gaps the agent cannot reason over. The deliverable is a source table with availability score, quality score, and integration cost class (cheap, medium, expensive). Stop-the-line rule: a workflow whose top three data sources score below 3 out of 5 on availability is a fix-first, not a go.
For each source the agent needs to read or write, document the access route (API, export, OCR, manual entry), the latency, the cost, and the failure mode. Then design the approval points: where does the agent need a human signature? An invoice over $5K, a customer refund, a contract clause, a quote above an engineering threshold. Approval points must be designed before the build, not after. The deliverable is an integration map plus an approval matrix that lists every decision the agent will make, the dollar or risk threshold that triggers a human in the loop, and the named approver. Stop-the-line rule: any decision the agent will make autonomously must have a documented rollback path; without it, the workflow cannot ship.
Classify the data the workflow touches by sensitivity. Flag regulated fields (PII, PHI, financials, customer contracts). Decide where the workload must run: public cloud, private cloud, or on-premise. This decision drives the build budget, the build timeline, and the security architecture. Arkeo deploys private, on-premise AI in environments where the data cannot leave the building, which is why the risk profile is settled before the build conversation starts, not negotiated mid-build. The deliverable is a risk classification, a deployment shape recommendation, and a list of governance controls the workload will need.
Name the operator who will own the agent the Monday after it ships. Name the on-call schedule, the runbook owner, the metric the agent will be measured on, and the cadence at which the team will review it.
If the answer to “who owns this on Monday” is the consultant, the agent will die in 90 days.
The deliverable is a one-page operator readiness card and the audit's final verdict: go, fix-first, or no-go for this workflow. Stop-the-line rule: no named operator means the audit ends at no-go for that workflow, regardless of how the data and integration scores looked.
Five steps, one workflow, one week. That is the unit of work. A company-wide audit is the same five steps repeated across a prioritized list of workflows, typically 5 to 8 over three to four weeks. The output is the same: a workflow scorecard per candidate, ranked.
An audit that ends at "we should do more discovery" is not an audit. The deliverables are concrete and the CFO can read all of them on a single afternoon.
DELIVERABLES
Each is concrete, defensible, and built from samples instead of opinions.
01
One-page description of the candidate workflow: trigger, steps, systems, decisions, dollar thresholds, current owner. Written in operating language.
02
Every system, document store, inbox, spreadsheet, and notebook scored for availability, quality, and integration cost class. Sample sizes, not anecdote.
03
Access route, latency, cost, and failure mode for each source the agent reads or writes. The build team uses this directly.
04
Every decision the agent will make, the threshold that triggers a human in the loop, and the named approver. Designed pre-build, not patched post-launch.
05
Data classification, regulated field map, and the recommendation for public cloud, private cloud, or on-premise. Sets the budget and architecture.
06
Named owner, on-call schedule, runbook owner, the metric the agent is measured on, and the audit's verdict: go, fix-first, or no-go.
Six artifacts, one decision per workflow. The build team executes against these without re-auditing.
The reason these deliverables matter is that they map directly onto the NIST AI Risk Management Framework 1.0, the US government's reference standard for AI trustworthiness. NIST organizes its work around four functions: Govern, Map, Measure, Manage. The workflow brief and the operator card map to Govern. The source table and integration map map to Map. The risk profile and approval matrix map to Measure. The deployment shape and operator card map to Manage. Mid-market operators do not need to implement NIST line by line. They do need a deliverable trail that maps to it, because regulators, insurers, and enterprise customers are going to ask about it inside the next two budget cycles.
The audit lives inside the larger ai readiness question, which Arkeo treats through a five-stage workload-anchored ai maturity model: Ad Hoc, Aware, Active, Operating, Embedded. A company can be Stage 1 in finance, Stage 3 in customer support, and looking at a Stage 4 deployment in sales, all on the same Monday. The audit is what tells you which stage each workload sits at today and what the shortest path is to Stage 3 in any one of them.
The PwC AI Agent Survey of 300 senior US executives in May 2025 reports that 79% of US businesses say AI agents are already being adopted and 88% of executives plan to increase AI-related budgets in the next 12 months. The budget is coming. Whether that budget produces working agents or stalled pilots is what the audit decides. The Deloitte State of Generative AI Wave 4 survey of 2,773 C-suite and director-level leaders across 14 countries found that more than two-thirds of enterprise respondents expect 30% or fewer of their GenAI experiments to be fully scaled within the next three to six months. That is the unaudited cohort.
The IBM IBV CEO Study of 2,000 CEOs across 33 countries adds the people side: 54% of CEOs are already hiring for AI roles that did not exist a year ago, 31% of the workforce will require retraining or reskilling over the next three years, and lack of expertise is cited as the top barrier to AI innovation. Step 5 of the audit, operator readiness, is where that statistic stops being a survey number and starts being a named person on your org chart.
Four failure modes recur across mid-market audits, none of them technology problems.
FAILURE MODES
Each is a decision the audit owner makes in week one, not a discovery in week six.
01
The team picks "sales AI" instead of one workflow. Step 1 stalls; the audit ends at a slide deck. Fix: name the workflow in operating language with a trigger and a dollar threshold.
02
The team scores data quality by asking the workflow owner. Real quality scores require a 100-row sample export, not an opinion. Without samples, the audit is theatre.
03
The team plans to "figure out approvals during the build." Approval points are workflow rules, not UX choices. Skipping them is how a quoting agent autonomously sends a $200K quote.
04
The audit ends with a verdict but no Monday-morning owner. The agent ships, the consultant leaves, no one runs the runbook. 90 days later the agent is unplugged.
None of these are technology problems. They are operating-discipline problems wearing AI costumes.
The blunt truth is that most "AI strategy" engagements deliver a 40-slide deck and disappear before any of these failure modes are caught. That is the failure mode the 74% value-gap figure from BCG's October 2024 research is measuring. Arkeo has been in business for 25 years operating real companies before deploying AI agents on top of them, which is why the audit looks like operations work: process maps, data sources, approval routing, owner names, on-call schedules. We use what we sell, which means the audit we run on a client engagement is the same audit we ran before deploying agents on Arkeo itself.
The honest cost and timeline ranges in the mid-market: a single-workflow audit takes 5 working days and lands in the $15K to $50K range with an external firm; a company-wide audit takes 3 to 4 weeks and lands in the $40K to $150K range, depending on scope and sensitivity. DIY costs internal time only but routinely stalls at week six because the operator gets pulled to their actual job. Arkeo's lead-magnet AI Assessment audits one workflow free as a 60-minute structured engagement; the paid Consult extends the audit across the company. Build and Manage phases follow only if the audit says go.
For comparison, a scoped single-workflow agent build runs $15K to $40K and 6 to 10 weeks to production, 8 to 12 weeks when the deployment is private or on-premise. Off-the-shelf copilots like Microsoft 365 Copilot or ChatGPT Enterprise are roughly $20 to $30 per user per month and live in days. The first quick win typically lands inside 30 to 90 days. Those are operator ranges from Arkeo's own builds, not sourced benchmarks. The audit is the lowest-cost, highest-leverage step in the sequence: it is the one that decides whether the build money is well spent.
The clean handoff: the workflow brief, source table, integration map, approval matrix, risk and deployment shape, and operator card become the build team's specification. The build team does not re-audit. The build team executes against the audit's deliverables. If the build team needs to re-audit, the audit was not done.
The handoff is also when the cluster boundary matters. Readiness owns the current state. Strategy owns the future state. ROI owns the financial justification. The audit's go decision feeds the strategy team's sequencing question (which agents in what order, over what timeline) and the ROI team's business case (which agents pay back first). Skipping readiness is how mid-market companies end up with a beautiful roadmap that crashes at month three when nobody can find the data the month-three milestone depends on.
Audit your top workflow in 60 minutesArkeo's free AI Assessment runs step one of the audit against one of your workflows, scores its top data sources, and gives you a go, fix-first, or no-go preview you can take to the board.
Book Your Free AI Assessment →
Apply for the free AI Assessment. In 60 minutes you walk away with a 12-month plan tailored to your business. No software demo. No obligation.
Free Planning Session →