Gartner predicts over 40% of agentic AI projects canceled by 2027. The pattern looks a lot like ERP in the 1990s, and the failure mode is the same.
I read the Gartner prediction a few weeks ago and my first reaction was not surprise. It was recognition. Gartner forecasts that more than 40% of agentic AI projects will be canceled by the end of 2027, citing escalating costs, unclear business value, and inadequate risk controls (https://www.gartner.com/en/newsroom/press-releases/2026-04-07-gartner-forecasts-worldwide-it-spending-to-grow-9-8-percent-in-2026). Those three reasons appear in every post-mortem I have ever read about large enterprise software projects. They are not new problems. They are the same problems that killed ERP rollouts in the late 1990s, that stalled CRM implementations in the 2000s, and that turned cloud migration projects into multi-year ordeals throughout the 2010s. The technology changes. The failure mode is remarkably stable.
The same Gartner research also found that 80% of CEOs say AI will force operational capability overhauls in their organizations. So the top of the house is convinced that the transformation is necessary. The people building the projects are canceling them at a rate that, if the forecast is correct, will exceed 40%. The gap between strategic conviction and operational execution is very wide, and that gap is what I want to think about.
McKinsey's State of AI 2025 puts some texture around where organizations actually are (https://www.mckinsey.com/capabilities/quantumblack/our-insights/the-state-of-ai). 88% use AI in some form. Only 7% have fully scaled it. The picture this paints is not one of organizations confidently deploying AI at scale. It is one of organizations running dozens of pilots, a few of which move forward, most of which quietly stall or get canceled. This is what I think of as the pilot trap, and the mechanism has a few distinct layers that are worth separating.
The first layer is selection bias in the pilot itself. Organizations choose their best use case, their cleanest data, and their most motivated team for a pilot. The conditions are favorable by design. When the pilot succeeds, leaders interpret that success as evidence the organization is ready to scale. It is not. It is evidence that a small motivated team could make a model work under good conditions. Those are different things. Scaling requires pushing the technology into less favorable contexts, with messier data, less motivated stakeholders, and less organizational support. The pilot is a best-case scenario. Scaling is an average-case scenario. Treating best-case results as predictions of average-case performance is how organizations end up surprised by a 40% cancellation rate.
The second layer is what I think of as the integration cliff. A pilot runs in isolation. Production runs inside an existing technology stack, with existing data pipelines, existing access controls, existing approval workflows, and existing support infrastructure. The cost of connecting an AI agent to all of that infrastructure, testing it against edge cases, and maintaining it as the surrounding systems change, is almost never part of the pilot budget. When the integration cost arrives, it often exceeds the expected value of the project, and the project gets canceled. Escalating costs, to use Gartner's language, is not a surprise outcome. It is what happens when integration reality meets pilot economics.
The third layer is governance. Agentic AI systems take actions. They do not just generate text for a human to review before acting. When an agent books a flight, updates a customer record, sends a communication, or triggers a payment, someone has to be accountable for what it does. Who owns the agent? Who is responsible when it makes an error? What is the escalation path when it produces an output nobody expected? Most organizations deploying AI pilots have not worked through these questions because pilots are not accountable the same way production systems are. Inadequate risk controls, the third of Gartner's cited reasons, is almost always a governance problem rather than a technical one.
This is where Cohen and Levinthal's (1990) absorptive capacity concept becomes directly relevant. Absorptive capacity is path-dependent: the ability to recognize, assimilate, and exploit new technical knowledge depends on prior related knowledge. An organization that has never built data pipelines at scale, never governed automated decision systems, and never structured accountability around algorithmic outputs does not suddenly develop those capabilities because it approves an agentic AI project. The organizational knowledge that would allow it to absorb and deploy the technology has to exist before the technology arrives, or be deliberately built in parallel with deployment.
Most pilots are not building that capacity. They are demonstrating what is technically possible under favorable conditions, and leaving the hard organizational work for a future budget cycle that may never come. This is why absorptive capacity is path-dependent in exactly the way that matters here: organizations that started building data governance, automated workflow oversight, and algorithmic accountability structures two or three years ago are better positioned to deploy agentic AI successfully today. Organizations that did not build those foundations are now trying to build them while also deploying the technology, under time pressure, with uncertain budgets. That combination is how projects get canceled.
The ERP parallels are specific enough that I want to name them directly. Organizations ran ERP pilots. The demos were compelling. The consultants were confident. The boards approved large budgets. Then the implementations encountered data quality problems, integration complexity, user resistance, and governance gaps the pilots had never surfaced. Many of those projects were canceled after significant investment. The ones that succeeded were not the ones with the best software. They were the ones with the best organizational preparation, the clearest ownership of the change, and the most honest accounting of what the implementation would actually require.
As an IS researcher, what worries me is that the academic literature on IT implementation failure is extensive and has been accumulating for decades. The reasons projects fail are reasonably well understood. The question is not whether we know why projects fail. The question is why that knowledge does not appear to be transferring into better decision-making about AI projects. My hypothesis is that the absorptive capacity problem applies at the level of managerial knowledge as well as technical knowledge. Organizations that have never seriously engaged with IS implementation research do not have the prior related knowledge needed to recognize the warnings when they read a Gartner forecast about 40% cancellation rates. They see the number. They do not connect it to the underlying mechanism. So they proceed with their pilot.
The 40% cancellation prediction will probably prove to be conservative. Not because the technology is bad. Because the organizational conditions for deploying it sustainably are rare, and the pattern of optimistic pilots followed by disappointed post-mortems has been running for decades without the underlying mechanism changing enough to produce a different outcome.
---
claims_checked:
- "40%+ of agentic AI projects canceled by 2027 (escalating costs, unclear business value, inadequate risk controls)": "https://www.gartner.com/en/newsroom/press-releases/2026-04-07-gartner-forecasts-worldwide-it-spending-to-grow-9-8-percent-in-2026"
- "80% of CEOs say AI will force operational capability overhauls": "https://www.gartner.com/en/newsroom/press-releases/2026-04-07-gartner-forecasts-worldwide-it-spending-to-grow-9-8-percent-in-2026"
- "88% of organizations use AI; only 7% fully scaled": "https://www.mckinsey.com/capabilities/quantumblack/our-insights/the-state-of-ai"
claims_unverified:
- "23% scaling agentic AI; 39% experimenting: referenced in prior version from McKinsey; exact figures not re-verified from fetched URL for this rewrite"
- "Prediction that 40% cancellation rate will prove conservative: my own interpretation, clearly framed as opinion"
- "ERP implementation failure rate parallels: based on well-documented historical record, not a single directly cited study"
sources_used:
- "https://www.gartner.com/en/newsroom/press-releases/2026-04-07-gartner-forecasts-worldwide-it-spending-to-grow-9-8-percent-in-2026"
- "https://www.mckinsey.com/capabilities/quantumblack/our-insights/the-state-of-ai"
word_count: 1055
About the author
Share
More notes
Related notes