The 73% problem: why most enterprise AI projects fail before they start

The number nobody wants to talk about

Roughly three quarters of enterprise AI projects fail to deliver against the goals they were funded for. They get scoped, approved, and kicked off with the right people in the room. Then they stall — quietly, expensively, and almost always for the same underlying reason.

The technology gets blamed. The vendor gets blamed. Sometimes the team gets blamed. The real cause sits much earlier in the project, in a decision that was made before a single model was trained.

The core insight: AI projects do not fail because the models are wrong. They fail because the data underneath them was never properly understood. By the time the project hits delivery, that gap is no longer fixable on the original timeline.

What actually goes wrong

The moment most AI projects go off track can be traced back to a much earlier conversation — the one where someone said "we have the data, let's build something with it."

The problem is that having data and understanding data are two different things. Most enterprises run on a stack of operational systems that grew up at different times for different reasons. An ERP. A CRM. A warehouse or operations platform. A finance system. A handful of spreadsheets that nobody wants to admit are load-bearing. Each system works on its own. None of them, taken together, tell a coherent story about the business.

So when an AI initiative kicks off, the team spends the first weeks trying to answer questions that should have been answered years ago. What data do we have? Where does it live? What does this field mean? Why does this customer ID look different in three systems? Is this number trustworthy? Discovery work that should have been a week becomes a month. The model that was supposed to ship in Q2 slides to Q4. The budget gets eaten by people mapping tables instead of building anything.

Why this happens so consistently

The teams running these projects are not unskilled. The technology is not broken. What is missing is a layer that almost no organisation has built deliberately — a governed, current, structured understanding of the data estate itself.

Without that layer, every new initiative starts from zero. Every project re-discovers the same gaps. Every senior data architect spends their first month doing work that should already exist. And every business case is built on assumptions rather than evidence, which is why so many of them collapse the moment they meet production data.

The businesses moving fastest on AI share one trait: they know their data. Not in a vague "we have a data lake" sense, but in a specific, queryable, current sense. They know what they have, where it sits, how clean it is, how the systems connect, and what is safe to build on. Until recently, getting to that state required months of manual work by people who cost a lot. Most organisations could not justify the spend, so they skipped it — and paid for it later when projects stalled.

What this looks like in practice

A common pattern: a logistics or operations business invests in a BI rollout to give leadership visibility across the business. Six months in, the dashboards exist but nobody trusts them. Order counts do not match between operational systems and the finance ledger. Customer names appear three ways. The team building the reports spends more time reconciling than reporting. The project has not failed on paper, but it has stopped delivering anything new.

Or: a financial services firm pilots an AI agent to handle internal queries about policies. The pilot works on a clean test dataset. When it hits the real data, half the answers are wrong because the underlying records contradict each other and nobody flagged it upfront. The pilot gets parked indefinitely.

These are not AI failures. They are foundation failures. The AI worked. The data underneath it was not ready, and nobody knew until the project was already in flight.

Where Sidekick comes in

Sidekick was built specifically to close this gap. It gives an organisation a living, governed understanding of its entire data estate — automatically, continuously, and without moving the data anywhere.

It connects to the systems already in place, with read-only access, and within hours produces what would normally take a senior data team months to assemble: a complete inventory of every data asset, a quality score, a sensitivity classification, a map of how the systems relate to one another, and a list of where the gaps and risks actually sit.

That output is what an AI project needs before it begins, not after it stalls. Knowing the data estate in advance is the difference between a project that ships and a project that joins the seventy-three percent.

The takeaway

AI does not fail because the models are not good enough. It fails because the foundation underneath is not ready, and the foundation is not ready because nobody has built a clear, current picture of what is actually there.

Before the next AI initiative, the question worth asking is not what should we build? It is do we actually understand what we are building it on? If the answer is no, that is where the work starts.

For a deeper look at why this matters, see Why you cannot build AI on data you do not understand.

Ready to get started?

See what Sidekick finds in your data estate

Most organisations are surprised by what a first discovery surfaces. Talk to the Sidekick Lab team about a Proof of Value engagement — scoped, time-bounded, and deployed in your own environment.