The 5 actions AI leaders take to get value from agents

Leading firms are already turning AI agents into revenue and lower cost.

The value from AI agents is real, and a small group of firms are pulling ahead. For everyone else, the question is no longer whether agents work. It is why they are not seeing a return.

1.7× revenue growth, 40% more cost savings

the gap BCG found between AI leaders and laggards.

BCG, 2025

95% are leaving that money on the table

only 1 firm in 20 captures AI value at scale. The rest are stuck.

BCG

Money spent, no return

74% have already pulled a live, customer-facing AI agent back out of production, after a governance failure.

Sinch, 2026

What goes wrong, and what to do instead.

Across the AI programmes we see, the value rarely stalls on the model. It stalls on five decisions made around it. Here is each one, in plain terms, with what the firms getting it right do instead.

1What goes wrong

They choose the agent on a demo, not a business case

The agent is picked because it demos well, before anyone has worked out what it saves, what it costs to run, or what it is worth. It then costs more to run than the budget expected, and because it does not behave the same way every time, it needs rounds of testing the plan never allowed for. The spend overruns, and the board loses patience.

What good looks like

Size the value before you commit a pound

Work out the value in real numbers: how often the work happens, and the time and staff cost it removes, set against what it will cost to run, the model, the integration and the monitoring. Decide how much risk you are willing to carry, and make a clear go or no-go call before the spend scales up.

2What goes wrong

They leave the controls until after launch

It goes live in a regulated process with no independent checks around it. The first time it makes something up, it reaches a customer or a contract before anyone notices. A second slip follows, and the review that lands puts the whole AI investment under board scrutiny.

What good looks like

Build the controls in from the first sprint

Put the checks in code, sitting outside the model so it cannot talk its way past them, from the first sprint: every answer checked against its real source, a complete record of everything it read and did, and a gate that blocks an unsafe action before it happens.

3What goes wrong

They treat a prototype as if it were production

It passed a controlled demo, so it shipped. Real use showed what the demo could not: it was not reliable enough, it cost far more to run than the estimate, and it could be tricked into leaking data or taking the wrong action. With no way to absorb that, the team rolled it back, and leadership lost confidence again.

What good looks like

Earn the right to go live, in steps

Test it against a fixed set of real, scored examples. Then run it quietly alongside the people doing the work, then in a small live pilot, then fully. At each step it has to clear a set bar for accuracy, cost and safety, signed off by someone whose job is to find the holes, and it moves on evidence, not on a date.

4What goes wrong

They never engage the people whose work changes

The people whose work the agent changes are not consulted, trained, or told how their jobs will change. They start using their own unapproved tools, they quietly work around the agent, and trust never forms. An agent that works technically ends up barely used.

What good looks like

Bring your people with you, deliberately

Treat bringing people with you as real work from the start. Map who is affected and the new jobs the system creates, like running the agent, checking its quality, and reviewing its records. Expect the normal reaction to new AI, from early worry, through frustration, to settling into trust, and bring the doubters in rather than working around them.

5What goes wrong

They launch it, then leave it alone

No one owns the system once it is live. There is no routine to catch new ways it can fail, no way to stop it safely when it does, and no check on whether it still pays its way. It quietly drifts, and within months the agents are switched off and the manual work comes back.

What good looks like

Run it as a living system, with an owner

Give the system a named owner and a simple loop: watch what happens in production, catch the failures, add a check for each one, and adjust the agent. Keep a growing list of the ways it can fail, a way to stop it or fall back safely, and a regular review that grows, shrinks, or retires each agent on the evidence.

The good news: not one of these is a limit of the technology. Each is a decision, and there is a clear, practical method for getting it right.

Getting AI agents right is five stages of work.

The five things above are five stages, run in order. The leaders work through all five. The firms that miss out skip or rush some of them.

Stage 1

Decide where an agent adds value, where code is enough, and what the work is worthRun the candidate through the six questions: what it does, the decisions it makes, the systems and data it touches, and the cost of getting one wrong. That sets how much it should be allowed to do on its own, from only recommending actions to acting within set rules, and shows where plain code is the safer answer. Then size the return: how often it runs and the time and staff cost it saves, against what it costs to run, with a clear go or no-go call before the spend scales up.

Stage 2

Design an agentic system you can governDecide what the system may do on its own, where a person has to approve, and what data each agent can reach. Checks in code that sit outside the model, a gate that blocks an unsafe action before it happens, a complete record of everything it does, and clear ways to escalate are all designed in from the first sprint, not bolted on after launch.

Stage 3

Engineer it to be reliable, secure, and affordable under real loadTest it against a fixed set of real, scored examples, check every answer against its real source, and close the problems that only show up at scale: wrong answers, cost that runs away, and being tricked by malicious input. The right to go live is earned in steps, run quietly alongside people first, then a small pilot, then fully, on evidence and not a date.

Stage 4

Bring your people with youMap how each role changes and the new jobs the system creates: running the agent, checking its quality, reviewing its records. Plan for the normal reaction every team has to new AI, from early worry, through frustration, to settling into trust, so you head off the unapproved tools and quiet workarounds that follow when people are left out.

Stage 5

Operate the system, and keep it improvingGive the system a named owner, and run a simple loop: watch what happens in production, catch the failures, add a check for each one, and adjust the agent. Keep a growing list of the ways it can fail, a way to stop it or fall back safely, and a regular review that grows, shrinks, or retires each agent on the evidence.

Serpin has practical methods for every stage.

For each stage there is a clear method and a tool. Our Agent Discovery and Design framework runs across all of them, end to end. It gives product, engineering and change teams the knowledge and tools to deliver AI that is reliable, secure, and worth the money.

Stage 1

Decide where an agent fits, and size it

Agent Playbook

Value Radar

Stage 2

Design a governable agentic system

Agentic Operating Model

Stage 3

Engineer it for production

Bounded Agency

Seven Patterns

Stage 4

Bring your people with you

Adoption Engine

Stage 5

Operate the system, keep improving

Adoption Engine

Agent Discovery & Design, our end-to-end method for product, engineering and change practitioners implementing AI

The methods and IP behind each stage

Agent PlaybookWhich AI agent to build first, and how to scope it.Read it →

Seven PatternsHow Serpin stops AI agents making things up.Read it →

Bounded AgencyOur governance framework for running AI agents safely in regulated work.Read the framework →

Value RadarSizing the value and running cost of an agent before you build.Coming soon

Agentic Operating ModelDesigning the process, governance and roles around your agents.Coming soon

Adoption EngineBringing your people with you and keeping agents improving.Coming soon

AI leaders drive value with these 5 actions

Want help getting this right?

Leading firms are already turning AI agents into revenue and lower cost.

Five questions if you're planning an AI agent.

What goes wrong, and what to do instead.

They choose the agent on a demo, not a business case

Size the value before you commit a pound

They leave the controls until after launch

Build the controls in from the first sprint

They treat a prototype as if it were production

Earn the right to go live, in steps

They never engage the people whose work changes

Bring your people with you, deliberately

They launch it, then leave it alone

Run it as a living system, with an owner

Getting AI agents right is five stages of work.

Serpin has practical methods for every stage.

The methods and IP behind each stage

Not sure what is holding your AI programme back?

Three things you can do this week.

Want to work through your own programme?