Why Chatbots Are the Wrong Benchmark for the Agentic User Experience

10 January 2026

Why Chatbots Are the Wrong Benchmark for the Agentic User Experience

FeaturedDesigning AI Experiences

# Why Chatbots Are the Wrong Benchmark for the Agentic User Experience

Most organisations deploy AI in a chat window because it feels familiar. But measuring AI success by how well it chats is the wrong test entirely. Your staff and customers don't want a smoother conversation. They want a trusted answer and fewer steps to get there.

The problem with the chatbot frame

When you benchmark AI on conversational quality — how natural it sounds, how quickly it responds — you end up optimising for the wrong thing. The result is an AI that is pleasant to talk to but doesn't actually reduce the effort your team or customers have to put in.

Real value comes from removing steps, not adding a new interface that people have to learn alongside the systems they already use every day.

What good actually looks like

Think about a frontline service rep handling a billing enquiry. The right AI experience doesn't open a chat window and ask them to type a question. It surfaces the relevant policy, flags the next action, and saves them two minutes per call — without them noticing the AI at all.

That is the benchmark worth chasing: how much effort did we remove? How many calls to the help desk didn't happen? How many emails didn't need to be written?

From answering to resolving

There is a meaningful difference between an AI that tells someone what to do next and one that actually does it. For complex questions — ones that involve multiple policies, multiple sources, or multiple conditions — the best outcome is an answer that requires no follow-up.

That means the AI does the legwork: finds the relevant content, checks for conflicting information, and returns something the person can act on immediately. Fewer handoffs. Fewer clicks. A faster path to resolution.

The best AI experience is one your people don't notice — because the answer arrived before they finished wondering.

The bottom line

The organisations getting the most from AI right now are not building better chatbots. They are reducing the number of things their people have to do to reach a resolution.

That is a business decision. Not a technology decision.

It also requires systems designed to deliver trusted answers from the right sources — not generic conversational AI layered on top of complex knowledge bases.