Category: Uncategorized

  • The Gap Between AI and Us.

    AI is getting better. The experience is getting worse.

    Every week, another AI company announces a better model.

    More reasoning. More context. Better coding. Better agents. Better benchmarks. On paper, the progress is real. The capability curve is moving faster than most of us can keep up with.

    In daily use, it feels different.

    I use these tools every day. For Finz. For writing. For research. For decisions I am working through. I have a clear view of where the models are now strong, and where they are not. The strong list is genuinely impressive. The models are writing production code. They are running deep research across hundreds of sources. They are handling work that needed a team of engineers a year ago.

    The weak list is harder to talk about, because it does not show up in any benchmark.

    The models still struggle to understand tone. They lose the thread of a decision halfway through. They miss the one detail that mattered. They give the shape of an answer before doing the work. They over-explain simple things and under-think the hard ones. They refuse too much, or they agree too easily.

    So the user starts managing the model.

    You rewrite the prompt. Add more context. Repeat constraints. Correct the tone. Ask it to try again. Ask it to be more precise. Ask it to follow the instruction it was already given. Ask it not to hallucinate. Ask it not to be lazy.

    That is the frustration. AI was supposed to reduce work. Too often, it creates a new kind of work. Supervising the system that was supposed to help.

    This is not just a model problem. It is a product problem.

    Benchmarks measure capability under controlled conditions. Users measure usefulness inside messy work. Those are not the same thing. A model can improve on coding tests and still feel worse in a normal conversation. It can perform better on long-context evaluations and still miss the one detail the user cares about. It can become safer and more polished while becoming less direct, less decisive, and less useful.

    The gap between the benchmark and the experience is widening. And the people who feel it most are the people using these tools every day.

    There is another part of this that does not get said enough.

    Every release raises the bar. Three months ago, a response might have felt good enough because the expectation was lower. Today, the same response feels worse because the user has seen what the system can do when it is working well. The standard has moved. The promise has moved. The actual daily output has not moved at the same pace.

    The user is no longer asking, can this thing write. They are asking, can I trust this thing to understand what I mean, keep context, make a judgment call, and save me time without turning me into the operator. That is a harder bar. And it is the right bar.

    The strange thing about AI products today is that capability and trust do not always move together. A new model can be more powerful and still feel less trustworthy. A new interface can add features and still slow the user down. A safety layer can reduce one kind of risk and introduce another kind of friction. A warmer personality can make the product feel more human while making the answer less rigorous.

    Users feel all of this. They may not know whether the issue is model routing, guardrails, memory, latency, cost optimisation, or product design. They only know the result.

    It used to answer better. Now I have to work harder.

    That sentence should worry every AI company.

    There is a quieter cost in all of this that does not get measured. Time. Hours of it, every day, lost to prompting and re-prompting. The subscription bill keeps climbing because we keep paying for the higher tier, hoping that this version, this release, this update will close the gap. It rarely does. The promise stays just out of reach. The work stays on us.

    The next phase of AI will not be won by the company with the highest benchmark. It will be won by the product that feels consistently useful in real work. Fast when the task is simple. Careful when the task is hard. Direct when the user needs a decision. Honest when the system is unsure. Consistent enough that the user does not feel like they are starting from scratch every day.

    That is the product challenge. And it is bigger than the model challenge.

    AI is getting better. The experience is not. Until that changes, the gap between the promise and the daily reality will keep growing. The people building these systems will keep pointing at the benchmarks. The people using them will keep pointing at the clock.

    One of those views is right.

  • AI Should Not Be the Wall

    Voice AI is being sold as a customer service breakthrough.

    Lower costs. Shorter queues. Smaller teams. Support that scales without adding headcount.

    That is the company view.

    The customer view is different.

    When a customer calls support, they are not asking for automation. They are asking for help. Usually because the app, the chatbot, or the self-service flow already failed.

    A phone call is not the beginning of support. It is the escalation path.

    That matters.

    By the time someone picks up the phone, they are already carrying urgency. They may be blocked, delayed, charged incorrectly, or simply out of patience.

    At that point, the job of support is not to sound intelligent.

    The job is to shorten the distance between the customer and resolution.

    Too many voice AI systems are doing the opposite.

    They start with long greetings. They ask rigid questions. They misunderstand context. They send customers into loops. They schedule callbacks that do not solve the issue. They make the customer perform the company’s internal process.

    That is not progress.

    It is the old support problem with a smarter voice.

    AI should have made this better. It should have reduced waiting. It should have triaged urgency faster. It should have helped agents serve more customers with better context.

    Instead, many companies are using AI as a wall.

    The human agent is no longer the next step. The human agent is hidden behind another automated layer.

    Customer support is not just a cost line. It is one of the clearest moments where a company shows what it believes about its customers.

    When something goes wrong, the customer finds out whether the company is designed around resolution or deflection.

    Voice AI can be useful. It can identify intent. It can check status. It can route issues. It can prepare a human agent before the customer reaches them.

    But the operating principle has to be clear.

    AI should compress the path to resolution, not extend it.

    If the issue is simple, AI should resolve it quickly. If the customer is stuck, AI should stop the loop. If the system does not understand, it should hand off cleanly.

    No fake confidence. No repeated callbacks. No dead-end flows.

    The reason someone calls is simple.

    They need help.

    And in the age of AI, getting help should feel faster, not further away.