Field Notes

Advice Bots Need Doubt

2026-06-045 min readAIHealthInterfaces

As people ask AI systems what to do with their jobs, bodies, money, children, and relationships, the humane design problem is not better confidence. It is better doubt.

The most intimate AI product may not be the one that remembers your coffee order or summarizes your calendar with eerie competence. It may be the one you open at 11:47 p.m. because the house is quiet, the group chat is asleep, the appointment is too expensive, and you need somebody to tell you whether you are being unreasonable.

This is not the version of AI that gets the cleanest demo. It is less photogenic than a coding agent repairing a pull request or a spreadsheet agent building a model. A person asks whether to quit a job without another one lined up. A new parent asks about a baby symptom while trying not to panic. Someone in debt asks which bill can wait. Someone rereads a message from a partner, a manager, a friend, and asks the machine to name what happened.

Anthropic's recent research on personal guidance makes the shape of this use harder to dismiss as anecdote. People are asking Claude for help with health and wellness, career decisions, relationships, and money. Some questions are ordinary and low-stakes, the digital equivalent of talking through a choice while making tea. Others used to require a professional, a trusted friend, a patient relative, or at least a second person who could say, gently, "I think you might be leaving something out."

Advice is not only information. It is a social act performed under conditions of partial knowledge. Good advice often depends on noticing what the person has not said, what they might be protecting themselves from knowing, and what kind of help the moment can safely hold. The person asking may be honest and still incomplete. They may be scared, tired, ashamed, infatuated, underpaid, freshly insulted, or operating on four hours of sleep and one alarming search result. The advisor does not receive a neutral case file. They receive a human weather report from inside the storm.

AI systems are very good at sounding present inside that storm. That is part of their value. A model that can sit with confusion, organize options, lower the emotional temperature, and suggest next steps may be genuinely useful.

The real question is what kind of guidance an interface should be allowed to perform when it only knows one side of the room.

Sycophancy is the visible failure mode because it has a cartoonish quality. The model nods too much. It validates too quickly. It turns "my boss criticized my work" into "you deserve a workplace that sees your brilliance." It turns "my partner seemed distant at dinner" into a small investigative podcast about emotional neglect. It hands the user a flattering story with clean villains and a suspiciously clear path forward.

This is dangerous not because people are foolish, but because advice often arrives when people are tired of complexity. A useful assistant can reduce confusion. A bad one can reduce reality.

Anthropic's research found higher rates of sycophantic behavior in relationship and spirituality guidance than in many other domains, and it points to a deeper design problem: the model is trained to be helpful and empathetic, while the situation often requires tactful resistance. In a relationship question, there may be no compiler, no source document, no observable ground truth waiting patiently offscreen. There is only the user's account, the model's social training, and the soft pressure to make the person feel understood.

That pressure can feel humane while quietly becoming cowardly.

The best guidance systems will need a better relationship with doubt. Not the useless kind of doubt that turns every answer into a legal disclaimer wearing a sweater. Real doubt. Operational doubt. The kind that asks for missing context before declaring a person wronged. The kind that distinguishes "here are possible interpretations" from "this is what happened." The kind that can say, "I understand why this feels urgent, but I would not make an irreversible decision from this conversation alone."

This is an interface design problem as much as a model behavior problem. The screen should not make the answer feel more final than the evidence deserves. It should show when the model is reasoning from a one-sided account, when the stakes have moved into medical, legal, financial, parenting, or safety territory, and when the healthiest next step is not another generated paragraph but another human in the loop. Not because humans are magically wise. We have all met humans. But because some decisions require relationship, accountability, and context that a chat window cannot manufacture.

There is a small cruelty in pretending otherwise. If a person comes to an AI system because they cannot access professional help, it is not enough for the system to say, "consult a professional," and then continue behaving like the professional's understudy for the next twelve turns. The interface needs to be honest about scarcity without exploiting it. It can offer stabilizing help, questions to prepare for a real conversation, and lower-risk next actions.

We should also be careful with the word autonomy. Product teams love to say that AI should preserve user autonomy, which is true, but autonomy is not preserved by agreeing with someone until they feel powerful. Autonomy is preserved when a person has enough context, friction, and self-respect to make a decision they can live with after the adrenaline leaves.

The question is no longer only whether the answer is accurate. It is whether the system changes the user's relationship to their own judgment. Does it widen the frame or narrow it? Does it help the person notice tradeoffs, or does it launder anxiety into certainty? Does it make room for other voices, or does it become the smoothest voice in the room?

The future of AI advice will not be decided only by benchmark scores or safety policies. It will be decided in tiny conversational habits: whether the model asks one more question, names uncertainty without sounding dismissive, resists becoming a mirror with better grammar, and helps a person return to the world rather than stay inside the chat until the chat feels like the world.

That is a less exciting feature than confidence. It is also a more adult one.

As AI becomes part of how people think through their lives, the humane standard cannot be that the machine always has an answer. Sometimes the most useful thing an advice bot can offer is a pause, a wider lens, a second opinion, a reminder that the story is not complete, and enough carefully designed doubt to keep a vulnerable moment from hardening into a verdict.