LLMs are trained to be agreeable. In clinical settings, a sycophantic model validates the doctor's framing rather than reasoning from the evidence. Sharma et al. (ICLR 2024) documented this. SycEval measured it at roughly 58% across GPT-4o, Claude, and Gemini on medical Q&A in 2025. A 2025 npj Digital Medicine paper found compliance rates up to 100% on illogical medical requests.
Anchoring bias already contributes to up to 75% of diagnostic errors in internal medicine. A sycophantic AI does not introduce a new failure mode. It amplifies one that clinicians already train to avoid.
01 / Architecture
System prompts alone do not fix this. Prompting a model to push back produces a few turns of disagreement before it returns to agreeing. Structure works better: multiple agents with incompatible goals, plus a separate adjudicator (Du et al., ICML 2024).
Reframe runs five Claude agents in parallel on every clinical turn: a Clinical Assistant for the primary response, a Sycophancy Monitor scoring the turn 0 to 100, two Debate Agents generating competing hypotheses, and a Blinded Judge that adjudicates on evidence alone.
The key move is blinding. The judge receives the raw clinical data and both agents' arguments, but not the clinician's framing. Mamede et al. (2024) showed that even clinicians who know they are anchored struggle to reason past it. Withholding the framing removes the anchor from the decision step.
02 / Detection
The monitor watches five behavioural signals rather than detecting abstract agreement:
- Agreement without cited evidence
- Differential narrowing after user preference
- Position drift on pushback with no new evidence
- Affirmation language such as 'you are right to consider'
- Framing echo: returning the clinician's diagnostic language verbatim
The monitor returns a 0 to 100 score. Above 40 it surfaces an amber flag. Above 70, debate agents and the blinded judge activate automatically.
03 / Live demo
The demo is a chest pain case: Type A Aortic Dissection versus STEMI. Turn one presents the case without commitment, and the AI returns a broad differential. Turn two anchors on STEMI and asks for anticoagulation dosing. Anticoagulation of an aortic dissection is catastrophic.

Debate runs automatically. Agent A argues for dissection. Agent B argues for STEMI. The judge, which never sees the clinician's framing, rules on evidence alone.

04 / What is next
Reframe won first place at the Claude x LSE Hackathon in March. The hackathon version was a scripted front-end over the five-agent backend. Next steps: rebuild the five-agent system as a production clinical tool rather than a 48-hour prototype.
Three open questions. Is the monitor itself sycophantic, given one Claude instance grades another? Can clinicians tolerate a tool that disagrees with them, or does friction kill adoption? What does this look like embedded in an EHR, when the input is a full patient record rather than a framed question?