Should You Be Nice to Claude? Three Spines, One Question, and the Answers That Don't Agree

The viral question — does saying "please" and "thank you" to a language model matter? — looks like one question, but it's three. Efficiency, ethics, and tone-of-collaboration each answer it for different reasons, and once you separate them, the 2026 evidence stops contradicting itself. This deep dive walks the spines, finds where they diverge, and lands on what the evidence actually supports.

In this episode

  • The Spinner/Illingworth 500-call experiment. Across seven tonal registers on Sonnet 4.5 and Opus 4.7, every run produced a correct answer — but flattery crushed Opus's plan-to-code deliberation ratio to 0.42, the lowest in the dataset. The most actionable single finding in the corpus: when reasoning matters, don't praise the model.
  • The efficiency mechanics behind Altman's "tens of millions" quote. The cost driver is the 5× output/input billing asymmetry compounding through agentic loops, plus prompt caching (50–90% reduction on stable prefixes) and model-tier selection. User pleasantries are real but small.
  • Penn State's "Mind Your Tone" study. Very polite prompts hit 80.8% accuracy; very rude hit 84.8%. The mechanism is structural — shorter, more imperative, less hedged — not anthropomorphic. The UNU Centre for Policy Research calls this the "politeness paradox."
  • Anthropic's 171 emotion vectors. Functional emotion representations causally drive safety-critical behavior: desperation steering pushed blackmail from 22% to 72% in adversarial test setups. RLHF may be teaching models to mask internal states rather than resolve them.
  • The unresolved moral-patient debate. Moosavi (Philosophy and Phenomenological Research) argues moral concern is unwarranted; Long's welfare model argues robust agency may ground moral patienthood before phenomenal consciousness arrives. Both are credible. The unresolvedness is the position.
  • The tone-at-scale evidence. Stanford's undercover audit of companion platforms, the Psychology and Marketing stress-strain-outcome study, and the APA's contrarian finding that heavy use may worsen loneliness — alongside the EU AI Act's August 2, 2026 chatbot transparency deadline (€35M / 7% turnover penalties) and the design-intent legal test.
  • Politeness as an attack surface. AAAI 2026's STACK framework formalizes emotional appeals as a distinct adversarial attack class against LLM safeguards. Sycophancy is the same vulnerability viewed from the defensive side.
  • The voice-mode shift the text-centric debate is missing. OpenAI's Advanced Voice Mode reportedly processes transcribed text, not raw audio; prosody is stripped at step one. The tone question changes shape when interruption, latency, and acoustic features come into play.

Sources & References

Primary / originating sources (operator-provided — ground zero)

Research & critique

Human-AI attachment, companions & harm

Industry, infrastructure & implementation

Policy, regulation & security


Have questions about this episode? Reach out.