The Bounds of the Cloned Voice

In 2024, journalist Evan Ratliff cloned his own voice, wired it to a chatbot, and let it loose on the phone — and the thing that gave it away was a pause. This deep dive traces where that pause went. By 2026 the technical tells Ratliff relied on — latency, woodenness, the audible seams — have largely collapsed, which means the real boundary of the cloned voice has moved off the machine and onto us: our psychology, our laws, and the economics of who can afford to fake whom. The defense that still holds isn't a better detector. It's whether you and the person on the other end agreed on a code word in advance.

In this episode

  • The 2024 baseline, from the source. Evan Ratliff's Shell Game — the three-part voice-agent stack (ElevenLabs clone + LLM "brain" + phone number), the DIY rig that off-the-shelf platforms quickly obsoleted, and the three honest tells: a robotic pause, a wooden affect, and a "world-class bullshitter" that confabulated zip codes.
  • The tells fall. Native speech-to-speech models (ICLR 2026; Hume EVI 3) and sub-300ms latency engineer out the pause Ratliff exploited — while the "indistinguishable clone" claim rests on marketing language with no published benchmarks behind it.
  • Believability was never in the machine. The psychology research confirms Ratliff's hunch: primed expectations, emotional arousal, and years of bad-VoIP conditioning do the attacker's work. Seventy percent of people can't reliably tell a clone from a real voice.
  • The economics that make it inevitable. All-in automated vishing at two to six cents a minute; voice generation is now the cheapest part of the fraud stack. The same cost collapse is reshaping voice-actor labor — but vendors themselves concede the performance gap.
  • Consent, the dead, and the law racing to catch up. AB 1836, the ELVIS Act, and the NO FAKES Act's June 2026 committee advance — against Berkeley Law's finding that no will can stop cloning if your audio is public, and Cambridge's warning about griefbots "haunting" the bereaved.
  • The escalation: when the camera stops being proof. Ratliff turned his video on to prove he was human; FBI advisories now document multimodal "proof of life" fakes in virtual-kidnapping scams — and detection is measurably behind the attack.
  • The contrarian beat. Clones still fail, behavioral defenses outperform technical ones, the scariest fraud-economics numbers are vendor estimates, and the headline detection benchmark was self-published by a contestant.

Sources & References

Primary / originating sources (operator-provided — ground zero)

Research & critique (peer-reviewed and academic)

Law, regulation & government

Industry, implementation & economics


Have questions about this episode? Reach out.