Receptic AI logoReceptic AI
Back to blog
Voice AIMarch 1, 20268 min read

Designing five voices people actually want to talk to

What we learned shipping Nova, Iris, Atlas, Marco, and Juno — and why pacing matters more than timbre.

S
Selene Marquez
Voice Design · Receptic AI

We launched with five voices: Nova, Iris, Atlas, Marco, Juno. We've since tested two dozen more in the lab. Most got cut. The ones that survived share less than you might think — and more about rhythm than tone.

Timbre is table stakes

Modern neural TTS gives you a clean, intelligible voice for almost any persona. That part is solved. What you can't buy off the shelf is conversational pacing: how the voice handles pauses, breath, false starts, interruptions.

The four pacing dials

Why we shipped five, not fifty

More options paralyze customers. Five voices forces a choice. Each one was tuned for a clear use case:

The smiling voice problem

A receptionist sounds different when they're smiling. Most TTS doesn't. We tag certain prompts (greetings, confirmations, thank-yous) with a “smile” modifier that nudges the model toward a brighter formant. Subtle, but callers consistently rate “smiled” agents as friendlier in blind tests.

Pacing makes a voice feel alive. Timbre just makes it intelligible.

What we cut

Test in the wild

Lab listening tests are useful but not predictive. The voice that sounded best on headphones isn't necessarily the one callers prefer over a speakerphone in a car. Ship A/B in production, measure call completion and human-rated friendliness, iterate.

Try Receptic

See it answer a real call.

Spin up an agent on a sandbox number in minutes. No credit card to test.

Try the demo
Keep reading

More from the blog