Receptic AI logoReceptic AI
Back to blog
EngineeringApril 22, 202612 min read

Warm transfers, explained: SIP REFER vs. bridged conferences

Why we ship both, when each one wins, and how we keep the caller from hearing dead air during the handoff.

M
Maya Chen
Telephony · Receptic AI

“Transfer me to a human.” Five words that sound trivial, but expose every shortcut a voice platform took. Receptic ships two transfer modes — SIP REFER and bridged conference — because there is no single right answer.

SIP REFER: the elegant handoff

SIP REFER tells the originating PBX to re-route the call directly to a third party. The agent steps out, the carrier handles the new leg, and the original call terminates cleanly. Lowest cost, lowest media path, no ongoing leg to pay for.

Bridged conference: the warm handoff

The agent stays on the line, dials the human, plays hold music to the caller, briefs the human in a private leg, then merges the two legs into a three-way conference. The agent drops out once the human acknowledges, leaving caller and human talking.

How we choose

The agent picks based on three signals: the destination's configured preference, whether the customer flagged the call as high-context (PI intake, billing dispute), and whether the receiving number has been set up to handle blind transfers.

function pickTransferMode(call, destination) {
  if (destination.prefersBridged) return "bridged";
  if (call.context.requiresBriefing) return "bridged";
  if (destination.acceptsBlind) return "refer";
  return "bridged"; // safe default
}

The hold music problem

A 4-second silence during a transfer feels like a dropped call. We loop a soft music bed — different per customer brand — under a brief spoken cue: “One second while I get someone on the line.” Latency to the bridge is 600-1200ms. The cue covers all of it.

Whisper briefing

On a bridged transfer, the agent speaks a 2-sentence summary into the human-only leg before merging. Format:

“You're getting Sarah Lee, calling about a leak under her sink. She tried the shutoff valve already.”

We render this from the structured call state — caller name, stated reason, attempted self-help, urgency signal — so the human picks up already in the conversation.

Failure modes

What we'd do differently

If we were starting over, we'd build the bridged path first and only add SIP REFER as an optimization for high-volume customers. The cost savings of REFER are real, but the experience cliff when it fails is worse than the fixed cost of always bridging.

Try Receptic

See it answer a real call.

Spin up an agent on a sandbox number in minutes. No credit card to test.

Try the demo
Keep reading

More from the blog