Voice & Turn-Taking Settings

Voice & Turn-Taking Settings

These settings control how aggressively the agent detects user speech and decides when to respond.

Interruptions

  • allow_interruptions
    • Allows the user to interrupt the agent mid-speech.
  • min_interruption_duration
    • Minimum duration of detected user speech to count as an interruption.

Defaults:

  • allow_interruptions=True
  • min_interruption_duration=0.08

Turn-taking / endpointing

  • min_silence_duration
  • activation_threshold
  • prefix_padding_duration
  • min_endpointing_delay
  • max_endpointing_delay

Defaults:

  • min_silence_duration=0.25
  • activation_threshold=0.55
  • prefix_padding_duration=0.25
  • min_endpointing_delay=0.45
  • max_endpointing_delay=0.9

Practical guidance

  • If the agent responds too quickly, increase min_silence_duration.
  • If the agent interrupts users too easily, reduce allow_interruptions or increase min_interruption_duration.
  • If speech is missed at the beginning, increase prefix_padding_duration.