> ## Documentation Index
> Fetch the complete documentation index at: https://koreai.mintlify.app/llms.txt
> Use this file to discover all available pages before exploring further.

# Voice Interactions

When integrating a voice channel through the Automation Node in AI for Service, two modes are available: **Real-Time Voice** and **TTS Streaming (ASR/TTS)**. These modes determine how user audio is processed and how responses are delivered.

```
Real-Time Voice  →  Multimodal LLM  →  Audio response
TTS Streaming    →  Text LLM  →  AI for Service TTS engine  →  Audio response
```

***

## Real-Time Voice

Real-Time Voice enables natural conversation using multimodal language models that process audio input and generate audio output directly.

**How it works:**

* The Platform uses the configured real-time voice model to process user audio and generate audio responses.
* Real-Time Voice must be enabled in both the AI for Service Automation Node and the Agentic App on the Platform.
* If not enabled on the Platform side, audio requests fail with errors.
* When Real-Time Voice is disabled in the Automation Node, the system defaults to ASR/TTS.

<Note>[Wait Time Experience](/agent-platform/agents/agentic-apps/settings/app-configurations#waiting-experience) is not supported for Real-Time Voice interactions.</Note>

**To configure:**

1. Enable Real-Time Voice in the AI for Service Automation Node. <img src="https://mintcdn.com/koreai/DVM6QPNDpXRjmobZ/agent-platform/howto/images/voice/real-time-voice.png?fit=max&auto=format&n=DVM6QPNDpXRjmobZ&q=85&s=4efbb5396ccd3f347e88763826866bbd" alt="Enable Real Time Voice" width="381" height="810" data-path="agent-platform/howto/images/voice/real-time-voice.png" />
2. Enable Real-Time Voice in the Agentic App and configure a model that supports real-time voice. <img src="https://mintcdn.com/koreai/DVM6QPNDpXRjmobZ/agent-platform/howto/images/voice/enable-voice-ap.png?fit=max&auto=format&n=DVM6QPNDpXRjmobZ&q=85&s=6fac23ffafeb7bb345363a91b64909e8" alt="Enable Real Time Voice in Agent Platform" width="950" height="852" data-path="agent-platform/howto/images/voice/enable-voice-ap.png" />

***

## TTS Streaming (ASR/TTS)

TTS Streaming is the default when Real-Time Voice is disabled. The Platform uses a text-based LLM to generate responses; AI for Service converts the text to speech using TTS engines (Deepgram, ElevenLabs).

**How it works:**

* The Platform generates a text response and sends it to AI for Service.
* AI for Service converts the text to speech using configured TTS engines.
* When TTS Streaming is **off**, the full audio response is delivered only after the complete output is generated.
* When TTS Streaming is **on**, text is streamed progressively as it is generated, reducing latency.

**Wait-time experience:** When processing is delayed, the Platform sends a filler or system-generated message as configured. AI for Service presents this to the user via TTS as it is received (when streaming is enabled).

**To configure:**

1. Disable Real-Time Voice in the AI for Service Automation Node.
2. To enable streaming, turn on the **TTS Streaming** option. <img src="https://mintcdn.com/koreai/DVM6QPNDpXRjmobZ/agent-platform/howto/images/voice/tts-streaming.png?fit=max&auto=format&n=DVM6QPNDpXRjmobZ&q=85&s=9a32a1f81e9d9f20385b473daee282d9" alt="TTS Streaming" width="385" height="816" data-path="agent-platform/howto/images/voice/tts-streaming.png" />
3. No additional configuration is required on the Platform side.

***
