Interface: RealtimeSessionConfig

Configuration for a realtime session

Properties

optional instructions: string;

System instructions for the assistant

optional maxOutputTokens: number | "inf";

Maximum number of tokens in a response

optional model: string;

Model to use for the session

optional outputModalities: ("text" | "audio")[];

Output modalities for responses (e.g., ['audio', 'text'], ['text'])

optional providerOptions: Record<string, any>;

Provider-specific options

optional semanticEagerness: "low" | "high" | "medium";

Eagerness level for semantic VAD ('low', 'medium', 'high')

optional temperature: number;

Temperature for generation (provider-specific range, e.g., 0.6-1.2 for OpenAI)

optional tools: RealtimeToolConfig[];

Tools available in the session

optional vadConfig: VADConfig;

VAD configuration

optional vadMode: "server" | "manual" | "semantic";

VAD mode

optional voice: string;

Voice to use for audio output