Preemptive responses #57 by tanwir1703 · Pull Request #98 · think41/foundation-voice

tanwir1703 · 2025-07-31T06:50:55Z

Below are the updates on the preemptive response issue (#57), listed against its requirements.
Requirements:

Allow configuration of one or more preemptive phrases (global + intent-level) - Done ☑️
Trigger a preemptive response based on a configurable latency threshold (e.g., 300ms) - Done ☑️
Support fallback behavior (e.g., skip preemptive response if actual response is ready quickly) - Almost Done ☑️ (Fallback works fine with large thresholds of 6 s or more. With a threshold of 3-4 s, the preemptive response is sometimes triggered after the LLM response.)
Support TTS integration for seamless voice playback - Done ☑️
Optional: Use different fillers depending on the user intent (context-aware preemptives) - Done ☑️ (The last user text is fetched from TranscriptionUpdateFrame inside the preemptive block and used to pick an intent-driven preemptive response. The intent derivation logic is currently elementary; it can easily be configured in the future.)
Should be integrated within the voice orchestration layer (likely between STT and LLM) - Done ☑️
Ensure preemptive responses don’t conflict or overlap with final responses - Done ☑️
Possible need for an interrupt or cancel mechanism if preemptive and actual responses collide - Done ☑️
Should work well across both streaming and non-streaming response types - Partially Done ☑️ (It works well with non-streaming responses; I couldn't test it with streaming responses.)
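
For reviewers, here is a minimal sketch of the threshold and fallback behaviour described in the requirements above. It is not the code in this PR: `respond_with_filler`, `get_response`, and `speak` are hypothetical names, and the real processor works on pipeline frames rather than plain strings. The sketch makes a single decision once the threshold elapses, so the filler is either spoken before the real response or skipped entirely, which is one possible way to avoid the filler landing after the LLM response.

```python
import asyncio
from typing import Awaitable, Callable


async def respond_with_filler(
    get_response: Callable[[], Awaitable[str]],  # hypothetical: wraps the LLM call
    speak: Callable[[str], Awaitable[None]],     # hypothetical: wraps TTS playback
    filler: str,
    threshold_s: float,
) -> str:
    """Speak `filler` only if the real response is not ready within `threshold_s`."""
    response_task = asyncio.create_task(get_response())

    # Wait for the response, but give up waiting once the threshold elapses.
    done, _pending = await asyncio.wait({response_task}, timeout=threshold_s)

    if not done:
        # The response is slow: play the filler first.
        await speak(filler)

    # The real response is always spoken after the filler (if any), never before.
    response = await response_task
    await speak(response)
    return response
```

Because both utterances go through the same `speak` call in sequence, the filler and the final response cannot overlap in this sketch. A streaming variant would need to apply the same check to the first chunk instead of the full response.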

Steps to test it out:

basic_agent.json contains the necessary preemptive phrases and threshold values.
Run main.py (to access the /ws endpoint) inside the examples folder, following the steps in readme.md.
Navigate to foundation-voice/examples/websocket endpoint and run the client and server to test.
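
When trying different phrases and thresholds, it may help to picture how global and intent-level phrases can be resolved. The sketch below is illustrative only: the key names (`threshold_ms`, `global_phrases`, `intent_phrases`, `intent_keywords`) and the keyword-matching rule are assumptions, not the actual schema of basic_agent.json or the intent logic in this PR.

```python
import random
from typing import Dict, List, Optional

# Hypothetical config shape; the real keys in basic_agent.json may differ.
PREEMPTIVE_CONFIG = {
    "threshold_ms": 300,
    "global_phrases": ["One moment, please."],
    "intent_phrases": {
        "billing": ["Let me pull up your billing details."],
    },
    "intent_keywords": {
        "billing": ["invoice", "bill", "charge"],
    },
}


def detect_intent(text: str, keywords: Dict[str, List[str]]) -> Optional[str]:
    """Return the first intent whose keyword appears in the text, else None."""
    lowered = text.lower()
    for intent, words in keywords.items():
        if any(word in lowered for word in words):
            return intent
    return None


def pick_phrase(last_user_text: str, config: dict) -> str:
    """Prefer an intent-level phrase; fall back to the global phrases."""
    intent = detect_intent(last_user_text, config["intent_keywords"])
    phrases = config["intent_phrases"].get(intent) or config["global_phrases"]
    return random.choice(phrases)
```

With this layout, an utterance that mentions an invoice would get the billing filler, and anything unmatched would get a global phrase.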

tanwir1703 added 5 commits July 28, 2025 23:06

Preemptive Block Added

b7d19e3

Updated preemptive_processor with intent driven & configurable responses

e230608

Removed ws_client.py

5bf9420

Setup the application environment

271645d

Updated threshold value

787dc30
