@Samuel As discussed over the call with @jb , the issue was coming from the transcriber not being able to capture single words or other words when using both transcribers during the call, which led to long silences and delayed responses. For startSpeakingPlan, Deepgram and custom transcriber changes were suggested. Please update me once you're done with your changes and let me know how it goes for you.