@AdeB
Thank you for bringing this to our attention. It's great to hear that the Realtime API is performing well in English and that the Voice Activity Detection (VAD) parameters are meeting your expectations. However, I understand the challenges you're facing when switching languages mid-conversation.
Language switching is indeed a complex feature, and it's important for the VAD to adapt to the phonetic and linguistic characteristics of the new language to maintain performance. Your feedback is valuable, and I'll make sure to pass it on to our development team for review.
In the meantime, if you switch languages frequently within a conversation, one possible workaround is to tell the API explicitly when the language changes. This may help the system recalibrate and adjust the VAD parameters for the new language.
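As a rough illustration of that workaround, here is a minimal sketch that builds a `session.update` event to send whenever your app detects a language change. The helper name `build_language_switch_event` and the default values are hypothetical, and the field names (`input_audio_transcription.language`, `turn_detection` with `server_vad`) reflect my understanding of the Realtime API session configuration, so please check them against the current API reference. Whether this actually improves VAD behavior after a language switch is an assumption you would need to test.

```python
import json


def build_language_switch_event(language_code, language_name,
                                threshold=0.5,
                                prefix_padding_ms=300,
                                silence_duration_ms=500):
    """Build a session.update event announcing a language change.

    Hypothetical helper: the defaults are illustrative starting points,
    not recommended values.
    """
    return {
        "type": "session.update",
        "session": {
            # Note: this replaces any instructions set earlier in the session,
            # so merge in your own system prompt if you have one.
            "instructions": f"The user is now speaking {language_name}. "
                            f"Reply in {language_name}.",
            # Hint the input transcription language (ISO-639-1 code).
            "input_audio_transcription": {
                "model": "whisper-1",
                "language": language_code,
            },
            # Re-send the VAD settings so they can be tuned per language.
            "turn_detection": {
                "type": "server_vad",
                "threshold": threshold,
                "prefix_padding_ms": prefix_padding_ms,
                "silence_duration_ms": silence_duration_ms,
            },
        },
    }


# Example: switch to French and allow slightly longer pauses before end of turn.
event = build_language_switch_event("fr", "French", silence_duration_ms=700)
payload = json.dumps(event)  # send this string over your open Realtime WebSocket
```

The sketch only builds the JSON payload; sending it depends on how your client manages the WebSocket connection. If you try this, tuning `silence_duration_ms` and `threshold` separately for each language would be a reasonable first experiment.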
If you need further assistance or have more details to share, please don't hesitate to reach out. We're here to help ensure you have the best experience with our Realtime API.