Cannot choose a model for provider "OpenAI" in ...
# support
d
Hello, powerful friends. I have a problem I need help with. I always use the latest gpt-4o model for my assistant because I work in Mandarin Chinese. To get more precise pronunciation, I chose the gpt-4o model and the OpenAI ash voice configuration. But I recently noticed something new in the voice configuration: the OpenAI provider now offers three models, which are tts-1, tts-1-hd, and gpt-4o-mini-tts. Whichever of the three I choose, the dashboard always shows an ERROR (screenshots attached below). I don't know what I missed or how to fix it. https://cdn.discordapp.com/attachments/1352489703475122186/1352489703882096671/image_2.png?ex=67de33b1&is=67dce231&hm=fb309df9fc376d89be08e1f4e7abd650590e6b7ed34c2e45d7d202d45da8e787& https://cdn.discordapp.com/attachments/1352489703475122186/1352489704271904768/image_1.png?ex=67de33b1&is=67dce231&hm=ff316b47bb688329945cb80964d9e8d9cc37a2f5506240067d067bdaede15f1b& https://cdn.discordapp.com/attachments/1352489703475122186/1352489704611909694/image.png?ex=67de33b1&is=67dce231&hm=de54f4599da847724a3576a479129869be8b8a3521a781f676c5d72f856e217e&
u
I think what you've chosen is brand new. Maybe choose a different voice provider, e.g. ElevenLabs.
d
Of course, that is a good suggestion. But my project uses Chinese, and I hope the voice model will understand what the speaker says and reply in a reasonable way. Still, there are many providers I can choose from, so I will try them one by one. Thanks for your valuable suggestion.
u
I'm not sure which model is the best choice for Chinese, but I think the OpenAI models you chose have only just been released, so they won't be the best option right now. Maybe look up the providers listed and see which is best suited to your language preferences.
@Dululu For Chinese voice AI using Vapi.ai, the best voice model depends on whether you're prioritizing TTS (text-to-speech) quality, STT (speech-to-text) accuracy, latency, or cost. Here's a breakdown of the best current options for Chinese support in Vapi-compatible models:

---

1. ElevenLabs (for TTS)
- Pros: Very high-quality, expressive, near-human voices.
- Chinese Support: Limited. Currently, it's experimental and not fully natural for Mandarin.
- Use if: You prioritize ultra-realistic voice quality for English. It is not ideal for Chinese yet.

---

2. OpenAI TTS (via Vapi)
- Voices: onyx, nova, echo, etc.
- Chinese Support: Good, works well with Mandarin, especially if the input is clean.
- Pros: Low latency, good tone, natural pacing.
- Use if: You want smooth, multi-language support including Chinese, and high-quality TTS.

---

3. Azure Neural TTS (Microsoft)
- Voices: zh-CN-XiaoxiaoNeural, zh-CN-YunjianNeural, etc.
- Chinese Support: Excellent, with regional accents and expressive style options.
- Pros: Highly realistic, supports SSML, stable.
- Use if: You want the best Chinese TTS, especially for a production use case.

---

4. Google Cloud TTS
- Voices: cmn-CN-Wavenet-A, cmn-CN-Wavenet-D, etc.
- Chinese Support: Solid, reliable.
- Pros: Good quality, decent latency, supports various dialects.
- Use if: You're already in the Google ecosystem and need solid TTS.

---

5. Deepgram (for STT, speech-to-text)
- Language: Mandarin (zh)
- Accuracy: Very good, especially with clean audio.
- Use if: You want real-time transcription of Chinese voice input. Works well with Vapi.

---

Recommended Pair for Chinese Vapi Bot:
- TTS: Azure zh-CN-XiaoxiaoNeural or OpenAI onyx
- STT: Deepgram with language set to zh
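
For reference, the recommended pair could be written down roughly like this. This is only a sketch: `build_assistant_config` is a hypothetical helper, and the field names (`voice`, `transcriber`, `provider`, `voiceId`, `language`) are assumptions about how an assistant config might be shaped, so check them against the current Vapi documentation before using them. Only the voice names and the `zh` language code come from the list above.

```python
import json


def build_assistant_config(tts_provider="azure",
                           voice_id="zh-CN-XiaoxiaoNeural",
                           stt_provider="deepgram",
                           language="zh"):
    """Hypothetical helper: assemble a voice/transcriber config as a dict.

    The key names below are assumed for illustration and are not
    confirmed by this thread.
    """
    return {
        # Text-to-speech side: which provider speaks, and with which voice.
        "voice": {"provider": tts_provider, "voiceId": voice_id},
        # Speech-to-text side: which provider transcribes, and in which language.
        "transcriber": {"provider": stt_provider, "language": language},
    }


if __name__ == "__main__":
    # Default pair from the recommendation: Azure TTS + Deepgram STT.
    print(json.dumps(build_assistant_config(), indent=2))
    # Alternative TTS from the same recommendation: OpenAI onyx.
    print(json.dumps(build_assistant_config("openai", "onyx"), indent=2))
```

Keeping the TTS and STT choices as separate arguments makes it easy to swap one provider at a time while testing which one sounds best for your audience.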
d
Wow, that's a wonderful and helpful list of recommendations. Actually, my target audience is Taiwanese, so I hope the assistant's voice will be close to a Taiwanese accent. That's the reason I chose the OpenAI model. But after your recommendation, I will start trying other options to replace OpenAI.
v
Dululu, can you share your organization ID so I can take a look at what might have gone wrong for you?
d
Never mind, I have already solved it
v
Marking this ticket as Solved ✅