I've built an AI voice assistant to help me research restaurants while I'm driving. I've tried different voices and different ways of outputting things from my tool, but I can't get the voices to pronounce the restaurant names. They will say literally everything else in the output, and the logs show that the model is outputting the restaurant name to the voice, but the voice will not pronounce the restaurant names like 98% of the time. What can I do? I've tried vapi voices, elevenlabs voices, and cartesia voices.
Here's one call where this happened:
call-id: e03ee0c8-09d6-4c9d-a43a-11607271e549
timestamp: 21:05:58:375
response expected: the name of the restaurant: Tavernetta to be pronounced.
response received: the voice skipped the restaurant name.