IS VAPI Postprocessing LLM responses for text to speech? Vapi AI #support

IS VAPI Postprocessing LLM responses for text to s...

AiTitus :)

11/14/2024, 5:50 PM

Hey team! When i look at logs sometimes the model ouput is not exactly what is put into the voice input. I need it to be the same as the output from the llm, looks like there may be a prompt altering the output a bit can you confirm? For example I ask the model to respond with dashes between numbers like 5 - 5 - 5 but i notice when passed to the text to speech it is five five five and not including the dashes. I need dashes or other XML in their for eleven labs to correctly pronounce, say or pause, how can I do this? ---> https://elevenlabs.io/docs/product/speech-synthesis/prompting Thanks!

Vapi

11/14/2024, 5:53 PM

@AiTitus :) Hello! It sounds like you're encountering an issue where the formatting of the output from the language model isn't being preserved when passed to the text-to-speech (TTS) system. To ensure that the TTS system, such as ElevenLabs, correctly interprets the output with dashes or other SSML (Speech Synthesis Markup Language) tags, you'll need to make sure that the output from the language model includes these characters or tags as part of the response. If the model is currently set up to remove or alter these characters, you might need to adjust the prompt or the post-processing of the model's output to ensure that the desired formatting is maintained. This could involve modifying the prompt to explicitly instruct the model to include dashes or SSML tags, or it could involve a post-processing step where you add these elements back into the model's output before passing it to the TTS system. If you have control over the prompt or the post-processing logic, you can make these adjustments yourself. If not, you may need to reach out to the development team responsible for the voice pipeline to request these changes. If you need further assistance with this or have specific requirements for the SSML tags, please let us know, and we'll do our best to help you out!

Vapi

11/14/2024, 5:53 PM

You can continue this conversation by mentioning me in the message.

Shubham Bajaj

11/15/2024, 11:53 AM

Hey @AiTitus :) yes we do input formatting before passing it to the voice providers and dashes are replaced over there, If you disable voice formatting it may have a negative effect instead you can use

<break>-or-similar

from 11labs along with replacement plan to add the

post formatting. Do let me know if you have any questions.

AiTitus :)

11/15/2024, 3:33 PM

how do i disable voice formatting?

AiTitus :)

11/15/2024, 3:33 PM

i handle the LLM output on my end to make sure it is good for voice.

AiTitus :)

11/15/2024, 3:33 PM

what do you mean use instead you can use -or-similar from 11labs?

AiTitus :)

11/15/2024, 3:34 PM

will your formatting mess with this format for example? If I can rely on thiss then i can pass that from the LLM to TTS

AiTitus :)

11/15/2024, 3:37 PM

@Shubham Bajaj please provide an example of how this would look: "you can use -or-similar from 11labs along with replacement plan to add the " post formatting."

Shubham Bajaj

11/16/2024, 7:05 PM

Hey @AiTitus :) we have disable the quotes and voice formatting, can you try now?

AiTitus :)

11/18/2024, 3:52 PM

@Shubham Bajaj looks like its stayin MINUS so i am trying the break XML to see if that works, just fyi...call ID: 55ad44a6-6671-4f17-9bec-ec2410b5fcda starts at 1:12

Shubham Bajaj

11/18/2024, 5:31 PM

> stayin MINUS so i am trying the break XML @AiTitus :) can you please what your trying to say here?

Shubham Bajaj

11/18/2024, 5:34 PM

Hey @AiTitus :) just checking how does it goes and what you trying to convey?

AiTitus :)

11/18/2024, 5:34 PM

it is saying the dashes

AiTitus :)

11/18/2024, 5:35 PM

so i will try the see if it works

AiTitus :)

11/19/2024, 5:14 PM

one issue is JSON has to be issue is the " have to be escaped , are you unescaping the " when sending to eleven labs

Shubham Bajaj

11/21/2024, 3:53 PM

Hey @AiTitus :) can you rephrase it I cannot understand what your trying to say.

AiTitus :)

11/21/2024, 3:54 PM

i try and pass it in JSON but then the " " are escaped and elevenlabs does not use it properly when this

AiTitus :)

11/21/2024, 3:54 PM

any suggestions?

AiTitus :)

11/21/2024, 3:55 PM

i inject into the response from LLM like this "Hey @Shubham Bajaj how are you you?" so when it goes to VAPI it works as proper JSON but then it looks like i tbreaks eleven labs feature

Shubham Bajaj

11/22/2024, 12:36 PM

@AiTitus :) we have disabled formatting of

in voice formatting plan, can you share the recent call id where it's still breaking? Apology for not replying within sub-second because of list of tickets sometimes some messages gets lost in the stack.

4 Views

Previous Next