Can Vapi store the LLM output instead of the transcriber output?
# support
m
When using a custom model, and inspecting the object that Vapi sends to my endpoint, I notice that the messages object contains the transcriber's output, not the LLM output. For example, if the LLM generated "One Two Three", the messages object will contain { "role": "assistant", "content": "1 2 3" }
m
Hey again bro, in the advanced settings of your agent there's server messages, check the model output box and you should now see exactly what the model outputs
same w client messages
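If you're setting it through the API instead of the dashboard, it's roughly this (a quick sketch, not the exact payload; double-check the message-type strings against the Vapi API reference, and the assistant id here is a placeholder):

```typescript
// Sketch: enable model output in server/client messages by patching the assistant.
// Assumes VAPI_API_KEY is set; ASSISTANT_ID and the other message types are placeholders.
const VAPI_API_KEY = process.env.VAPI_API_KEY!;
const ASSISTANT_ID = "your-assistant-id";

async function enableModelOutput(): Promise<void> {
  const res = await fetch(`https://api.vapi.ai/assistant/${ASSISTANT_ID}`, {
    method: "PATCH",
    headers: {
      Authorization: `Bearer ${VAPI_API_KEY}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({
      // "model-output" asks Vapi to forward what the LLM generated,
      // alongside whatever other message types you already receive.
      serverMessages: ["model-output", "transcript", "end-of-call-report"],
      clientMessages: ["model-output", "transcript"],
    }),
  });
  console.log(await res.json());
}

enableModelOutput();
```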
m
Hey! Thanks. Yeah, we've done this already, and what we see is the transcriber output. The example noted above is from inspecting those messages. Wanted to check if there's any way we can configure Vapi to store the LLM output instead.
The concrete use case we have is improving number pronunciation. We found that when the LLM generates numbers as words, they are pronounced better by the voice model. So we want the LLM to always produce numbers as words (e.g. One Two Three, not 1 2 3), and we have some instructions for that in the system prompt. But since Vapi stores the transcriber output in the history, what we see in the history is "1 2 3", and as the conversation goes on, the effect of the digits in the history overrides our system prompt and the LLM starts producing digits in its output. This is also relevant to [this question](https://discord.com/channels/1211482211119796234/1345118993937076347). If we add the suffix "Always respond in words" to the user message, the LLM follows the instruction despite the history containing digits rather than words. But again, we'd like to implement this without a custom model due to latency. And I think if we can get the conversation history to include the LLM output, the effect of the system prompt will persist throughout the conversation.
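For context, here's roughly what that suffix workaround looks like when done through a custom LLM endpoint in front of OpenAI (a hypothetical sketch, non-streaming, with made-up route and model names), which works but adds the extra hop we're trying to avoid:

```typescript
// Hypothetical custom-llm endpoint: append a reminder to the latest user turn
// before forwarding the thread to the model, so digits in the history don't
// drag the assistant back into producing digits.
import express from "express";
import OpenAI from "openai";

const app = express();
app.use(express.json());
const openai = new OpenAI(); // reads OPENAI_API_KEY from the environment

app.post("/chat/completions", async (req, res) => {
  const messages = [...req.body.messages];
  const last = messages[messages.length - 1];
  if (last?.role === "user") {
    last.content = `${last.content}\n\nAlways respond with numbers written out as words.`;
  }
  const completion = await openai.chat.completions.create({
    model: req.body.model ?? "gpt-4o-mini", // placeholder model name
    messages,
  });
  res.json(completion);
});

app.listen(3000);
```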
m
You have a great use case for just fine-tuning an OpenAI model to produce this behavior natively. Would solve both of your problems. Have you tried that? -- What you're talking about is a product of your transcriber transcribing it as "1, 2, 3", those messages adding up in the user-role messages of the thread, and then that affecting the assistant role's output in the thread, yeah?
Storing the LLM output instead of the transcriber output isn't the fix, because the root issue is your user messages from the transcriber causing your assistant to not follow its system prompt. The root of it is: { role: 'user', content: '1 2 3' }, which is filled by the transcriber. So your two options are to find a transcriber that outputs "1 2 3" as "one two three", or to fine-tune a model so that, no matter what the transcriber outputs, the model never strays from outputting { role: 'assistant', content: 'One, Two, Three' }
You get me?
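The dataset only needs to pair digit-style user turns (what the transcriber produces) with word-style assistant turns. Something like this, written as a small script that emits OpenAI's chat fine-tuning JSONL format (the rows here are hypothetical, just to show the shape):

```typescript
// Hypothetical fine-tuning rows: digit-heavy user turns, word-only assistant turns.
import { writeFileSync } from "node:fs";

const system = "Always write numbers as words, never as digits.";

const examples = [
  { user: "my order number is 1 2 3", assistant: "Got it, order number one two three." },
  { user: "call me back at 555 0199", assistant: "Sure, I'll call you back at five five five, zero one nine nine." },
  { user: "the total was $42", assistant: "The total was forty-two dollars." },
];

// One JSON object per line, each with a full messages array.
const jsonl = examples
  .map((ex) =>
    JSON.stringify({
      messages: [
        { role: "system", content: system },
        { role: "user", content: ex.user },
        { role: "assistant", content: ex.assistant },
      ],
    })
  )
  .join("\n");

writeFileSync("numbers-as-words.jsonl", jsonl);
```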
m
Fine-tuning is on our roadmap, but we are still in the data collection process, since it will require some data. Yes, this is the root cause, as you highlighted:
```
The root of it is: { role: 'user', content: '1 2 3' }, which is filled by the transcriber
```
And that is why I don't want to fill the conversation history with the transcriber's output, but with the LLM output. It seems like this is not something I can configure Vapi to do, though. I will try looking for a transcriber that produces words instead of digits. Thanks!
m
the "conversation history" the model sees is the assistant thread, meaning the aggregation of user messages and assistant messages, you'll never be able to fill the user messages with the LLMs output because that will always go into the assistant messages the root here is definitely the transcriber. Even with 50 examples and 10 minutes you could fix the behavior by fine tuning it would be super quick. Let me know if you need help with anything.
I'm going to make the dataset for you, give me 10 minutes
@Mohab
That will 100% fix this issue
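For reference, once the JSONL file exists, kicking off the job is just a couple of calls (a sketch using the OpenAI Node SDK; the file path and base model are placeholders, pick whichever model you're actually running):

```typescript
// Sketch: upload the JSONL dataset and start a fine-tuning job with the OpenAI SDK.
import fs from "node:fs";
import OpenAI from "openai";

const openai = new OpenAI(); // reads OPENAI_API_KEY from the environment

async function startFineTune(): Promise<void> {
  const file = await openai.files.create({
    file: fs.createReadStream("numbers-as-words.jsonl"), // dataset from the sketch above
    purpose: "fine-tune",
  });

  const job = await openai.fineTuning.jobs.create({
    training_file: file.id,
    model: "gpt-4o-mini-2024-07-18", // placeholder base model
  });

  // Poll this job until it finishes, then point your Vapi assistant at the resulting model id.
  console.log("fine-tune job:", job.id);
}

startFineTune();
```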
m
Will give it a try, thanks!
@Mason | Building KOI It is possible in Deepgram to produce numbers as text by setting the [numerals toggle to False](https://developers.deepgram.com/docs/numerals). Is this something I can customize in Vapi?
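Outside Vapi, that toggle is just a query parameter on the Deepgram request, roughly like this (a rough sketch against Deepgram's REST API; the audio URL is a placeholder, and parameter availability per model/endpoint is whatever the linked doc says):

```typescript
// Sketch: direct Deepgram request with numerals=false (and smart_format=false)
// so numbers come back as words rather than digits.
const DEEPGRAM_API_KEY = process.env.DEEPGRAM_API_KEY!;

async function transcribeWithoutDigits(): Promise<void> {
  const res = await fetch(
    "https://api.deepgram.com/v1/listen?model=nova-2&numerals=false&smart_format=false",
    {
      method: "POST",
      headers: {
        Authorization: `Token ${DEEPGRAM_API_KEY}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify({ url: "https://example.com/sample-call.wav" }), // placeholder audio
    }
  );
  const data = await res.json();
  console.log(data.results.channels[0].alternatives[0].transcript);
}

transcribeWithoutDigits();
```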
m
No, unfortunately we don't have control over the API calls between providers; we trade that off for their low-latency improvements and infra. If we did, you could also use Deepgram's find and replace to solve it as well. 100% though, Vapi wants to give us more options when it comes to the provider API calls, but it just takes time
m
Are you referring to [custom keywords](https://docs.vapi.ai/customization/custom-keywords)? Or how exactly would I use find and replace with Vapi?
m
no no, you can't, that's what I'm saying: it's a Deepgram feature but we don't have control of it through Vapi yet: https://developers.deepgram.com/docs/find-and-replace
ah wait
lightbulb
use Deepgram as your custom transcriber and then you can set whatever find and replace or numerals toggle you want
it's a lot more setup than fine tuning though
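The rough shape of that setup is a websocket server sitting between Vapi and Deepgram, so the Deepgram query params are yours to set. This is only a sketch under assumptions: the port, the replace values, and the "transcriber-response" message shape sent back to Vapi are all placeholders, so follow Vapi's custom transcriber docs for the exact protocol.

```typescript
// Sketch: bridge Vapi's custom transcriber websocket to Deepgram live, with
// the numerals/find-and-replace knobs set on the Deepgram side.
import { WebSocketServer, WebSocket } from "ws";

const DEEPGRAM_API_KEY = process.env.DEEPGRAM_API_KEY!;

// Deepgram live endpoint with the knobs we care about:
// numerals=false keeps numbers as words, replace handles any stragglers.
const dgUrl =
  "wss://api.deepgram.com/v1/listen" +
  "?model=nova-2&smart_format=false&numerals=false" +
  "&replace=" + encodeURIComponent("1:one") +
  "&replace=" + encodeURIComponent("2:two");

const wss = new WebSocketServer({ port: 8080 });

wss.on("connection", (vapiSocket) => {
  const dgSocket = new WebSocket(dgUrl, {
    headers: { Authorization: `Token ${DEEPGRAM_API_KEY}` },
  });

  // Forward Vapi's audio frames to Deepgram once the Deepgram socket is open.
  vapiSocket.on("message", (data, isBinary) => {
    if (isBinary && dgSocket.readyState === WebSocket.OPEN) dgSocket.send(data);
  });

  // Forward Deepgram transcripts back to Vapi. The message shape below is an
  // assumption; check Vapi's custom transcriber docs for the exact format.
  dgSocket.on("message", (data) => {
    const result = JSON.parse(data.toString());
    const text = result?.channel?.alternatives?.[0]?.transcript;
    if (text) {
      vapiSocket.send(JSON.stringify({ type: "transcriber-response", transcription: text }));
    }
  });

  vapiSocket.on("close", () => dgSocket.close());
});
```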
m
ok thanks! will take a look
k
Hey @Mohab, checking if this is resolved for you?
m
yes, thank you.
k
Marking this ticket as Solved ✅