Get artefacts in voice with gpt4o Vapi AI #support

Get artefacts in voice with gpt4o

DomnulZeu

05/15/2024, 10:24 AM

Hey I am getting a lot of artefacts like EXAMPLE: "Voice input: Here’s a comprehensive look at what might lie ahead.. . ### **1." The model then says outloud hashtag hashtag hashtag asterisk asterisk 1. I tried fixing it with the prompt but it doesnt work. Any way to parse artefacts out of LLM generation since it ruins the conversations?

Vapi Support

05/15/2024, 10:24 AM

Hi @DomnulZeu, I'm an AI assistant for Vapi. I'll analyze available sources and get back to you shortly.

Vapi Support

05/15/2024, 10:24 AM

----------------------------- **While I generate an answer, here are some sources that might be helpful**: * [Web Calling]() * [Get Assistant]() * [Core Models]() -----------------------------

Vapi Support

05/15/2024, 10:25 AM

### Answer To address the issue of artifacts in voice outputs with GPT-4, you can apply some parsing and cleaning techniques to filter out unwanted characters before converting the text back to speech. Here is a simple example of how you might achieve this using JavaScript:

Copy code

javascript
function cleanText(text) {
    // Remove unwanted artifacts such as hashtags and asterisks
    return text.replace(/[#*]+/g, '');
}

const assistantOptions = {
    name: "Vapi’s Pizza Front Desk",
    firstMessage: "Vappy’s Pizzeria speaking, how can I help you?",
    transcriber: {
        provider: "deepgram",
        model: "nova-2",
        language: "en-US",
    },
    voice: {
        provider: "playht",
        voiceId: "jennifer",
    },
    model: {
        provider: "openai",
        model: "gpt-4",
        messages: [
            {
                role: "system",
                content: `You are a voice assistant for Vappy’s Pizzeria, a pizza shop located on the Internet.
                Your job is to take the order of customers calling in. The menu has only 3 types
                of items: pizza, sides, and drinks. Be sure to introduce the menu items, don't assume that the caller knows what
                is on the menu. Once you have all the information you need pertaining to their order, you can
                end the conversation. Use casual and short responses.`,
            },
        ],
    },
};

// Clean the system prompt
assistantOptions.model.messages[0].content = cleanText(assistantOptions.model.messages[0].content);

vapi.start(assistantOptions);

This script cleans the text before it is processed by the TTS engine [(1)]() . For more detailed configurations, refer to Vapi’s [Quickstart Documentation]().

Sahil

05/16/2024, 10:32 AM

Did you write in the prompt that you shouldn't output # or anything?

luke199335

05/16/2024, 6:50 PM

It's not very clear how implement this code through your platform. Can you give to us more details? I've the same issue.

Sahil

05/16/2024, 7:04 PM

It should be fixed can you try once again?

luke199335

05/16/2024, 7:31 PM

unfortunately I receive in the answer soemtimes unwanted characters as **

Sahil

05/16/2024, 7:34 PM

can you send me the call_id with timestamp

luke199335

05/16/2024, 7:59 PM

I added in my prompt this instruction: * Always remove hashtags and asterisks from your answers and other unwanted artifacts such as /[#*]+/g, ''. but absolutely it doesn't work... Sometimes, especially when it uses the knowledge base, instead of giving to me the name of one product for example it says "* *"

Sahil

05/16/2024, 8:08 PM

Can you write a rule in like "you're a voice assistant, respond with 1-2 sentences in every turn and it should be in text format"

luke199335

05/16/2024, 8:16 PM

I tried but it doesn't work. Always **...

05/16/2024, 8:22 PM

I am also having this issue.

Sahil

05/16/2024, 8:22 PM

can you try gpt4 turbo once and see if this issue is still there

Sahil

05/16/2024, 8:22 PM

also lower your token to 250

05/16/2024, 8:32 PM

This seemed to correct the issue

Sahil

05/16/2024, 8:34 PM

@luke199335 can you try it as well?

luke199335

05/16/2024, 8:40 PM

Yes it looks that now it works... unfortunately it costs more and the latency is higher 😦

Sam

05/16/2024, 8:45 PM

this issue wasn't there yesterday so it's not related to gpt4o.

luke199335

05/16/2024, 9:06 PM

Unfortunately after some test also with GPT4Turbo and 250 token I receive answers with ** ... I don't unserstand how fix this issue.

Sam

05/16/2024, 9:12 PM

yes i even removed all ** from all the system prompts and still the same problem

Sam

05/16/2024, 9:15 PM

and even with GPT 4 turbo the ** problem still presists!!!

Sam

05/17/2024, 12:49 PM

any luck on fixing the issue??

Sahil

05/17/2024, 1:06 PM

You need to fix it via prompting. You can write something like “you're a voice assistant, respond with 1-2 sentences in every turn"

luke199335

05/17/2024, 7:32 PM

It didn't help to me

Sahil

05/18/2024, 9:30 AM

@luke199335 and @DomnulZeu Can you add this rule that states you need to provide the format in text format?

luke199335

05/18/2024, 12:36 PM

Where?

Sahil

05/18/2024, 12:55 PM

in your system prompt

Sahil

05/18/2024, 5:19 PM

https://cdn.discordapp.com/attachments/1240248514668265472/1241440068607148044/image.png?ex=664a34a9&is=6648e329&hm=17297b82c133f76a2e660a65e1fb187c70420d5d34efa23010d9a707cccdf319&

Sahil

05/18/2024, 5:19 PM

For couple of people prompting is working and solving this issue @luke199335 @Sam

Sahil

05/19/2024, 2:44 AM

Added backend filter to remove it. So, you won't be facing any issues now. Can you please confirm it? @luke199335 @Sam @DomnulZeu

luke199335

05/19/2024, 7:27 AM

Ok, I'll try with system prompt. But what do you mean with "add backend filter" through the web platform? Thank you

Sahil

05/19/2024, 7:33 AM

By backend filter means, like we preprocess the text in order to make quality of the text you receive better. In reality if we just directly use deepgram with streaming then the quality will be very bad so we process few words to make it better.

luke199335

05/19/2024, 5:01 PM

Now the problem is that it doesn't say asteriks, *, anymore but it absolutely doesn't pronounce the name of my products. For example if my product is "1. [name product] is good for...", the system say "1. is good for..." so it misses the product name. I notice this problem when for example the name of the product is in strong/bold on gpt. How can I say to the system to pronounce these word but avoiding the asteriks or other special characters?

luke199335

05/21/2024, 10:10 AM

Any solution, please?

Sahil

05/21/2024, 12:02 PM

It is fixed.

luke199335

05/21/2024, 1:02 PM

Sorry, but I've always the same problem. For example if my product is "1. [name product] is good for...", the system say "1. is good for..." so it misses the product name.

Sahil

05/21/2024, 1:41 PM

it sounds like make.com issue or things are not populated properly.

luke199335

05/21/2024, 4:30 PM

Yes I'm using Vapi Function Calling with "Make" custom webhook amd - "make request" HTTP from Flowise API, with these settings: Method: POST Body type: Raw Content type: JSON (application/json) Request content: {"question": "1. message: functionCall: parameters: Question"} Pare response: Yes - and Webhook response: Status: 200 Body: 2. data: text Any setting suggested in "Make" or Vapi Function Calling to solve this issue?

Sahil

05/21/2024, 5:34 PM

Can you send a call_id so that I can see what is the exact payload which was sent during the call?

Sahil

05/21/2024, 5:34 PM

and can you please create a new support thread?

3 Views

Previous Next