My assistant does not say the response of my custo...
# support
u
When using the custom-llm interface, Vapi does not TTS the response sent by my bot. This is what my bot sends:

```js
const response = {
  id: req.body.call.callId,
  object: "chat.completion",
  created: Math.floor(Date.now() / 1000),
  model: "gpt-3.5-turbo-0613",
  system_fingerprint: null,
  choices: [
    {
      index: 0,
      delta: {
        content: messageResponse?.[messageResponse.length - 1]?.content ?? "",
      },
      logprobs: null,
      finish_reason: "stop",
    },
  ],
};
```

In the code we return it with `res.status(201).json(response)`.

callID: d045a553-e98c-49ad-87cd-c121dcfa3c0b

https://cdn.discordapp.com/attachments/1239956548894326885/1239956549288464476/image.png?ex=6644cf06&is=66437d86&hm=1e3aff978374b6fe7c204bc283746a6a00f7ea7187f4ba85854fdbe3206cd1b8&
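A note on the shape above: in the OpenAI chat schema, `delta` only appears in streaming `chat.completion.chunk` objects, while a non-streaming `chat.completion` carries its text under `message`. A minimal comparison of the two shapes (content values are illustrative):

```js
// Non-streaming response: object is "chat.completion" and each choice carries a `message`.
const nonStreamingExample = {
  object: 'chat.completion',
  choices: [
    {
      index: 0,
      message: { role: 'assistant', content: 'Hello!' },
      finish_reason: 'stop',
    },
  ],
};

// Streaming response: each SSE chunk is "chat.completion.chunk" and each choice carries a `delta`.
const streamingChunkExample = {
  object: 'chat.completion.chunk',
  choices: [
    {
      index: 0,
      delta: { content: 'Hel' },
      finish_reason: null,
    },
  ],
};
```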
v
Hi @𝙠𝙝𝙖𝙯𝙙𝙧𝙖𝙜𝙨, I'm an AI assistant for Vapi. I'll analyze available sources and get back to you shortly.
**While I generate an answer, here are some sources that might be helpful**:
* [Connecting Your Custom LLM to Vapi: A Comprehensive Guide]()
* [Inbound Calling]()
* [Dashboard]()
* [Web Calling]()
### Answer

It appears that you are using a custom LLM and encountering issues with the response being processed by Vapi. A few details might need attention:

1. **Response Format**: Ensure that your response format adheres to the OpenAI-compatible schema. The response must include the correct structure. Here is a correct example:
```json
   {
     "choices": [
       {
         "finish_reason": "stop",
         "index": 0,
         "message": {
           "content": "Your response content here...",
           "role": "assistant"
         },
         "logprobs": null
       }
     ],
     "created": 1677664795,
     "id": "chatcmpl-7QyqpwdfhqwajicIEznoc6Q47XAyW",
     "model": "gpt-3.5-turbo-0613",
     "object": "chat.completion",
     "usage": {
       "completion_tokens": 17,
       "prompt_tokens": 57,
       "total_tokens": 74
     }
   }
```

Make sure your response matches this structure [(1)]().

2. **Server and Streaming**: If your custom LLM server does not support streaming completions, it may cause issues. Vapi expects the server to support streaming completions [(2)](). Ensure your server configuration can handle streaming, or adjust your integration accordingly.

3. **Error Handling**: If the assistant stops working after switching to a custom LLM, ensure that you have executed a PATCH request to the `/assistant` endpoint and set `model.urlRequestMetadataEnabled` to false [(3)](), [(4)]().

Please verify these points in your implementation to ensure the responses are correctly formatted and processed by Vapi.
s
Are you using a fine-tuned model?
u
No, my API is a conversational bot that calls ChatGPT along the way and generates a response based on internal states and functions.
s
Can you bring this up in tomorrow's Vapi Office Hour?
u
I'm sorry, I won't be able to do it because I don't speak English.
v
Got it. Tagging @User for additional help.
s
Can you share some network logs showing the payload sent by Vapi and how you handle it?
a
We tried two different ways: one with a status (201) response and another with a streaming response. We also tried sending a pre-designed response just for testing. https://cdn.discordapp.com/attachments/1239956548894326885/1240331840011632640/image.png?ex=66462c8a&is=6644db0a&hm=f35331f3fc1ff7ea1e60003a29a86750c285e44a541fdff695c93ec6b816d0df&
s
@Agus Can you refer to this codebase and use the "/openai-advanced/chat/completions" endpoint? https://github.com/VapiAI/server-side-example-python-flask/blob/main/app/api/custom_llm.py
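For reference, a rough Node sketch of that pattern: an endpoint that forwards the conversation to OpenAI and relays the streamed chunks back to Vapi as server-sent events. It assumes the `openai` npm package (v4+) and an OpenAI-style request body; the route path and model name are illustrative, and this is not the code from the linked repository:

```js
const express = require('express');
const OpenAI = require('openai');

const app = express();
app.use(express.json());

const openai = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });

app.post('/chat/completions', async (req, res) => {
  // Vapi expects an SSE stream of OpenAI-style chat.completion.chunk objects.
  res.setHeader('Content-Type', 'text/event-stream');
  res.setHeader('Cache-Control', 'no-cache');
  res.setHeader('Connection', 'keep-alive');

  // Forward the incoming messages to OpenAI with streaming enabled.
  const stream = await openai.chat.completions.create({
    model: 'gpt-3.5-turbo',
    messages: req.body.messages,
    stream: true,
  });

  // Relay each chunk as a server-sent event, then signal the end of the stream.
  for await (const chunk of stream) {
    res.write(`data: ${JSON.stringify(chunk)}\n\n`);
  }
  res.write('data: [DONE]\n\n');
  res.end();
});

app.listen(3030, () => console.log('Server is running on port 3030'));
```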
a
@User Sure, I'll review it and try to replicate it. I'll keep you posted with any news; thank you for now.
s
Okay.
u
s
call_id please, and did you add `urlRequestMetadataEnabled: false`?
u
call_id: 333a5b05-0a74-488c-b3c7-f96e5d464cb7
Where do I add urlRequestMetadataEnabled?
s
You need to send an API request to disable it. https://docs.vapi.ai/api-reference/assistants/update-assistant
The issue is related to that extra metadata.
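A minimal sketch of that request using Node 18+ global `fetch`, assuming the base URL from the update-assistant reference above and placeholder values for the assistant ID and API key; depending on how the API merges the `model` object, you may need to resend your other model settings alongside this field:

```js
// Hypothetical placeholders: YOUR_ASSISTANT_ID and VAPI_API_KEY must be replaced.
fetch('https://api.vapi.ai/assistant/YOUR_ASSISTANT_ID', {
  method: 'PATCH',
  headers: {
    Authorization: `Bearer ${process.env.VAPI_API_KEY}`,
    'Content-Type': 'application/json',
  },
  // Disable the extra request metadata discussed above.
  body: JSON.stringify({ model: { urlRequestMetadataEnabled: false } }),
})
  .then((res) => res.json())
  .then((assistant) => console.log(assistant));
```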
s
Seems correct to me.
u
That's how I had it when I did this
Do I need to do anything else?
s
Can you send me the request payload and the output response once?
s
```js
const express = require('express');
const app = express();
app.use(express.json());

app.post('/chat/completions', async (req, res) => {
  // Stream OpenAI-style chat.completion.chunk objects back to Vapi as server-sent events.
  res.setHeader('Content-Type', 'text/event-stream');
  res.setHeader('Cache-Control', 'no-cache');
  res.setHeader('Connection', 'keep-alive');

  const dataChunks = [
    {
      id: 'chatcmpl-8c78110d-a5cf-4585-8619-c1f59b714a70',
      object: 'chat.completion.chunk',
      created: 1713300428,
      model: 'gpt-4-1106-preview',
      system_fingerprint: 'fp_5c95a4634e',
      choices: [
        {
          index: 0,
          delta: { content: 'Let me think. ' },
          logprobs: null,
          finish_reason: null,
        },
      ],
    },
    {
      id: 'chatcmpl-5ac71a11-2092-426d-8da3-9140806668aa',
      object: 'chat.completion.chunk',
      created: 1713300431,
      model: 'gpt-4-1106-preview',
      system_fingerprint: 'fp_5c95a4634e',
      choices: [
        {
          index: 0,
          delta: { content: 'still thinking. ' },
          logprobs: null,
          finish_reason: null,
        },
      ],
    },
    {
      id: 'chatcmpl-753f25e2-539e-44d4-8d96-5996409d51d2',
      object: 'chat.completion.chunk',
      created: 1713300434,
      model: 'gpt-4-1106-preview',
      system_fingerprint: 'fp_5c95a4634e',
      choices: [
        {
          index: 0,
          delta: { content: 'still thinking. ' },
          logprobs: null,
          finish_reason: null,
        },
      ],
    },
  ];

  // Write each chunk as a server-sent event, then signal the end of the stream.
  for (let index = 0; index < dataChunks.length; index++) {
    res.write(`data: ${JSON.stringify(dataChunks[index])}\n\n`);
    await new Promise((resolve) => setTimeout(resolve, 5000));
  }
  res.write('data: [DONE]\n\n');
  res.end();

  req.on('close', () => {
    res.end();
  });
});

app.post('/', (req, res) => {
  console.dir(req.body, { depth: null });
  res.json({});
});

app.listen(3030, () => {
  console.log('Server is running on port 3030');
});
```
u
My bad, that code worked. Thanks!
s
Awesome.
a
Thanks @Sahil !