What is Vapi's structure mentioned in Documentation...
# support
v
@Pigeon I apologize for any confusion caused by the missing information. The expected structure for integrating a custom LLM with VAPI should be detailed in the documentation, but if it's not there or not clear, I can certainly help clarify. Typically, the expected structure would define how your custom LLM should communicate with VAPI, including the request and response formats. This would involve specifying the API endpoints, the required headers, the body of the requests, and the expected responses. Since the specific structure isn't provided in the link you mentioned, I recommend checking other sections of the VAPI documentation for a detailed guide on the expected structure. If the information is still not available, please reach out to support for more detailed guidance on this matter. They should be able to provide you with the necessary structure for your custom LLM integration.
You can continue this conversation by mentioning me in the message.
Got it. Tagging @User for additional help.
p
My FastAPI endpoint code:
```python
import time
from typing import Any
from fastapi import APIRouter

router = APIRouter()


@router.post("/completions", status_code=200)
def respond(payload: dict) -> Any:
    """
    Endpoint for receiving responses without authentication.
    """
    user_messages = [msg for msg in payload["messages"] if msg["role"] == "user"]
    if user_messages:
        print(f"User content: {user_messages[-1]['content']}")
        print("--------------------------------")

    response = {
        "id": "chatcmpl-8mcLf78g0quztp4BMtwd3hEj58Uof",
        "object": "chat.completion",
        "created": int(time.time()),
        "model": "gpt-3.5-turbo-0613",
        "system_fingerprint": None,
        "choices": [
            {
                "index": 0,
                "delta": {"content": "Hi, how are you doing?"},
                "logprobs": None,
                "finish_reason": "stop",
            }
        ],
    }

    print("--------------------------------")
    print(response)

    return response
```
I don't hear my hardcoded message on the call.
So, what's the correct response structure I have to send to Vapi for it to create a bot message and TTS it?
Call ID: c4599cea-b165-416c-828f-69c393cbf359
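For reference, the hardcoded body above mixes two OpenAI shapes: `delta` belongs to streaming `chat.completion.chunk` objects, while a non-streaming `chat.completion` carries the text under `message`. For comparison, the non-streaming shape looks roughly like this (a sketch of the OpenAI format, not anything Vapi-specific):
```json
{
  "id": "chatcmpl-8mcLf78g0quztp4BMtwd3hEj58Uof",
  "object": "chat.completion",
  "created": 1732814021,
  "model": "gpt-3.5-turbo-0613",
  "choices": [
    {
      "index": 0,
      "message": {"role": "assistant", "content": "Hi, how are you doing?"},
      "finish_reason": "stop"
    }
  ]
}
```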
Also tried piping through Groq, which is supposed to serve completions in the same format as OpenAI:
```python
from groq import Groq

client = Groq()  # assumed: GROQ_API_KEY set in the environment


@router.post("/completions", status_code=200)
def respond(payload: dict) -> Any:
    """
    Endpoint for receiving responses without authentication.
    """
    user_messages = [msg for msg in payload["messages"] if msg["role"] == "user"]
    if user_messages:
        print(f"User content: {user_messages[-1]['content']}")
        print("--------------------------------")

    chat_completion = client.chat.completions.create(
        messages=[
            {
                "role": "user",
                "content": "Hi, how are ya?",
            }
        ],
        model="llama3-8b-8192",
    )

    print("----------------------------")
    print(chat_completion.choices[0].message.content)
    print(chat_completion.model_dump())

    return chat_completion.model_dump()
```
Groq responded with:
```
2024-11-28 19:13:39 Hi! I'm just a language model, I don't have feelings like humans do, but I'm here to help you with any questions or tasks you have! How can I assist you today?
2024-11-28 19:13:39 {'id': 'chatcmpl-9d94d051-66b6-4f04-b1f8-2a59d74af156', 'choices': [{'finish_reason': 'stop', 'index': 0, 'logprobs': None, 'message': {'content': "Hi! I'm just a language model, I don't have feelings like humans do, but I'm here to help you with any questions or tasks you have! How can I assist you today?", 'role': 'assistant', 'function_call': None, 'tool_calls': None}}], 'created': 1732814021, 'model': 'llama3-8b-8192', 'object': 'chat.completion', 'system_fingerprint': 'fp_179b0f92c9', 'usage': {'completion_tokens': 41, 'prompt_tokens': 16, 'total_tokens': 57, 'completion_time': 0.034166667, 'prompt_time': 0.000146059, 'queue_time': 0.013842211, 'total_time': 0.034312726}, 'x_groq': {'id': 'req_01jdsw797beewax7v08dgv2arz'}}
```
But yet again I can't hear the message Groq responded with TTSed, nor do I get a message in the Vapi log.
Call ID: c7867d0e-0963-4ad0-aca0-c79ea407cfdb
I'm really lost on how this is supposed to work... I'd expect
Hi! I'm just a language model, I don't have feelings like humans do, but I'm here to help you with any questions or tasks you have! How can I assist you today?
to be TTSed in a phone call. But nothing happens...
The log shows no errors either...
@User
My ngrok seems to show the response was sent with 200:
```http
HTTP/1.1 200 OK
Server: uvicorn
Content-Length: 744
Content-Type: application/json
Date: Thu, 28 Nov 2024 17:49:37 GMT
```
With body:
```json
{
  "id": "chatcmpl-61a93f6f-2e8a-4632-a965-c5a1e7ad8310",
  "choices": [
    {
      "finish_reason": "stop",
      "index": 0,
      "logprobs": null,
      "message": {
        "content": "Hi! I'm just a language model, so I don't have emotions or feelings like humans do, but I'm \"functioning\" well and ready to help you with any questions or tasks you have! How about you? How's your day going?",
        "role": "assistant",
        "function_call": null,
        "tool_calls": null
      }
    }
  ],
  "created": 1732816180,
  "model": "llama3-8b-8192",
  "object": "chat.completion",
  "system_fingerprint": "fp_a97cfe35ae",
  "usage": {
    "completion_tokens": 53,
    "prompt_tokens": 16,
    "total_tokens": 69,
    "completion_time": 0.044166667,
    "prompt_time": 0.002902825,
    "queue_time": 0.012564495,
    "total_time": 0.047069492
  },
  "x_groq": {
    "id": "req_01jdsy94smfm79ppb8fpk248xa"
  }
}
```
@User @User so how long does it take to get actual support here?
s
@Pigeon here's how you need to use custom-llm with Vapi: https://github.com/VapiAI/advanced-concepts-custom-llm
p
I'm not using OpenAI
I need a response structure that works
s
We parse all the different LLM responses into the OpenAI Messages format. You just need to stream the response back to Vapi and follow the OpenAI Messages structure.
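To make that concrete: assuming Vapi consumes the standard OpenAI streaming wire format (the reply above says to follow the OpenAI structure, and the linked repo streams), each server-sent event carries a `chat.completion.chunk` whose text sits under `delta`, and the stream ends with a `[DONE]` sentinel. A rough sketch of what goes over the wire (ids and timestamps are placeholders):
```
data: {"id": "chatcmpl-123", "object": "chat.completion.chunk", "created": 1732814021, "model": "llama3-8b-8192", "choices": [{"index": 0, "delta": {"role": "assistant", "content": "Hi, "}, "finish_reason": null}]}

data: {"id": "chatcmpl-123", "object": "chat.completion.chunk", "created": 1732814021, "model": "llama3-8b-8192", "choices": [{"index": 0, "delta": {"content": "how are ya?"}, "finish_reason": null}]}

data: {"id": "chatcmpl-123", "object": "chat.completion.chunk", "created": 1732814021, "model": "llama3-8b-8192", "choices": [{"index": 0, "delta": {}, "finish_reason": "stop"}]}

data: [DONE]
```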
@Pigeon what did you find wrong here? I wanted to understand your POV to help you better.
This is very old
and not working
This example isn't streaming
And now you're saying that I must stream, so what's the actual truth?
s
@Pigeon Yes, you're correct, the docs are outdated, and you need to stream the response back to Vapi. Sorry for the inconvenience caused to you.
p
So maybe update them? People come, try to implement it, and waste many hours on something that is completely false...
Thanks, both OpenAI and Groq work with the new code you provided (https://github.com/VapiAI/advanced-concepts-custom-llm) using AsyncOpenAI and AsyncGroq.
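For anyone who lands on this thread later, here is a minimal sketch of the streaming version that worked, modeled on the pattern in the linked repo rather than copied from it. It assumes the `groq` SDK's `AsyncGroq` client, FastAPI's `StreamingResponse`, and GROQ_API_KEY set in the environment; the `event_stream` helper name is just illustrative:
```python
import json
from typing import Any

from fastapi import APIRouter
from fastapi.responses import StreamingResponse
from groq import AsyncGroq

router = APIRouter()
client = AsyncGroq()  # reads GROQ_API_KEY from the environment


@router.post("/completions", status_code=200)
async def respond(payload: dict) -> Any:
    """Stream OpenAI-format chunks back to Vapi as server-sent events."""

    async def event_stream():
        # Forward the conversation Vapi sent us straight to the model.
        stream = await client.chat.completions.create(
            messages=payload["messages"],
            model="llama3-8b-8192",
            stream=True,
        )
        async for chunk in stream:
            # Each chunk is already an OpenAI-style chat.completion.chunk.
            yield f"data: {json.dumps(chunk.model_dump())}\n\n"
        yield "data: [DONE]\n\n"

    return StreamingResponse(event_stream(), media_type="text/event-stream")
```
The key difference from the earlier attempts is that the completion is forwarded chunk by chunk as it arrives, rather than returned as a single JSON body once the model finishes.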
s
@Pigeon checking if this ticket can be closed?
p
Yes, thank you