Custom LLM and Knowledge Base
# support
l
I am looking for VAPI to use my custom LLM tied to my knowledge base so that questions are answered based on my knowledge base. For this, I need VAPI to provide the transcripts to my web method, which interacts with my vector store. Has anyone done this before? I cannot see any docs that explain how this could be done. Is this possible?
v
Hi @LearningAI, I'm an AI assistant for Vapi. While a team member reviews this question, I'll look through relevant sources and see if I can provide suggestions.
---
**While I generate an answer, here are some sources that might be helpful:**
* Frequently Asked Questions
* Introduction
* Custom LLMs
* Create Assistant
---
### Answer
Yes, it's possible to integrate your custom LLM and knowledge base with Vapi. Vapi supports custom LLMs through any OpenAI-compatible endpoint, including your own server. Here's how you can set it up:

1. **Using an LLM provider:** If your custom LLM is hosted with an LLM provider compatible with OpenAI's API, POST your API key to the `/credential` endpoint and specify your custom model in your API requests [(1)]().
2. **Using your own server:** For an LLM hosted on your own server, create an endpoint compatible with the OpenAI client. Your server can then interact with your knowledge base and vector store as needed. POST your server's API key and URL to the `/credential` endpoint, and specify them in your assistant creation with the `custom-llm` model provider.

This setup lets Vapi provide transcripts to your specified endpoint, which can then use your knowledge base to generate responses. For more detailed instructions on setting up your custom LLM with Vapi, refer to the Custom LLMs documentation.
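To make the two steps above concrete, here is a minimal sketch of the payloads involved. The `/credential` endpoint and the `custom-llm` provider name come from the answer above, but the base URL and the JSON field names are illustrative assumptions; check Vapi's Custom LLMs documentation for the exact schema before using this.

```python
import json

VAPI_BASE_URL = "https://api.vapi.ai"  # assumed API base; verify in Vapi's docs

def build_credential_payload(api_key: str, server_url: str) -> dict:
    """Payload for POST /credential registering a custom-llm server.
    Field names are illustrative, not confirmed against Vapi's schema."""
    return {
        "provider": "custom-llm",
        "apiKey": api_key,
        "url": server_url,
    }

def build_assistant_payload(server_url: str) -> dict:
    """Payload for assistant creation using the custom-llm model provider."""
    return {
        "name": "kb-assistant",
        "model": {
            "provider": "custom-llm",
            "url": server_url,           # your OpenAI-compatible endpoint
            "model": "my-custom-model",  # whatever identifier your server expects
        },
    }

if __name__ == "__main__":
    cred = build_credential_payload("sk-...", "https://example.com/v1")
    asst = build_assistant_payload("https://example.com/v1")
    # You would send these with e.g.:
    # requests.post(f"{VAPI_BASE_URL}/assistant", json=asst,
    #               headers={"Authorization": "Bearer <VAPI_API_KEY>"})
    print(json.dumps(asst, indent=2))
```

The point is that Vapi only needs to know *where* your OpenAI-compatible endpoint lives and how to authenticate to it; everything knowledge-base-related happens behind that URL.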
s
I've attached Pinecone as my vector database and deployed it on Vercel, achieving a lightning-fast response time of 1.2 seconds. However, I haven't tested the vector database with a custom LLM yet. I'm willing to give it a try. Could you please share the specific issue you're facing?
l
@Sahil - thanks, that's impressive timing - it would be good to know how you went about connecting your vector db to VAPI.
s
Check out this video

https://www.youtube.com/watch?v=9MD1VM7038Q

to learn how it works. If you have coding knowledge, I recommend using a personal server along with a reliable Pinecone db plan. This combination will significantly reduce latency.
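A sketch of what such a personal server has to do: Vapi's `custom-llm` provider sends the conversation as an OpenAI-style chat-completions request, and your endpoint replies in the same format after consulting the vector store. The handler below is framework-agnostic and stdlib-only; `retrieve_context` and `generate_answer` are hypothetical stubs standing in for the real Pinecone lookup and LLM call.

```python
import json
import time
import uuid

def retrieve_context(query: str) -> str:
    """Stub: a real server would embed `query` and search Pinecone,
    returning the top matching knowledge-base chunks."""
    return "Relevant knowledge-base excerpt for: " + query

def generate_answer(context: str, query: str) -> str:
    """Stub: a real server would call the LLM with the retrieved
    context prepended to the prompt."""
    return f"Based on our docs: {context}"

def handle_chat_completion(body: dict) -> dict:
    """Take an OpenAI-style chat.completions request body and return an
    OpenAI-style response, the shape Vapi's custom-llm provider expects."""
    user_turns = [m["content"] for m in body["messages"] if m["role"] == "user"]
    query = user_turns[-1] if user_turns else ""
    answer = generate_answer(retrieve_context(query), query)
    return {
        "id": f"chatcmpl-{uuid.uuid4().hex[:12]}",
        "object": "chat.completion",
        "created": int(time.time()),
        "model": body.get("model", "custom-llm"),
        "choices": [{
            "index": 0,
            "message": {"role": "assistant", "content": answer},
            "finish_reason": "stop",
        }],
    }

if __name__ == "__main__":
    request = {"model": "my-model",
               "messages": [{"role": "user", "content": "What is your refund policy?"}]}
    print(json.dumps(handle_chat_completion(request), indent=2))
```

On Vercel you would mount this handler behind a `/v1/chat/completions` route; the latency win comes from doing retrieval and generation in one round trip on your own server.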
a
any chance you could share the js (?) code you deployed to Vercel to achieve this? I'm assuming the code on Vercel implements something similar to what's shown in the screenshot, right?
s
Sorry, I can't, because I wrote that code for another client. But I can guide you!
a
appreciated, but I'm giving it a miss for now, as that repo doesn't contain any vector store connection code, nor any OpenAI embeddings code. I'll stick with my n8n solution above for now, even though it might be slightly slower than a Vercel implementation...
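For anyone missing those two pieces: the embeddings + vector-store step reduces to "embed the query, rank stored chunks by cosine similarity". The toy below does that in pure Python, with short hand-made vectors standing in for real OpenAI embeddings and an in-memory list standing in for a Pinecone index; the data is entirely made up for illustration.

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

# In-memory stand-in for a Pinecone index: (text, embedding) pairs.
# In a real setup the embeddings would come from an embeddings API.
INDEX = [
    ("Refunds are issued within 14 days.", [0.9, 0.1, 0.0]),
    ("We ship worldwide via courier.",     [0.1, 0.9, 0.0]),
    ("Support is available 24/7.",         [0.0, 0.2, 0.9]),
]

def search(query_embedding: list[float], top_k: int = 1) -> list[str]:
    """Return the top_k chunks most similar to the query embedding."""
    ranked = sorted(INDEX,
                    key=lambda item: cosine_similarity(query_embedding, item[1]),
                    reverse=True)
    return [text for text, _ in ranked[:top_k]]

if __name__ == "__main__":
    # Pretend this vector is the embedding of "how do refunds work?"
    print(search([0.8, 0.2, 0.1]))
```

Swapping the list for a Pinecone query and the fake vectors for real embedding calls gives the retrieval half of the server; the rest is prompt assembly.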
s
You can use a higher-tier Pipedream subscription and disable the cold-start behavior, which will significantly improve your performance.
a
I'm not using Pipedream but a self-hosted n8n.io installation on a well-equipped VPS, so there's no cold-start issue at all 😉
s
Awesome. ✨ If you need any help feel free to ping me
a
s
This project seems interesting; I'll take a look at it.
a
it defo is 🙂 👍