Halil
04/11/2025, 7:47 AM
Vapi Support Bot
04/11/2025, 7:49 AM
`onNoPunctuationSeconds` is set to 1.3 seconds, which means the system waits this long when there's no punctuation before determining the user has finished speaking. According to [information I found](https://discord.com/channels/1211482211119796234/1211483291191083018/threads/1339201697188286525), tweaking this value can help with latency.
## Prompt Size Considerations
You mentioned reducing your prompt size (currently at 314 words). It's worth noting that [information suggests](https://discord.com/channels/1211482211119796234/1211483291191083018/threads/1333880245681393756) prompts longer than 2,500 tokens may lead to performance issues. Your current prompt is likely below this threshold, but further optimization might help.
## Other Potential Factors
- Network latency between your servers and VAPI
Vapi Support Bot
04/11/2025, 7:49 AM
`silenceTimeoutSeconds`, `responseDelaySeconds`, and `llmRequestDelaySeconds`
## Suggested Actions
1. Try enabling Smart Endpointing in your Start Speaking Plan
2. Reduce your `onNoPunctuationSeconds` value (perhaps to 0.8-1.0 seconds)
3. Check your network connectivity and server configuration
4. Consider testing with a different model if possible
5. Review your server logs for any errors or warnings
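
Action 2 could look roughly like the sketch below, which builds the relevant part of an assistant config as a Python dict and prints it as JSON. The nesting under `startSpeakingPlan.transcriptionEndpointingPlan` is an assumption based on the speech configuration docs listed under Sources, and `build_start_speaking_plan` is a hypothetical helper, not part of any Vapi SDK. Check the field names against your own assistant config before relying on it.

```python
import json


def build_start_speaking_plan(on_no_punctuation_seconds: float = 0.8) -> dict:
    """Return a config fragment with a lowered onNoPunctuationSeconds.

    Assumption: the setting lives under
    startSpeakingPlan.transcriptionEndpointingPlan in the assistant config.
    """
    if on_no_punctuation_seconds <= 0:
        raise ValueError("on_no_punctuation_seconds must be positive")
    return {
        "startSpeakingPlan": {
            "transcriptionEndpointingPlan": {
                "onNoPunctuationSeconds": on_no_punctuation_seconds,
            }
        }
    }


# 0.8 is the low end of the range suggested above (down from 1.3).
print(json.dumps(build_start_speaking_plan(0.8), indent=2))
```

Lowering the value trades latency for a higher chance of cutting in while the caller is mid-sentence, so it may be worth stepping down gradually rather than jumping straight to 0.8.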
Would you like more specific information about any of these potential solutions? I'm happy to dive deeper into any area that might help resolve your latency issues!
Sources:
- [Discord conversation about model performance](https://discord.com/channels/1211482211119796234/1211483291191083018/threads/1333224316053225582)
- [VAPI documentation on speech configuration](https://docs.vapi.ai/customization/speech-configuration)
- [Discord conversation about onNoPunctuationSeconds](https://discord.com/channels/1211482211119796234/1211483291191083018/threads/1339201697188286525)
- [Discord conversation about prompt token length](https://discord.com/channels/1211482211119796234/1211483291191083018/threads/1333880245681393756)
Halil
04/11/2025, 7:50 AM
Vapi Support Bot
04/11/2025, 7:52 AM
`silenceTimeoutSeconds`, `responseDelaySeconds`, and `llmRequestDelaySeconds`)
- TURN server latency (if it's showing 1200-2000ms)
- Issues with call transfers [source](https://discord.com/channels/1211482211119796234/1211483291191083018/threads/1332286268805087272)
### 4. Contact VAPI Support
For the most thorough analysis, contact VAPI support and provide these call IDs. They can:
- Check detailed logs associated with these specific calls
- Identify patterns or issues causing the delays
Vapi Support Bot
04/11/2025, 7:52 AM
Pigeon
04/11/2025, 8:13 AM
Pigeon
04/11/2025, 8:14 AM
`[INFO]` tag
DomnulZeu
04/11/2025, 12:53 PM
hj.ong9
04/11/2025, 4:32 PM
Vapi
04/11/2025, 8:12 PM
Vapi
04/11/2025, 8:13 PM
"smartEndpointingPlan": {
  "provider": "livekit",
  "waitFunction": "50 + 200 * x"
}
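
To get a feel for what the `waitFunction` above implies, here is a minimal Python sketch that evaluates `50 + 200 * x` at a few points. It assumes that `x` is a value between 0 and 1 and that the result is a wait time in milliseconds; neither is stated in this thread, so treat both as assumptions. `wait_ms` is a hypothetical helper for illustration only.

```python
def wait_ms(x: float) -> float:
    """Evaluate the waitFunction from the config above: 50 + 200 * x.

    Assumptions: x lies in [0, 1] and the result is in milliseconds.
    """
    if not 0.0 <= x <= 1.0:
        raise ValueError("x is assumed to lie in [0, 1]")
    return 50 + 200 * x


for x in (0.0, 0.5, 1.0):
    print(f"x={x:.1f} -> wait {wait_ms(x):.0f} ms")
```

Under those assumptions the wait grows linearly from 50 ms at `x = 0` to 250 ms at `x = 1`.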