8000 Slow responses with short one word replies · Issue #290 · fixie-ai/ultravox · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

Slow responses with short one word replies #290

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
rojithaDev opened this issue Mar 30, 2025 · 2 comments
Open

Slow responses with short one word replies #290

rojithaDev opened this issue Mar 30, 2025 · 2 comments
Assignees

Comments

@rojithaDev
Copy link

Hi. Ever since switching to neural endpointing, if the human replies with a short word like “sure”, “yes” etc the agent takes 3-4 secs to reply. It doesn’t seem like a significant time to wait but you’d be surprised at how many people think the agent stopped working and tries to talk to it again.

Neural endpointing works significantly better than VAD but this is an issue we are noticing a lot.

Something to note here is that some people may say “sure” a lot faster than others. If I say it in a normal speed it does recognize it most of the time.

Another issue we’re noticing is that during some hours of the day the agent takes noticeably longer to respond. Is there a resource allocation issue you are facing right now?

Thanks.

@zqhuang211
Copy link
Contributor

Thanks for reporting the issue. Our neural VAD is based on Ultravox and takes audio directly as input without a separate speech recognition step. This might be an edge case the model isn’t optimized for yet—we’ll look into it.

There could be occasional usage spikes, but we generally allocate sufficient compute. If you experience this issue frequently, please report it in our Discord channel: https://discord.gg/Qw6KHxv8YB. Discord is the better place for resolving issues related to Ultravox Realtime.

@ericwood8
Copy link

This same issue as rojithaDev . Especially if I pick Jessica voice.
The problem is if the human replies with a short word like “sure”, “yes” etc. the agent takes 3-4 secs to reply.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants
0