Local model API request fails if prompt ingestion takes more than 10 minutes #3621
Labels
bug: Something isn't working
Issue - In Progress: Someone is actively working on this. Should link to a PR soon.
Comments
This may come from the OpenAI module's default 10-minute timeout: a custom timeout would need to be passed for this provider. Other OpenAI-compatible providers, such as Ollama and OpenAI Compatible, appear to face the same issue when used with a slow local model and a large prompt.
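For illustration only, here is a minimal sketch of how a longer timeout could be passed when constructing the client with the `openai` npm package; the base URL, API key, model name, and timeout value are placeholder assumptions for a local OpenAI-compatible server, not Cline's actual configuration:

```typescript
import OpenAI from "openai"

async function main() {
  // Sketch: the openai-node SDK defaults to a 10-minute (600,000 ms) request
  // timeout. Passing `timeout` in the client options raises it client-wide.
  const client = new OpenAI({
    baseURL: "http://localhost:1234/v1", // placeholder: local LM Studio / llama.cpp endpoint
    apiKey: "not-needed",                // placeholder: local servers usually ignore the key
    timeout: 60 * 60 * 1000,             // 1 hour instead of the 10-minute default
  })

  // The timeout can also be overridden per request via request options.
  const completion = await client.chat.completions.create(
    {
      model: "qwen3-32b", // placeholder model name
      messages: [{ role: "user", content: "Hello" }],
    },
    { timeout: 60 * 60 * 1000 },
  )

  console.log(completion.choices[0].message.content)
}

main()
```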
Sorry. Closed by accident.
App Version
3.16.6
API Provider
LM Studio
Model Used
Qwen3-32B
🔁 Steps to Reproduce
I am trying to use a local Qwen3-32B model via llama.cpp. To do so, I use the LM Studio integration and point it at the local server. Everything works fine, but after 10 minutes (600 seconds) the connection is dropped and I get an API Request Failed message. The inference runs on CPU and is quite slow, but I would be happy to let it crunch while I'm doing something else. If I use the tiny Qwen3 0.6B model, the inference is fast enough and everything works as expected (although with very mediocre results).
When it fails, llama.cpp finishes processing the prompt anyway. It then succeeds on retry, since the prompt is already cached.
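As a standalone illustration of the symptom outside Cline (a sketch; the endpoint and model name are placeholders), a request made with the SDK's default options is aborted client-side after about 10 minutes even though the local server keeps ingesting the prompt:

```typescript
import OpenAI from "openai"

async function reproduce() {
  // No explicit timeout, so the SDK's built-in 10-minute limit applies.
  // Placeholder endpoint for a local LM Studio / llama.cpp server.
  const client = new OpenAI({
    baseURL: "http://localhost:1234/v1",
    apiKey: "not-needed",
  })

  try {
    // A prompt large enough that ingestion on CPU takes longer than 10 minutes.
    await client.chat.completions.create({
      model: "qwen3-32b", // placeholder model name
      messages: [{ role: "user", content: "<very large prompt>" }],
    })
  } catch (err) {
    // The request is aborted on the client while the server keeps processing;
    // a retry succeeds because the prompt is then already cached.
    console.error("API request failed:", err)
  }
}

reproduce()
```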
💥 Outcome Summary (Optional)
No response
📄 Relevant Logs or Errors