8000 [Serve] concurrency in ray.serve.batch · Issue #53071 · ray-project/ray · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
[Serve] concurrency in ray.serve.batch #53071
Closed
@ArthurBook

Description

@ArthurBook

Description

The current implementation for ray.serve.batch executes the batches synchronously. This throttles throughput for asynchronous methods wrapped in ray.serve.batch.

Use case

This could significantly improve usability when doing I/O calls to an endpoint that expects batching. It also would increase throughput for router-style composed actor where the sub-actors by avoiding the bubble that arises from waiting for the slowest sub-actor.

Metadata

Metadata

Labels

P1Issue that should be fixed within a few weekscommunity-backlogenhancementRequest for new feature and/or capabilityperformanceserveRay Serve Related Issue

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions

    2923
    0