Tags: shazur/vllm
Tags
[Doc] add common case for long waiting time (vllm-project#5430)
[Build] Guard against older CUDA versions when building CUTLASS 3.x k… …ernels (vllm-project#5168)
[CI] Reduce wheel size by not shipping debug symbols (vllm-project#4602)
[CI/Build] 0.4.0.post1, fix sm 7.0/7.5 binary (vllm-project#3803)
Bump up version to v0.3.2 (vllm-project#2968) This version is for more model support. Add support for Gemma models (vllm-project#2964) and OLMo models (vllm-project#2832).
PreviousNext