Tags: anrp/vllm
Tags
[MISC] Add VLLM_SLEEP_WHEN_IDLE env arg Signed-off-by: Povilas Kanapickas <povilas@radix.lt>
[BugFix] FA2 MLA Accuracy Issue (vllm-project#18807) Signed-off-by: LucasWilkinson <lwilkinson@neuralmagic.com>
[Bugfix] Mistral tool calling when content is list (vllm-project#18729) Signed-off-by: mgoin <mgoin64@gmail.com>
[BugFix][Attention] Fix sliding window attention in V1 giving incorre… …ct results (vllm-project#17574) Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
[Core][V0] Enable regex support with xgrammar (vllm-project#13228) Signed-off-by: Russell Bryant <rbryant@redhat.com>
[V1][Spec Decode] Update N-gram Proposer Interface (vllm-project#15750) Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
[V1][Spec Decode] Update target_logits in place for rejection sampling ( vllm-project#15427) Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
[V1] Minor V1 async engine test refactor (vllm-project#15075) Signed-off-by: andoorve <murali.andoorveedu@mail.utoronto.ca> Co-authored-by: andoorve <murali.andoorveedu@mail.utoronto.ca>
PreviousNext