- vllm Public
  Forked from vllm-project/vllm
  A high-throughput and memory-efficient inference and serving engine for LLMs
  Python · Apache License 2.0 · Updated Oct 31, 2024
- xla Public
  Forked from pytorch/xla
  Enabling PyTorch on XLA Devices (e.g. Google TPU)
  C++ · Other · Updated Mar 30, 2024