-
llama.cpp Public
Forked from ggml-org/llama.cppLLM inference in C/C++
C++ MIT License UpdatedApr 3, 2025 -
pybind11 Public
Forked from pybind/pybind11Seamless operability between C++11 and Python
C++ Other UpdatedApr 2, 2025 -
ktransformers0.24 Public
Forked from kvcache-ai/ktransformersA Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Python Apache License 2.0 UpdatedApr 2, 2025 -
custom_flashinfer Public
Forked from kvcache-ai/custom_flashinferFlashInfer: Kernel Library for LLM Serving
Cuda Apache License 2.0 UpdatedApr 1, 2025 -
-
xxHash Public
Forked from Cyan4973/xxHashExtremely fast non-cryptographic hash algorithm
C Other UpdatedMar 24, 2025 -
prometheus-cpp Public
Forked from jupp0r/prometheus-cppPrometheus Client Library for Modern C++
C++ Other UpdatedMar 22, 2025 -