-
bytedance | ZJU | CSU
- ShangHai
-
20:47
(UTC +08:00) - https://justadogistaken.github.io/
-
aibrix Public
Forked from vllm-project/aibrixCost-efficient and pluggable Infrastructure components for GenAI inference
Go Apache License 2.0 UpdatedJul 5, 2025 -
gateway-api-inference-extension Public
Forked from kubernetes-sigs/gateway-api-inference-extensionGateway API Inference Extension
Jupyter Notebook Apache License 2.0 UpdatedJul 4, 2025 -
llm-d-kv-cache-manager Public
Forked from llm-d/llm-d-kv-cache-managerDistributed KV cache coordinator
Go Other UpdatedJun 26, 2025 -
llm-d Public
Forked from llm-d/llm-dllm-d is a Kubernetes-native high-performance distributed LLM inference framework
Makefile Apache License 2.0 UpdatedJun 25, 2025 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedJun 22, 2025 -
tiktoken-go Public
Forked from pkoukk/tiktoken-gogo version of tiktoken
-
PromptWizard Public
Forked from microsoft/PromptWizardTask-Aware Agent-driven Prompt Optimization Framework
Python MIT License UpdatedMar 21, 2025 -
LMCache Public
Forked from LMCache/LMCacheRedis for LLMs
Python Apache License 2.0 UpdatedMar 21, 2025 -
3FS Public
Forked from deepseek-ai/3FSA high-performance distributed file system designed to address the challenges of AI training and inference workloads.
C++ MIT License UpdatedMar 20, 2025 -
dynamo Public
Forked from ai-dynamo/dynamoA Datacenter Scale Distributed Inference Serving Framework
Rust Apache License 2.0 UpdatedMar 18, 2025 -
preble Public
Forked from WukLab/prebleStateful LLM Serving
Python Apache License 2.0 UpdatedMar 11, 2025 -
sglang Public
Forked from sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 UpdatedMar 3, 2025 -
Mooncake Public
Forked from kvcache-ai/MooncakeMooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
C++ Apache License 2.0 UpdatedFeb 23, 2025 -
llumnix Public
Forked from AlibabaPAI/llumnixEfficient and easy multi-instance LLM serving
Python Apache License 2.0 UpdatedFeb 21, 2025 -
Awesome-ML-SYS-Tutorial Public
Forked from zhaochenyang20/Awesome-ML-SYS-TutorialMy learning notes/codes for ML SYS.
Python Apache License 2.0 UpdatedFeb 20, 2025 -
LLMSys-PaperList Public
Forked from AmberLJC/LLMSys-PaperListLarge Language Model (LLM) Systems Paper List
UpdatedFeb 20, 2025 -
slog Public
Forked from hungrybirder/slogLeveled execution logs for Go
Go Apache License 2.0 UpdatedAug 13, 2024 -
resume Public
Forked from sb2nov/resumeSoftware developer resume in Latex
TeX MIT License UpdatedJul 24, 2024 -
katalyst-core Public
Forked from kubewharf/katalyst-coreKatalyst aims to provide a universal solution to help improve resource utilization and optimize the overall costs in the cloud. This is the core components in Katalyst system, including multiple ag…
Go Apache License 2.0 UpdatedApr 25, 2024 -
katalyst-api Public
Forked from kubewharf/katalyst-apikatalyst aims to provide a universal solution 8000 to help improve resource utilization and optimize the overall costs in the cloud. This repo is the core api for Katalyst, including crd, clientSet, inf…
Go Apache License 2.0 UpdatedApr 25, 2024 -
HolisticTraceAnalysis Public
Forked from facebookresearch/HolisticTraceAnalysisA library to analyze PyTorch traces.
Python MIT License UpdatedMar 3, 2024 -
tuning_playbook Public
Forked from google-research/tuning_playbookA playbook for systematically maximizing the performance of deep learning models.
Other UpdatedFeb 6, 2024 -
kubefin Public
Forked from mrhello369/kubefinUnified cost allocation insights and optimization for Kubernetes across multi-cloud and multi-cluster
Go Apache License 2.0 UpdatedJan 4, 2024 -
parca Public
Forked from parca-dev/parcaContinuous profiling for analysis of CPU and memory usage, down to the line number and throughout time. Saving infrastructure cost, improving performance, and increasing reliability.
TypeScript Apache License 2.0 UpdatedDec 11, 2023 -
distribution Public
Forked from distribution/distributionThe toolkit to pack, ship, store, and deliver container content
Go Apache License 2.0 UpdatedSep 18, 2023 -
clusterdata Public
Forked from alibaba/clusterdatacluster data collected from production clusters in Alibaba for cluster management research
Jupyter Notebook UpdatedFeb 17, 2023 -
-
-
-
buildkit Public
Forked from moby/buildkitconcurrent, cache-efficient, and Dockerfile-agnostic builder toolkit
Go Apache L 2F33 icense 2.0 UpdatedJul 13, 2022