8000 bcfre / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View bcfre's full-sized avatar
  • Joined Apr 25, 2025

Block or report bcfre

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Envoy AI Gateway is an open source project for using Envoy Gateway to handle request traffic from application clients to Generative AI services.

Go 249 46 Updated May 13, 2025

Cost-efficient and pluggable Infrastructure components for GenAI inference

Jupyter Notebook 3,552 340 Updated May 13, 2025

☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!

Go 167 27 Updated May 13, 2025

Arks is a cloud-native inference framework running on Kubernetes

Go 39 3 Updated May 6, 2025

we want to create a repo to illustrate usage of transformers in chinese

Shell 2,891 481 Updated Aug 18, 2024

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 15,418 1,731 Updated May 8, 2025

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 1,206 182 Updated May 13, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 47,218 7,393 Updated May 13, 2025
0