Stars
Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch
DLRover: An Automatic Distributed Deep Learning System
A blazingly fast multi-language serialization framework powered by JIT and zero-copy.
Cloud Native Artifacial Intelligence Model Format Specification
Cost-efficient and pluggable Infrastructure components for GenAI inference
A high-throughput and memory-efficient inference and serving engine for LLMs
AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.
verl: Volcano Engine Reinforcement Learning for LLMs
A FinOps community-driven framework for building best practices, sharing stories, and D536 strengthening the discipline.
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Overview of the peaks dectection algorithms available in Python
An open cloud native capacity solution which helps you achieve ultimate resource utilization in an intelligent and risk-free way.
A native gRPC client & server implementation with async/await support.
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Development repository for the Triton language and compiler
A fast and efficient cloud native application runtime
OceanBase is an enterprise distributed relational database with high availability, high performance, horizontal scalability, and compatibility with SQL standards.
Apache HoraeDB (incubating) is a high-performance, distributed, cloud native time-series database.
Nydus - the Dragonfly image service, providing fast, secure and easy access to container images.
A Cluster API speaking operator for load balancers
A generic design document template for documenting Micro services.
润学全球官方指定GITHUB,整理润学宗旨、纲领、理论和各类润之实例;解决为什么润,润去哪里,怎么润三大问题; 并成为新中国人的核心宗教,核心信念。
Reference implementation of the Filecoin protocol, written in Go
DoChat is a Dockerized WeChat (盒装微信) PC Windows Client for Linux