-
@alibaba Cloud
- Beijing, China
-
12:56
(UTC +08:00)
Lists (6)
Sort Name ascending (A-Z)
Starred repositories
Cloud Native Artifacial Intelligence Model Format Specification
Full-stack framework for building Multi-Agent Systems with memory, knowledge and reasoning.
Arks is a cloud-native inference framework running on Kubernetes
The Open All-in-One Multimodal AI Agent Stack connecting Cutting-edge AI Models and Agent Infra.
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
A Datacenter Scale Distributed Inference Serving Framework
No fortress, purely open ground. OpenManus is Coming.
A live stream development of RL tunning for LLM agents
Cost-efficient and pluggable Infrastructure components for GenAI inference
🪄 Create rich visualizations with AI
A flexible distributed key-value database that is optimized for caching and other realtime workloads.
Supercharge Your LLM with the Fastest KV Cache Layer
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
GenAI inference performance benchmarking tool
Animated sprite editor & pixel art tool (Windows, macOS, Linux)
Universal LLM Deployment Engine with ML Compilation
[ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding
Kubernetes-friendly ML model management, deployment, and serving.
Get your documents ready for gen AI
JobSet: a k8s native API for distributed ML training and HPC workloads
AI on GKE is a collection of examples, best-practices, and prebuilt solutions to help build, deploy, and scale AI Platforms on Google Kubernetes Engine
Recommendation Algorithm大规模推荐算法库,包含推荐系统经典及最新算法LR、Wide&Deep、DSSM、TDM、MIND、Word2Vec、Bert4Rec、DeepWalk、SSR、AITM,DSIN,SIGN,IPREC、GRU4Rec、Youtube_dnn、NCF、GNN、FM、FFM、DeepFM、DCN、DIN、DIEN、DLRM、MMOE、PLE、ESM…