Stars
A Datacenter Scale Distributed Inference Serving Framework
A lightweight data processing framework built on DuckDB and 3FS.
Cost-efficient and pluggable Infrastructure components for GenAI inference
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
verl: Volcano Engine Reinforcement Learning for LLMs
Train transformer language models with reinforcement learning.
🍒 Cherry Studio is a desktop client that supports multiple LLM providers.
Virtual Kubelet is an open source Kubernetes kubelet implementation.
🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.
A Perspective-powered, user-editable Ray dashboard via Ray Serve
Distribute and run AI workloads magically in Python, like PyTorch for ML infra.
Efficient and easy multi-instance LLM serving
The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.
It is a high-performance causal inference (statistical model) computing library based on OLAP, which solves the performance bottleneck of the existing statistical model library (R/Python) under big…
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
A composable and fully extensible C++ execution engine library for data management systems.
FastAPI framework, high performance, easy to learn, fast to code, ready for production
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …
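《Hello 算法》(Hello Algo): a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Supports code in Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart. The Simplified and Traditional Chinese editions are updated in sync; the English version is in progress.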
Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage/tracing and metadata. Runs and scales everywhere Python does.
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
A tool for automatically discovering parallelisms in Python programs
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
Reliable agent framework built on top of OpenAI Assistants API. (Responses API soon)