- Hangzhou, China
- https://jessestutler.github.io/
Stars
A Go implementation of the Model Context Protocol (MCP), enabling seamless integration between LLM applications and external data sources and tools.
Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
Marks issues and pull requests that have not had recent interaction
FULL v0, Cursor, Manus, Same.dev, Lovable, Devin, Replit Agent, Windsurf Agent, VSCode Agent, Dia Browser & Trae AI (And other Open Sourced) System Prompts, Tools & AI Models.
Underlay and RDMA network solution for Kubernetes, for bare metal, VMs, and any public cloud
Monokaix / go-gitlog
Forked from wadackel/go-gitlog
Go (golang) package for providing a means to handle git-log.
Device plugin for Volcano vGPU that supports hard resource isolation
Large-scale LLM inference engine
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Standardized Serverless ML Inference Platform on Kubernetes
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
Fully open reproduction of DeepSeek-R1
A Cloud Native Batch System (Project under CNCF)
Fortio load testing library, command line tool, advanced echo server and web UI in Go (golang). Lets you specify a set queries-per-second load and record latency histograms and other useful stats.
💥 A Lodash-style Go library based on Go 1.18+ Generics (map, filter, contains, find...)
🐥 A code review bot powered by ChatGPT
Open-source benchmark suite for cloud microservices
Open, Multi-Cloud, Multi-Cluster Kubernetes Orchestration
Virtual whiteboard for sketching hand-drawn like diagrams
Module to automatically maximize the utilization of GPU resources in a Kubernetes cluster through real-time dynamic partitioning and elastic quotas - Effortless optimization at its finest!