Pinned
-
auto-deploy-mcp-server (Public)
Automatically deploy the MCP server using a CI/CD pipeline with integrated testing and AI-powered security checks.
Python 1
-
llm-kernel-patch (Public)
Patches the LLM part of the transformer library with a specific LLM CUDA kernel.
Cuda 1
-
Rainference (Public)
Self-hosted LLM inference stack with an OpenAI-compatible API. Deploy LLaMA models on your own bare-metal Kubernetes cluster with built-in billing, analytics, and a management dashboard.
Python 1
-
Supply-Chain-Resilience-ai-agentic-system (Public)
A swarm of specialized AI agents that detect, assess, and autonomously respond to supply chain disruptions in real time, using an MCP server.
Python 1
-
Diffusion.cu (Public)
A high-performance diffusion model built from scratch with core components optimized in CUDA for speed. It features custom Conv2d, GroupNorm, and attention using Flash Attention for ef…