ritazh (Rita Zhang) / Starred · GitHub

Organizations

@microsoft @Azure @kubernetes @open-policy-agent @virtual-kubelet @kubernetes-sigs @deislabs


Natural Language Web

Python 2,762 210 Updated May 23, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 16,235 1,290 Updated May 21, 2025

Model Context Protocol (MCP) server for Kubernetes and OpenShift

Go 194 35 Updated May 23, 2025

Composio equips your AI agents & LLMs with 100+ high-quality integrations via function calling

Python 25,326 4,416 Updated May 22, 2025

Cloud Native Agentic AI | Discord: https://bit.ly/kagentdiscord

Python 783 116 Updated May 23, 2025

mcp-use is the easiest way to interact with mcp servers with custom agents

Python 3,251 380 Updated May 22, 2025

⚡ Guidance, samples, and tools for HPC workloads on AKS clusters with RDMA and InfiniBand support, including GPUDirect RDMA.

Shell 12 9 Updated May 14, 2025

GenAI inference performance benchmarking tool

Python 42 12 Updated May 23, 2025

Constrain, log and scan your MCP connections for security vulnerabilities.

Python 661 54 Updated May 22, 2025

A comprehensive security checklist for MCP-based AI tools. Built by SlowMist to safeguard LLM plugin ecosystems.

485 37 Updated Apr 28, 2025

An open source DevOps tool for packaging and versioning AI/ML models, datasets, code, and configuration into an OCI artifact.

Go 799 82 Updated May 23, 2025

📦️ A fast, secure MCP server that extends its capabilities through WebAssembly plugins.

Rust 558 37 Updated May 19, 2025

A Model Context Protocol (MCP) server for Kubernetes that enables AI assistants like Claude, Cursor, and others to interact with Kubernetes clusters through natural language.

Python 488 77 Updated May 6, 2025

hyperlight-wasm is a Rust library crate that enables Wasm modules and components to run inside a lightweight virtual-machine-backed sandbox. It is built on top of Hyperlight.

Rust 596 25 Updated May 21, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 4,081 374 Updated May 23, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 14,572 1,820 Updated May 23, 2025

Kubernetes RBAC authorizing HTTP proxy for a single upstream.

Go 614 204 Updated Apr 25, 2025

Use the GitHub Actions cache to back the Go build

Go 3 Updated Apr 5, 2025

Cost-efficient and pluggable Infrastructure components for GenAI inference

Jupyter Notebook 3,594 349 Updated May 21, 2025

Health checks for Azure N- and H-series VMs.

Shell 41 30 Updated Apr 29, 2025

Gateway API Inference Extension

Jupyter Notebook 298 88 Updated May 23, 2025

Hyperlight is a lightweight Virtual Machine Manager (VMM) designed to be embedded within applications. It enables safe execution of untrusted code within micro virtual machines with very low latenc…

Rust 3,587 125 Updated May 23, 2025

Agentic AI framework for enterprise workflow automation.

Python 1,353 93 Updated Apr 18, 2025

This Kubernetes fork is intended to provide long term support for Kubernetes releases, but is not an official release of the Kubernetes project. For more information, please see https://github.com/…

Go 16 6 Updated May 22, 2025

Basic Streamlit application for testing and displaying multi-GPU LLM timings

Python 9 2 Updated Mar 30, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 47,884 7,558 Updated May 23, 2025

Containerized Python based Framework for running and visualizing benchmark workloads on any Kubernetes/ OpenShift and runtime kinds pods, kata containers and kubevirt virtual machines simply and sa…

Python 23 19 Updated May 21, 2025

LeaderWorkerSet: An API for deploying a group of pods as a unit of replication

Go 447 78 Updated May 23, 2025

📦 Produce secure packages and containers with declarative configurations

Go 175 30 Updated May 22, 2025

InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing

Go 27 4 Updated Nov 27, 2024