MCP Gateway is a reverse proxy and management layer for Model Context Protocol (MCP) servers, enabling scalable, session-aware routing and lifecycle management of MCP servers in Kubernetes environments.
- Overview
- Key Concepts
- Architecture
- Features
- Getting Started – Local Deployment
- Getting Started – Cloud Deployment (Azure)
This project provides:
- A data gateway for routing traffic to MCP servers with session affinity.
- A control plane for managing the MCP server lifecycle (deploy, update, delete).
- Enterprise-ready integration points including telemetry, access control and observability.
- MCP Server: A server implementing the Model Context Protocol, which typically exposes SSE or streamable HTTP endpoints.
- Adapters: Logical resources representing MCP servers in the gateway, managed under the `/adapters` scope. Designed to coexist with other resource types (e.g., `/agents`) in a unified AI development platform.
- Session-Aware Stateful Routing: Ensures that all requests with a given `session_id` are consistently routed to the same MCP server instance.
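
The session-aware routing idea can be illustrated with a small sketch. This is not the gateway's implementation: the `SessionRouter` class is hypothetical, an in-memory dict stands in for a distributed session store, and round-robin assignment of new sessions is an assumed policy chosen only for illustration.

```python
from itertools import cycle


class SessionRouter:
    """Toy session-affinity router (hypothetical, for illustration only)."""

    def __init__(self, instances):
        # Assumed policy: new sessions are assigned round-robin.
        self._next_instance = cycle(instances)
        # Stand-in for a distributed session store: session_id -> instance.
        self._sessions = {}

    def route(self, session_id):
        # Pin a session to an instance on first sight, then always reuse it.
        if session_id not in self._sessions:
            self._sessions[session_id] = next(self._next_instance)
        return self._sessions[session_id]


router = SessionRouter(["mcp-a-0", "mcp-a-1"])
first_target = router.route("session-1")
# Every later request with the same session_id reaches the same instance.
assert router.route("session-1") == first_target
```

Because the mapping lives in a store rather than in the proxy process, any gateway replica could in principle resolve the same `session_id` to the same instance.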
flowchart LR
subgraph Clients["Clients"]
DataClient["Agent/MCP Data<br>Client"]
MgmtClient["Server Management<br>Client"]
end
subgraph DataPlane["Data Plane"]
Routing["Distributed Routing"]
end
subgraph ControlPlane["Control Plane"]
DeploymentManagement["Deployment Management"]
MetadataManagement["Metadata Management"]
end
subgraph Gateway["MCP Gateway"]
Auth["AuthN - Bearer<br>AuthZ - RBAC/ACL"]
Auth2["AuthN - Bearer<br>AuthZ - RBAC/ACL"]
DataPlane
ControlPlane
end
subgraph Cluster["Kubernetes Cluster"]
PodA["Server Pod<br>mcp-a-0"]
PodA1["Server Pod<br>mcp-a-1"]
PodB["Server Pod<br>mcp-b-0"]
end
DataClient -- SSE/<br>Streamable HTTP --> Auth
MgmtClient -- "CRUD /adapters" --> Auth2 --> ControlPlane
Auth --> Routing
Routing -- Session Affinity Routing --> PodA
Routing --> PodA1 & PodB
MetadataManagement --> Metadata[("Server<br>Metadata")]
DeploymentManagement -- "Deployment/Status Check"--> Cluster
Control plane endpoints for managing MCP servers:
- `POST /adapters`: Deploy and register a new MCP server.
- `GET /adapters`: List all MCP servers the user can access.
- `GET /adapters/{name}`: Retrieve metadata for a specific adapter.
- `GET /adapters/{name}/status`: Check the deployment status.
- `GET /adapters/{name}/logs`: Access the server's running logs.
- `PUT /adapters/{name}`: Update the deployment.
- `DELETE /adapters/{name}`: Remove the server.

Data plane endpoints for routing traffic to MCP servers:
- `GET /adapters/{name}/sse`: Establish an initial SSE connection.
- `POST /adapters/{name}/messages`: Send subsequent requests using `session_id`.
- `POST /adapters/{name}/mcp`: Establish a streamable HTTP connection.
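
As a convenience, the data plane paths for one adapter can be derived from a base URL. The helper below is a hypothetical sketch, not part of the project. It only assembles the three paths listed above and makes no assumption about how `session_id` is transmitted.

```python
from urllib.parse import quote


def data_plane_urls(base_url, adapter_name):
    """Return the data plane URLs for one adapter (hypothetical helper)."""
    # Percent-encode the adapter name so it is safe as a single path segment.
    base = f"{base_url.rstrip('/')}/adapters/{quote(adapter_name, safe='')}"
    return {
        "sse": f"{base}/sse",            # initial SSE connection (GET)
        "messages": f"{base}/messages",  # follow-up requests (POST)
        "mcp": f"{base}/mcp",            # streamable HTTP (POST)
    }


urls = data_plane_urls("http://localhost:8000", "mcp-example")
print(urls["mcp"])
```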
- Authentication and authorization support (production mode).
- Stateless reverse proxy with a distributed session store (production mode).
- Kubernetes-native deployment using StatefulSets and headless services.
docker run -d -p 5000:5000 --name registry registry:2.7
Build and push the MCP server images to your local registry (`localhost:5000`).
docker build -f mcp-example-server/Dockerfile mcp-example-server -t localhost:5000/mcp-example:1.0.0
docker push localhost:5000/mcp-example:1.0.0
(Optional) Open `dotnet/Microsoft.McpGateway.sln` with Visual Studio.
Publish the MCP Gateway image by right-clicking `Microsoft.McpGateway.Service` in Visual Studio and selecting Publish, or run:
dotnet publish dotnet/Microsoft.McpGateway.Service/src/Microsoft.McpGateway.Service.csproj -c Release /p:PublishProfile=localhost_5000.pubxml
Apply the deployment manifests:
kubectl apply -f deployment/k8s/local-deployment.yml
Forward the gateway service port:
kubectl port-forward -n adapter svc/mcpgateway-service 8000:8000
- Import the OpenAPI definition from `openapi/mcp-gateway.openapi.json` into tools like Postman, Bruno, or Swagger Editor.
- Send a request to create a new adapter resource:

      POST http://localhost:8000/adapters
      Content-Type: application/json

      {
        "name": "mcp-example",
        "imageName": "mcp-example",
        "imageVersion": "1.0.0",
        "description": "test"
      }

- After deploying the MCP server, use a client like MCP Inspector to test the connection.

  To connect to the deployed `mcp-example` server, use:
  - `http://localhost:8000/adapters/mcp-example/mcp` (Streamable HTTP)

  For other servers:
  - `http://localhost:8000/adapters/{name}/mcp` (Streamable HTTP)
  - `http://localhost:8000/adapters/{name}/sse` (SSE)
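
If you prefer a script over a GUI client, the adapter creation request can be sketched with the Python standard library. The function names here are hypothetical. The payload fields mirror the example request above, and the response format is not assumed.

```python
import json
import urllib.request


def build_create_adapter_request(base_url, name, image_name, image_version, description=""):
    """Build (but do not send) a POST /adapters request."""
    payload = {
        "name": name,
        "imageName": image_name,
        "imageVersion": image_version,
        "description": description,
    }
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/adapters",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request):
    """Send the request and return (status code, body text)."""
    with urllib.request.urlopen(request) as response:
        return response.status, response.read().decode("utf-8")


request = build_create_adapter_request(
    "http://localhost:8000", "mcp-example", "mcp-example", "1.0.0", "test"
)
```

With the port-forward from the earlier step still running, `send(request)` submits the request to the local gateway.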
To remove all deployed resources, delete the Kubernetes namespace:
kubectl delete namespace adapter
- An active Azure subscription with Owner access
- Install Azure CLI
- Install Docker Desktop
The cloud-deployed service requires authentication. The steps below configure basic bearer token authentication using Azure Entra ID.
- Go to App Registrations
- Create a single-tenant app registration
- Add a platform - Mobile and desktop applications
- Under Redirect URIs, add: `http://localhost`
- Copy the Application (client) ID and Directory (tenant) ID from the overview page
Run the deployment script:
deployment/azure-deploy.ps1 -ResourceGroupName <resourceGroupName> -ClientId <appClientId> -Location <azureLocation>
Parameters:
| Name | Description |
|---|---|
| `ResourceGroupName` | All lowercase, letters and numbers only |
| `ClientId` | Client ID from your app registration |
| `Location` | Azure region (default: `westus3`) |
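
The `ResourceGroupName` constraint can be checked before running the script. The helper below is a hypothetical sketch that encodes only the rule stated in the table (lowercase letters and digits). It does not model any additional limits that Azure itself may impose, such as length.

```python
import re

# Only the rule from the parameter table: lowercase letters and digits.
_RESOURCE_GROUP_PATTERN = re.compile(r"[a-z0-9]+")


def is_valid_resource_group_name(name):
    """Return True if the name uses only lowercase letters and digits."""
    return _RESOURCE_GROUP_PATTERN.fullmatch(name) is not None


for candidate in ["mcpgateway01", "MCP-Gateway"]:
    print(candidate, is_valid_resource_group_name(candidate))
```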
This script will:
- Create a resource group named `<resourceGroupName>`
- Deploy Azure infrastructure via Bicep templates

  | Resource Name | Resource Type |
  |---|---|
  | `acr<resourceGroupName>` | Container Registry |
  | `cosmos<resourceGroupName>` | Azure Cosmos DB Account |
  | `mg-aag-<resourceGroupName>` | Application Gateway |
  | `mg-ai-<resourceGroupName>` | Application Insights |
  | `mg-aks-<resourceGroupName>` | Kubernetes Service (AKS) |
  | `mg-identity-<resourceGroupName>` | Managed Identity |
  | `mg-pip-<resourceGroupName>` | Public IP Address |
  | `mg-vnet-<resourceGroupName>` | Virtual Network |

- Deploy Kubernetes resources (including `mcp-gateway`) to the provisioned AKS cluster
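
The naming scheme in the resource table can be expressed as a small lookup, which is handy when scripting follow-up commands against the deployment. The function is a hypothetical helper that reproduces only the names listed above. It does not query Azure.

```python
def expected_resource_names(resource_group_name):
    """Map each resource type to the name the naming scheme implies."""
    rg = resource_group_name
    return {
        "Container Registry": f"acr{rg}",
        "Azure Cosmos DB Account": f"cosmos{rg}",
        "Application Gateway": f"mg-aag-{rg}",
        "Application Insights": f"mg-ai-{rg}",
        "Kubernetes Service (AKS)": f"mg-aks-{rg}",
        "Managed Identity": f"mg-identity-{rg}",
        "Public IP Address": f"mg-pip-{rg}",
        "Virtual Network": f"mg-vnet-{rg}",
    }


# "mcpgateway01" is a made-up resource group name used for illustration.
for resource_type, resource_name in expected_resource_names("mcpgateway01").items():
    print(f"{resource_type}: {resource_name}")
```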
Note: It's recommended to use Managed Identity for credential-less authentication. This deployment follows that design.
The gateway service pulls the MCP server image from the newly provisioned Azure Container Registry (ACR) during deployment.
Build and push the MCP server image to ACR:
Note: Ensure that Docker Engine is running before proceeding.
az acr login -n acr<resourceGroupName>
docker build -f mcp-example-server/Dockerfile mcp-example-server -t acr<resourceGroupName>.azurecr.io/mcp-example:1.0.0
docker push acr<resourceGroupName>.azurecr.io/mcp-example:1.0.0
- Import the OpenAPI spec from `openapi/mcp-gateway.openapi.json` into Postman, Bruno, or Swagger Editor.
- Acquire a bearer token by running this Python script locally (install the dependency first with `pip install azure-identity`):

      from azure.identity import InteractiveBrowserCredential

      tenant_id = "<your-tenant-id>"
      client_id = "<your-client-id>"

      # Opens a browser window for interactive sign-in.
      credential = InteractiveBrowserCredential(tenant_id=tenant_id, client_id=client_id)
      access_token = credential.get_token(f"{client_id}/.default").token
      print(access_token)

- Send a POST request to create an adapter resource:

      POST http://<resourceGroupName>.<location>.cloudapp.azure.com/adapters
      Authorization: Bearer <token>
      Content-Type: application/json

      {
        "name": "mcp-example",
        "imageName": "mcp-example",
        "imageVersion": "1.0.0",
        "description": "test"
      }

- After deploying the MCP server, use a client like MCP Inspector to test the connection.

  Note: A valid bearer token is still required in the Authorization header when connecting to the server.

  To connect to the deployed `mcp-example` server, use:
  - `http://<resourceGroupName>.<location>.cloudapp.azure.com/adapters/mcp-example/mcp` (Streamable HTTP)

  For other servers:
  - `http://<resourceGroupName>.<location>.cloudapp.azure.com/adapters/{name}/mcp` (Streamable HTTP)
  - `http://<resourceGroupName>.<location>.cloudapp.azure.com/adapters/{name}/sse` (SSE)
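
The cloud endpoints differ from the local ones in two ways: the host is derived from the resource group name and location, and every request carries a bearer token. A minimal sketch of both, using hypothetical helper names and the `GET /adapters/{name}/status` endpoint as the example (the response format is not assumed):

```python
import urllib.request


def gateway_base_url(resource_group_name, location="westus3"):
    """Host pattern used by the cloud deployment URLs in this guide."""
    return f"http://{resource_group_name}.{location}.cloudapp.azure.com"


def build_status_request(base_url, adapter_name, token):
    """Build (but do not send) an authenticated status request."""
    return urllib.request.Request(
        url=f"{base_url}/adapters/{adapter_name}/status",
        headers={"Authorization": f"Bearer {token}"},
        method="GET",
    )


# "mcpgateway01" and "<token>" are placeholders for illustration.
request = build_status_request(gateway_base_url("mcpgateway01"), "mcp-example", "<token>")
print(request.full_url)
```

Passing the request to `urllib.request.urlopen` sends it. A missing or expired token should be expected to result in an authorization error from the gateway.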
To remove all deployed resources, delete the Azure resource group:
az group delete --name <resourceGroupName> --yes
- TLS Configuration: Set up HTTPS on the Azure Application Gateway (AAG) listener using valid TLS certificates.
- Network Security: Restrict incoming traffic within the virtual network and configure Private Endpoints for enhanced network security.
- Telemetry: Enable advanced telemetry, detailed metrics, and alerts to support monitoring and troubleshooting in production.
- Scaling: Adjust scaling for `mcp-gateway` services and MCP servers based on expected load.
- Authentication & Authorization: Set up OAuth 2.0 with Azure Entra ID (AAD) for authentication. Implement fine-grained access control using RBAC or custom ACLs for `adapter`-level permissions.
This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.
When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.
This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.
This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.
The software may collect information about you and your use of the software and send it to Microsoft. Microsoft may use this information to provide services and improve our products and services. You may turn off the telemetry as described in the repository. There are also some features in the software that may enable you and Microsoft to collect data from users of your applications. If you use these features, you must comply with applicable law, including providing appropriate notices to users of your applications together with a copy of Microsoft’s privacy statement. Our privacy statement is located at https://go.microsoft.com/fwlink/?LinkID=824704. You can learn more about data collection and use in the help documentation and our privacy statement. Your use of the software operates as your consent to these practices.