All AI Gateway documentation
Overview
-
AI Gateway Overview
Overview of AI Gateway capabilities.
-
Quickstart
Get started quickly with AI Gateway setup and usage.
-
AI Gateway Capabilities
Learn about the core capabilities of AI Gateway.
-
AI providers
Learn about the various providers supported by AI Gateway.
-
AI Usage Governance
Understand how to manage and govern AI usage effectively.
-
Data Governance
Explore how AI Gateway helps enforce data governance policies.
-
Prompt Engineering
Best practices and tools for designing effective prompts.
-
Guardrails and Content Safety
Implement safeguards to ensure safe and compliant AI outputs.
-
Request Transformations
Customize and transform AI requests with Gateway features.
-
Streaming
Learn how AI Proxy streaming works.
-
Audit log
Learn about AI Gateway logging capabilities.
-
Monitor AI LLM Metrics
Explore how to monitor AI LLM metrics in AI Gateway.
-
Advanced Analytics
Access advanced analytics features in AI Gateway.
AI Gateway plugins
-
AI Azure Content Safety
Use Azure AI Content Safety to check and audit AI Proxy plugin messages before proxying them to an upstream LLM.
-
AI Prompt Decorator
Prepend or append an array of llm/v1/chat messages to a user's chat history.
-
AI Prompt Guard
Check llm/v1/chat or llm/v1/completions requests against a list of allowed or denied expressions.
-
AI Prompt Template
Provide fill-in-the-blank AI prompts to users.
-
AI Proxy
The AI Proxy plugin lets you transform and proxy requests to a number of AI providers and models; a minimal configuration sketch follows this plugin list.
-
AI Proxy Advanced
The AI Proxy Advanced plugin lets you transform and proxy requests to multiple AI providers and models at the same time. This lets you set up load balancing between targets.
-
AI RAG Injector
Create RAG pipelines by automatically injecting content from a vector database.
-
AI Rate Limiting Advanced
Provides rate limiting for the providers used by any AI plugins.
-
AI Request Transformer
Use an LLM service to transform a client request body prior to proxying the request to the upstream server.
-
AI Response Transformer
Use an LLM service to transform the upstream HTTP(S) response prior to forwarding it to the client.
-
AI Semantic Prompt Guard
Semantically and intelligently create allow and deny lists of topics that can be requested across every LLM.
-
AI Sanitizer
Protect sensitive information in client request bodies before they reach upstream services.
-
AI Prompt Compressor
Compress prompts before they are sent to LLMs to reduce costs and improve latency.
-
AI AWS Guardrails
Use AWS Guardrails to validate requests and/or responses in the AI Proxy plugin before forwarding them between clients and upstream LLMs.
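To ground the list above, here is a minimal declarative (decK) sketch of the AI Proxy plugin proxying llm/v1/chat traffic to OpenAI. The route path, model name, and key placeholder are illustrative assumptions; see the plugin reference for the authoritative schema.
```yaml
# Minimal sketch: attach AI Proxy to a route and proxy llm/v1/chat
# traffic to OpenAI. Model name and key are illustrative placeholders.
_format_version: "3.0"
services:
  - name: ai-service
    url: http://localhost:32000   # placeholder upstream; AI Proxy rewrites it
    routes:
      - name: openai-chat
        paths:
          - /chat
        plugins:
          - name: ai-proxy
            config:
              route_type: llm/v1/chat
              auth:
                header_name: Authorization
                header_value: Bearer <OPENAI_API_KEY>   # replace with your key
              model:
                provider: openai
                name: gpt-4o   # assumed model; any supported model works
```
With this applied (for example via decK), requests to /chat are transformed and forwarded to the configured provider and model.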
MCP traffic gateway
-
MCP Traffic Gateway
An introduction to MCP Traffic Gateway capabilities in Kong AI Gateway.
-
Secure MCP traffic
Secure GitHub MCP Server traffic with Kong Gateway and Kong AI Gateway.
-
Govern MCP traffic
Use Kong AI Gateway to govern GitHub MCP traffic.
-
Observe MCP traffic
Observe GitHub MCP traffic with Kong AI Gateway.
-
Kong Konnect MCP Server
Get started with Kong Konnect MCP Server.
-
Kong Konnect MCP Server tools
Check available Kong Konnect MCP Server tools.
AI load balancing
-
Load balancing with AI Proxy Advanced
Overview of load balancing, retry, and fallback strategies in the AI Proxy Advanced plugin; a round-robin configuration sketch follows this list.
-
Consistent Hashing - AI Proxy Advanced
Set up consistent hashing for load balancing.
-
Lowest Latency - AI Proxy Advanced
Configure load balancing based on the lowest latency.
-
Lowest Usage - AI Proxy Advanced
Set up load balancing based on the lowest usage.
-
Priority - AI Proxy Advanced
Configure priority-based load balancing.
-
Round Robin - AI Proxy Advanced
Set up round-robin load balancing.
-
Semantic - AI Proxy Advanced
Set up semantic load balancing.
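As a concrete anchor for the strategies above, the following is a hedged sketch of an AI Proxy Advanced configuration that load balances round robin across two OpenAI models. Model names, weights, and the key placeholder are assumptions; the algorithm names in the comment mirror the strategies listed above.
```yaml
# Illustrative round-robin split across two OpenAI models.
# Model names, weights, and the key placeholder are assumptions;
# consult the AI Proxy Advanced reference for the exact schema.
plugins:
  - name: ai-proxy-advanced
    config:
      balancer:
        algorithm: round-robin   # alternatives: consistent-hashing,
                                 # lowest-latency, lowest-usage,
                                 # priority, semantic
      targets:
        - route_type: llm/v1/chat
          weight: 70             # roughly 70% of requests
          auth:
            header_name: Authorization
            header_value: Bearer <OPENAI_API_KEY>
          model:
            provider: openai
            name: gpt-4o
        - route_type: llm/v1/chat
          weight: 30             # roughly 30% of requests
          auth:
            header_name: Authorization
            header_value: Bearer <OPENAI_API_KEY>
          model:
            provider: openai
            name: gpt-4o-mini
```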
How-tos
-
Collect Konnect audit logs
Learn how to configure your SIEM provider to collect Konnect logs and set up a Konnect audit log webhook.
-
Configure dynamic authentication to LLM providers using HashiCorp Vault
Use HashiCorp Vault to securely store and reference API keys for OpenAI, Mistral, and other LLM providers in Kong AI Gateway.
-
Control prompt size with the AI Compressor plugin
Learn how to use the AI Compressor plugin alongside the RAG Injector and AI Prompt Decorator plugins to keep prompts lean, reduce latency, and optimize LLM usage for cost efficiency.
-
Ensure chatbots adhere to compliance policies with the AI RAG Injector plugin
Learn how to configure the AI RAG Injector plugin.
-
Get started with AI Gateway
Learn how to quickly get started with AI Gateway.
-
Observe GitHub MCP traffic with Kong AI Gateway
Learn how to observe traffic to the GitHub remote MCP server with the AI Proxy Advanced and Kong Gateway Prometheus plugins.
-
Provide AI prompt templates for end users with the AI Prompt Template plugin and Mistral
Configure the AI Proxy plugin to route requests to a model provider like Mistral, then define reusable templates with the AI Prompt Template plugin to enforce consistent prompt formatting for tasks like summarization, code explanation, and Q&A.
-
Route OpenAI chat traffic using semantic balancing and Vault-stored keys
Use the AI Proxy Advanced plugin to route chat requests to OpenAI models based on semantic intent, secured with API keys stored in HashiCorp Vault.
-
Save LLM usage costs with AI Proxy Advanced semantic load balancing
Configure the AI Proxy Advanced plugin to optimize LLM usage and reduce costs by intelligently routing chat requests across multiple OpenAI models based on semantic similarity.
-
Send asynchronous requests to LLMs
Reduce costs by using llm/v1/files and llm/v1/batches route_types to send asynchronous batched requests to LLMs.
-
Set up AI Proxy Advanced with Anthropic in Kong Gateway
Configure the AI Proxy Advanced plugin to create a chat route using Anthropic.
-
Set up AI Proxy Advanced with Ollama
Configure the AI Proxy Advanced plugin to create a chat route using Ollama.
-
Set up AI Proxy Advanced with OpenAI in Kong Gateway
Configure the AI Proxy Advanced plugin to create a chat route using OpenAI.
-
Set up AI Proxy with Anthropic in Kong Gateway
Configure the AI Proxy plugin to create a chat route using Anthropic.
-
Set up AI Proxy with Ollama
Configure the AI Proxy plugin to create a chat route using Ollama.
-
Set up AI Proxy with OpenAI in Kong Gateway
Configure the AI Proxy plugin to create a chat route using OpenAI.
-
Store a Mistral API key as a secret in Konnect Config Store
Learn how to set up Konnect Config Store as a Vault backend and store a Mistral API key.
-
Store and rotate Mistral API keys as secrets in Google Cloud
Learn how to store and rotate secrets in Google Cloud with Kong Gateway, Mistral, and the AI Proxy plugin.
-
Transform a request body using OpenAI in Kong Gateway
Use the AI Request Transformer plugin with OpenAI to transform a client request body before proxying it.
-
Transform a response using OpenAI in Kong Gateway
Use the AI Response Transformer plugin with OpenAI to transform a response before returning it to the client.
-
Use AI Prompt Guard plugin to govern your LLM traffic
Use the AI Prompt Guard plugin to filter LLM traffic based on regex rules that allow general IT questions and deny unsafe or off-topic content; a configuration sketch follows at the end of this list.
-
Use AI Semantic Prompt Guard plugin to govern your LLM traffic
Use the AI Semantic Prompt Guard plugin to enforce topic-level guardrails for LLM traffic, filtering prompts based on meaning.
-
Use AI to protect sensitive information in requests
Use the AI Sanitizer plugin to protect sensitive information in requests.
-
Use Azure Content Safety plugin
Learn how to use the Azure AI Content Safety plugin.
-
Use Kong AI Gateway to govern GitHub MCP traffic
Learn how to govern traffic to the GitHub remote MCP server with the AI Proxy Advanced and AI Prompt Guard plugins.
-
Use LangChain with AI Proxy in Kong Gateway
Connect your LangChain integrations to Kong Gateway with no code changes.
-
Use the AI AWS Guardrails plugin
Learn how to use the AI AWS Guardrails plugin.
-
Visualize AI Gateway metrics
Use a sample Elasticsearch, Logstash, and Kibana stack to visualize data from the AI Proxy plugin.
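To illustrate the flavor of these how-tos, here is a hedged sketch of the regex-based governance described in "Use AI Prompt Guard plugin to govern your LLM traffic": allow and deny lists evaluated against incoming prompts. The patterns are illustrative assumptions, not the how-to's exact rules.
```yaml
# Hedged sketch: regex-based prompt governance with AI Prompt Guard.
# The patterns below are illustrative assumptions, not production rules.
plugins:
  - name: ai-prompt-guard
    config:
      allow_patterns:            # a prompt must match at least one to pass
        - ".*(password reset|vpn|printer|software install).*"
      deny_patterns:             # a matching prompt is rejected
        - ".*(credit card|salary|social security).*"
```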