Question 1

What is Ingate?

Accepted Answer

Ingate is an all-in-one AI gateway for enterprise that sits between your applications and LLM providers. It combines a transparent proxy, full observability, prompt versioning, automated evaluations, provider fallback, response caching, guardrails, budget controls, and security governance in a single platform. No SDK required, no code changes needed.

Question 2

Why use an AI gateway instead of separate tools?

Accepted Answer

Most AI infrastructure requires stitching together separate tools for observability, prompt management, evaluation, and request routing. Each tool adds its own SDK, configuration, and data silo. An all-in-one gateway like Ingate gives you a single integration point: one base URL change, and you get logging, evals, prompts, caching, guardrails, and budget controls working together. No SDK, no glue code, no data fragmentation.

Question 3

Does Ingate require an SDK?

Accepted Answer

No. Ingate works with any HTTP client in any language. Change your base URL to point at Ingate and your existing OpenAI, Anthropic, or custom LLM code works unchanged. Ingate auto-detects the provider from the request path, so most integrations don't even need an extra header. Setup takes under 5 minutes.

Question 4

What features does Ingate include?

Accepted Answer

Ingate includes: transparent proxy with SSE streaming, provider fallback, format translation (OpenAI Chat, Responses API, and Anthropic), response caching, PII redaction, prompt injection detection, 14 built-in evaluators, versioned prompt registry, interactive playground, datasets, session tracking, cost tracking, budget controls, audit logging, HMAC-signed webhooks, scoped API keys with rotation, multi-tenant RBAC, REST ingestion API, OpenTelemetry receiver, and Bring Your Own Storage (BYOS).

Question 5

How does Ingate handle security?

Accepted Answer

Provider API keys are encrypted with AES-256-GCM at rest. API keys are stored as SHA-256 hashes and never logged. Ingate supports scoped keys with per-provider and per-model restrictions, zero-downtime key rotation, PII redaction on request bodies, prompt injection detection, SSRF protection on outbound requests, and a complete audit log of administrative actions. Auth endpoints are rate-limited to prevent brute-force attacks.

Question 6

What plans does Ingate offer?

Accepted Answer

Ingate offers two plans. The Free plan includes the transparent proxy, SSE streaming, request logging, a 7-day retention period, and a built-in dashboard. The Enterprise plan adds prompt versioning, evaluations, datasets, playground, provider fallback, format translation, response caching, PII redaction, guardrails, webhooks, audit log, budget controls, cost tracking, Prometheus metrics, BYOS, custom retention, and a 99.9% uptime SLA.

API Shape	Example Path	Auto-Detected
OpenAI Chat Completions	`/v1/chat/completions`	Yes
OpenAI Responses API	`/v1/responses`	Yes
Anthropic Messages	`/v1/messages`	Yes
Ollama	`/api/generate`, `/api/chat`	Yes
Custom / Unknown	Any path	No, use header or default

Header	Required	Description
`X-Ingate-Key`	Yes	Your Ingate API key. Authenticates the request and determines your org, plan, and entitlements.
`X-Ingate-Provider`	No	Target provider name. Optional when auto-detection or a default provider can resolve the target.
`X-Ingate-Translate`	No	Set to `true` to opt into request/response format translation between API shapes.
`X-Ingate-User-Id`	No	Arbitrary user identifier. Logged with the request for per-user analytics and filtering.
`X-Ingate-Session-Id`	No	Arbitrary session identifier. Groups related requests in logs for conversation-level tracking.

Header	Description
`X-Ingate-Request-Id`	Unique UUID for this request. Use it to look up logs, debug issues, and correlate across systems.
`X-Ingate-Served-By`	Present only on fallback. Shows which provider actually served the response when the primary failed.

Field	Description
`model`	Model name from the request or response
`tokens`	Prompt, completion, and total token counts
`latency`	End-to-end request duration in milliseconds
`status`	HTTP status code from the provider
`cost`	Estimated cost based on model pricing
`user_id`	From `X-Ingate-User-Id` header, if set
`session_id`	From `X-Ingate-Session-Id` header, if set

Provider Config	Upstream Header	Example
Default (`Bearer`)	`Authorization: Bearer <key>`	OpenAI, Ollama
Custom header + scheme	`x-api-key: <key>`	Anthropic
Query parameter	Appended as `?key=<key>`	Legacy APIs

Proxy & Routing

How Routing Works

Provider Resolution

Supported API Shapes

Request Headers

Response Headers

Streaming

What Gets Logged

Auth Passthrough

Provider Key Injection