Question 1

What is Ingate?

Accepted Answer

Ingate is an all-in-one AI gateway for enterprise that sits between your applications and LLM providers. It combines a transparent proxy, full observability, prompt versioning, automated evaluations, provider fallback, response caching, guardrails, budget controls, and security governance in a single platform. No SDK required, no code changes needed.

Question 2

Why use an AI gateway instead of separate tools?

Accepted Answer

Most AI infrastructure requires stitching together separate tools for observability, prompt management, evaluation, and request routing. Each tool adds its own SDK, configuration, and data silo. An all-in-one gateway like Ingate gives you a single integration point: one base URL change, and you get logging, evals, prompts, caching, guardrails, and budget controls working together. No SDK, no glue code, no data fragmentation.

Question 3

Does Ingate require an SDK?

Accepted Answer

No. Ingate works with any HTTP client in any language. Change your base URL to point at Ingate and your existing OpenAI, Anthropic, or custom LLM code works unchanged. Ingate auto-detects the provider from the request path, so most integrations don't even need an extra header. Setup takes under 5 minutes.

Question 4

What features does Ingate include?

Accepted Answer

Ingate includes: transparent proxy with SSE streaming, provider fallback, format translation (OpenAI Chat, Responses API, and Anthropic), response caching, PII redaction, prompt injection detection, 14 built-in evaluators, versioned prompt registry, interactive playground, datasets, session tracking, cost tracking, budget controls, audit logging, HMAC-signed webhooks, scoped API keys with rotation, multi-tenant RBAC, REST ingestion API, OpenTelemetry receiver, and Bring Your Own Storage (BYOS).

Question 5

How does Ingate handle security?

Accepted Answer

Provider API keys are encrypted with AES-256-GCM at rest. API keys are stored as SHA-256 hashes and never logged. Ingate supports scoped keys with per-provider and per-model restrictions, zero-downtime key rotation, PII redaction on request bodies, prompt injection detection, SSRF protection on outbound requests, and a complete audit log of administrative actions. Auth endpoints are rate-limited to prevent brute-force attacks.

Question 6

What plans does Ingate offer?

Accepted Answer

Ingate offers two plans. The Free plan includes the transparent proxy, SSE streaming, request logging, a 7-day retention period, and a built-in dashboard. The Enterprise plan adds prompt versioning, evaluations, datasets, playground, provider fallback, format translation, response caching, PII redaction, guardrails, webhooks, audit log, budget controls, cost tracking, Prometheus metrics, BYOS, custom retention, and a 99.9% uptime SLA.

Evaluator	Description
`contains`	Output contains a specific substring
`not_contains`	Output does not contain the specified substring
`regex_match`	Output matches a regular expression
`is_json`	Output is valid JSON
`max_length`	Character count within maximum
`min_length`	Character count meets minimum
`latency_max`	Request latency within threshold (milliseconds)
`token_max`	Token usage within threshold
`status_code`	HTTP status code (exact or range match)

Evaluator	Description
`json_schema`	Validates JSON structure, checking required keys and value types
`similarity`	Word overlap similarity against a reference string
`reference_match`	Compare against reference (exact, contains, or similarity)
`webhook`	Send trace to an external URL for custom evaluation

Field	Description
`prompt`	The user's input message
`completion`	The LLM's response text
`request_body`	Full request payload (JSON)
`response_body`	Full response payload (JSON)
`provider`	Provider name (e.g. openai, anthropic)
`model`	Model identifier used for the request
`path`	Request path through the gateway
`method`	HTTP method (GET, POST, etc.)

Eval Engine

Overview

Static Evaluators

Dynamic Evaluators

LLM Judge

Create an Eval

JSON Output Check

Content Safety

Webhook Evaluator

LLM Judge

Running Evals

Running Evals Against Datasets

Available Fields