v0.133.1
2026-03-27

Updates

AI Gateway

  • Baseten is now available as a first-class inference provider, supporting chat completions and embeddings.
  • MCP servers now support per-user header-based auth overrides, allowing individual users to supply their own API tokens for MCP servers that require per-user credentials. MCP auth overrides are enabled by default.
  • OpenAPI-backed MCP servers are now processed through the full gateway hook pipeline, ensuring guardrails, auth, and tool settings apply consistently with remote MCP servers.
  • Virtual model routing targets can now include metadata_match conditions, so only targets whose key/value constraints match the incoming request metadata headers are eligible for routing.
  • TLS and proxy support is now available for remote MCP server connections, configurable via manifest tls_settings.
  • Azure OpenAI deployment configuration has been reworked — the previous region and data-zone fields are replaced by a deployment_type selector (global, datazone_us, datazone_eu, standard), and pricing lookup is updated accordingly.
  • The x-tfy-routing-config request header is no longer accepted for routing overrides. Routing configuration is now resolved exclusively from virtual model config, prompt config, or tenant-level load-balance settings. Clients that relied on this header must migrate to one of those mechanisms.
    Breaking Change: We are retiring the x-tfy-routing-config request header. Use Virtual Models instead. Read more
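
As a rough illustration of the metadata_match behavior described above, the sketch below filters routing targets by comparing each target's key/value constraints with the request metadata. The Target class, its field names, and the model names are hypothetical and do not reflect the gateway's actual schema or implementation. Treating a target without constraints as always eligible is also an assumption.

```python
from dataclasses import dataclass, field


@dataclass
class Target:
    # Hypothetical stand-in for a virtual model routing target.
    model: str
    metadata_match: dict[str, str] = field(default_factory=dict)


def eligible_targets(targets: list[Target], request_metadata: dict[str, str]) -> list[Target]:
    """Keep targets whose every key/value constraint matches the request metadata."""
    return [
        t for t in targets
        if all(request_metadata.get(k) == v for k, v in t.metadata_match.items())
    ]


targets = [
    Target("provider-a/model-x", {"tier": "premium"}),
    Target("provider-b/model-y", {"tier": "free"}),
    Target("provider-c/model-z"),  # no constraints: assumed to be always eligible
]

# Metadata as it might be parsed from the incoming request metadata headers.
print([t.model for t in eligible_targets(targets, {"tier": "premium"})])
# prints ['provider-a/model-x', 'provider-c/model-z']
```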

AI Engineering

  • GCR (gcr.io) container registries are now supported alongside Artifact Registry for GCP Docker registry integrations.
  • The OpenTelemetry collector now accepts traces and metrics over gRPC (port 4317) in addition to HTTP.
  • Platform policies can now be applied to volume manifests, extending enforcement and mutation support to volume deployments.

Release instructions

  • Update the truefoundry Helm chart to version 0.133.1.
  • Update the tfy-llm-gateway chart to version 0.133.0.
v0.132.5
2026-03-24

Updates

AI Gateway

  • Google Model Armor guardrails now support a mutate operation mode that can redact or transform request and response content via sensitive data protection, in addition to the existing validate mode.
  • MCP tool-call guardrails are now enforced on SSE streaming responses in addition to JSON responses, with consistent blocking and error handling across both transports.
  • The PromptFoo guardrail integration has been removed from both the gateway and the control plane.
  • Speech-to-text proxy support has been added for Vertex, with a corresponding code snippet available in the playground.
  • Added support for sticky routing for virtual models.
  • Guardrail requests to Palo Alto can now include custom metadata fields (e.g. AI model name, app user, app name) via a configurable metadata_key_mapping.
  • Fixed structured output (response_format: json_schema) handling for Anthropic models on Vertex.
  • Added support for disabling individual tools on an MCP server.
  • New code snippet integrations have been added for the playground, including Pydantic, Agno, CrewAI, Instructor, Langroid, OpenAI Agents, OpenAI Swarm, Strands Agents, and Codex.
  • New format for MCP Gateway endpoint.
    Breaking Change: We are updating the format of MCP Gateway endpoints. Read more

AI Engineering

  • HashiCorp Vault integrations now require kv_mount_path and support an optional root_path.
    1. kv_mount_path is a required parameter.
    2. The key path used in the Secret FQN must be updated to the following format: <root_path>/<secretPath>/<secretKeyInJSON>.

    Model Integration Auth Data
      Existing: tenant1:hashicorp:secret-store:vault::kv1/data/my-root-path/my-secret-path/my-secret-key
      New:      tenant1:hashicorp:secret-store:vault::my-root-path/my-secret-path/my-secret-key/value
    Virtual Account Sync Secret
      Existing: kv1/data/my-root-path/my-secret-path/my-secret-key
      New:      my-root-path/my-secret-path/my-secret-key/value
  • SCIM provisioning is now supported for SAML-based external OAuth identity providers.
  • Additional AWS instance families are now available for EKS cluster provisioning, including Graviton-based (T4g, M6g/M7g, C6g/C7g, R6g/R7g), ML accelerator (Trn1, Inf2), and newer GPU families (P5e, P6-B200, G6e, G7e).
  • A generic custom secret store integration is now available as an alternative to the built-in TrueFoundry DB or HashiCorp Vault secret backends.
  • Added support for the Databricks Job Task in Flyte workflows, which lets a workflow trigger Databricks jobs.
  • The YAML spec/apply workflow has been removed from platform settings configuration drawers; settings are now edited exclusively through the form UI.
  • Simplified GitOps CI/CD — tfy apply now supports directory-level --diffs-only with automatic dependency resolution, replacing complex per-file CI/CD scripts with a single command. Dry runs now succeed even when a PR adds interdependent resources. Read more
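
For the HashiCorp Vault key path change in this release, a small helper like the one below could rewrite existing key paths in bulk. It is only a sketch based on the two example paths given for this release: the function name is hypothetical, it assumes old paths always start with <kv_mount_path>/data/, and it assumes the JSON key is "value" as in those examples.

```python
def migrate_vault_key_path(old_path: str, kv_mount_path: str, json_key: str = "value") -> str:
    """Convert '<kv_mount_path>/data/<path>' into '<path>/<json_key>'."""
    prefix = f"{kv_mount_path}/data/"
    if not old_path.startswith(prefix):
        # Assumption: anything else is not an old-style path and is left for manual review.
        raise ValueError(f"expected path to start with {prefix!r}: {old_path!r}")
    return f"{old_path[len(prefix):]}/{json_key}"


old = "kv1/data/my-root-path/my-secret-path/my-secret-key"
print(migrate_vault_key_path(old, kv_mount_path="kv1"))
# prints my-root-path/my-secret-path/my-secret-key/value
```

The mount path is passed in explicitly rather than guessed from the first path segment, since it now has to be set as kv_mount_path on the integration anyway.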

Release instructions

  • Update the truefoundry Helm chart to version 0.132.5.
  • Update the tfy-llm-gateway chart to version 0.132.2.
v0.127.3
2026-03-10

Updates

  • Added support for a time-to-first-token (TTFT) based timeout, set via the x-tfy-ttft-timeout-ms request header. Read more
  • Added support for finetuned model integration in Google Vertex AI.
  • Added support for configuring a custom KMS Key ARN in AWS Secrets Manager and AWS Parameter Store Secret Store integrations.
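
A client could set the TTFT timeout per request along these lines. This is a minimal sketch: only the x-tfy-ttft-timeout-ms header name comes from this release. The helper name, the Bearer token scheme, and the assumption that the value is a positive integer number of milliseconds are illustrative.

```python
def gateway_headers(api_key: str, ttft_timeout_ms: int) -> dict[str, str]:
    """Build request headers that ask for a time-to-first-token timeout."""
    if ttft_timeout_ms <= 0:
        raise ValueError("ttft_timeout_ms must be a positive number of milliseconds")
    return {
        "Content-Type": "application/json",
        "Authorization": f"Bearer {api_key}",  # assumed auth scheme
        "x-tfy-ttft-timeout-ms": str(ttft_timeout_ms),  # header values are strings
    }


headers = gateway_headers("<API_KEY>", ttft_timeout_ms=2000)
print(headers["x-tfy-ttft-timeout-ms"])
# prints 2000
```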

Release instructions

  • Update truefoundry helm chart version to 0.127.3.
  • Update tfy-llm-gateway helm chart version to 0.127.0.
v0.126.2
2026-03-09

New: Enhanced Model integration flow that lets you verify models before enabling them

You can now verify that models are working before adding them to your Integration. This helps resolve any authentication or infrastructure issues before the models are shown to your end users.

Overview dashboard for AI Gateway

Get a high-level summary of your AI Gateway’s health, cost, errors, and top usage patterns. Read more

Updates

Release instructions

  • Update truefoundry helm chart version to 0.126.2.
  • Update tfy-llm-gateway helm chart version to 0.126.2.
v0.125.10
2026-03-02

Updates

  • Added a new Secret Group Access role for Secret Groups that lets users read secret values without being able to update them.
  • Added support for Validation in PII Guardrails.
  • Added support for TTS (Text-to-Speech) & STT (Speech-to-Text) Models for Groq. TTS STT
  • Added support for Realtime models in Google Gemini. Read more
  • Added support for compaction API for OpenAI. Read more

Release instructions

  • Update truefoundry helm chart version to 0.125.10.
  • Update tfy-llm-gateway helm chart version to 0.125.2. If the chart is installed as a standalone application, the values file requires the following updates:
    1. Set global.tenantName to the name of the tenant.
    2. Set global.controlPlaneURL to the Control Plane URL.
    3. Remove any URL set under env.
global:
  # This is the reference to the secrets we created in the previous step
  imagePullSecrets:
    - name: "truefoundry-image-pull-secret"

  # Choose the resource tier as per your needs
  resourceTier: medium # or small or large
  controlPlaneURL: <CONTROL_PLANE_URL> # eg. https://example-company.truefoundry.cloud
  tenantName: <TENANT_NAME>

# Optional: Istio configuration (if using Istio instead of standard ingress)
# istio:
#   virtualservice:
#     hosts:
#       - <GATEWAY_ENDPOINT_HOST>
#     enabled: true
#     retries:
#       enabled: true
#       retryOn: gateway-error
#     gateways:
#       - istio-system/tfy-wildcard
#     annotations: {}
v0.122.3
2026-02-24

Updates

  • Added support for TTS (Text-to-Speech) & STT (Speech-to-Text) Models for ElevenLabs, Cartesia, Deepgram, Vertex AI, Gemini. TTS STT
  • Added support for Deepgram Model Integration. Read more
  • Added support for Cartesia Model Integration. Read more
  • Added support for ElevenLabs Model Integration. Read more
  • Enable tfy apply for MCP Server integrations.

Release instructions

  • Update truefoundry helm chart version to 0.122.3.
  • Update tfy-llm-gateway helm chart version to 0.122.0.
v0.121.0
2026-02-17

New: Added support for Build Secret in Dockerfile Deployment

Docker build secrets allow you to securely pass sensitive information like private repository credentials, API keys, or authentication tokens during the Docker image build process. Read more

More Updates

  • Improvements in Hashicorp Vault integration:
    • Added support for custom KV mount path
    • Added support for custom Namespace

Release instructions

  • Update truefoundry helm chart version to 0.121.0.
  • Update tfy-llm-gateway helm chart version to 0.121.0.
v0.120.2
2026-02-13

Updates

  • Expanded TrueFoundry managed Guardrail integration with Prompt Injection Guardrail. Read more
  • Added support for custom CORS and Security Headers in TrueFoundry Helm Chart. Read more
global:
  proxy:
    cors:
      enabled: true|false
      allowedOrigins: string[] # List of allowed origins to restrict CORS. Defaults to '*'.
    enforceSecurityHeaders: true|false # Enforces security headers such as X-Content-Type-Options, Referrer-Policy, and Strict-Transport-Security

Release instructions

  • Update truefoundry helm chart version to 0.120.2.
v0.119.4
2026-02-11

Updates

  • Added support for Image Edit API in Google Vertex AI & Gemini Models.
  • Expanded TrueFoundry managed Guardrail integration with Cedar Guardrail for MCP tools. Read more

Release instructions

  • Update truefoundry helm chart version to 0.119.4.
v0.118.2
2026-02-06

New: More TrueFoundry managed Guardrails

We are expanding our TrueFoundry managed Guardrails with the addition of Regex Pattern Match, TrueFoundry PII, and Content Moderation Guardrails. (Only available in SaaS AI Gateway)

More Updates

  • Improved: Cost Attribution for Gemini CLI
    Breaking Change: When using Gemini CLI, the model must now be registered in TrueFoundry with the exact Model name as its Model ID. Read more
  • Improved: Google GCP Integrations now support giving Key File Content as TrueFoundry Secret FQN.

Release instructions

  • Update truefoundry helm chart version to 0.118.2.
v0.117.2
2026-02-04

New: TrueFoundry managed Guardrails

We are releasing TrueFoundry managed Guardrails such as Secret Detection, Code Safety Linter & SQL Sanitizer and will be adding more soon.

More Updates

  • New: Added support for configuring Embedding Model for Semantic Caching in AI Gateway Requests. Read more
  • New: Added support for Audio Transcript, Audio Translation & Text to Speech HTTP APIs in Google Vertex models.

Release instructions

  • Update truefoundry helm chart version to 0.117.2.
v0.116.3
2026-02-01

Improvement: MCP Guardrails and Guardrail Config Schema Change

We have launched a feature to enable MCP Guardrails. This allows us to configure policies at the Gateway layer to apply certain guardrails to specific MCP servers and models.
Breaking Change: We are changing schema for Guardrail Config. Read more

New: Data Access Rules for AI Gateway Request Logs and Metrics

Data Access Rules allow you to control who can access which request logs and metrics in the AI Gateway. Gateway Datasets provide access to subsets of traces and metrics by applying filters, enabling fine-grained access control based on users, teams, roles, and data scopes.

Improvement: Data Routing Rules to Configure Storage For AI Gateway Request Logs and Metrics

Data Routing Rules allow you to configure where your request logs (traces) and metrics are stored. You can choose between control plane managed storage and your own customer-managed storage. Read more

More Updates

  • Change: Revamp of External Identity by introducing Identity Provider. Read more
  • New: Added support to Shutdown cluster connected to TrueFoundry.
  • Behavior Change: Team Managers can now view all the Virtual Accounts owned by their Team.
  • Allow assigning Custom Roles to Virtual Account.
  • Added support for image model in Google Gemini.
  • Allow updating default roles so that Team Managers can manage their Virtual Accounts without being made Admin. Read more
  • New: Create MCP Servers using OpenAPI spec. Read more

Release instructions

  • Update truefoundry helm chart version to 0.116.3.
v0.115.2
2026-01-23

New: Allow query with multiple sub-agents

You can now use multiple sub-agents along with mcp-servers in AI Gateway and create useful super-agents. Read more

More Updates

  • Added support for using MCP Server name in MCP Gateway URL.
  • Added support for Certificate based auth in Azure Model Integrations.
  • Added support for thinkingLevel in Gemini-3 series Models.
  • Added support for reasoning in Azure Anthropic Models.
  • Added support for customisation of base URL in Palo Alto Prisma AIRS Guardrails.

Release instructions

  • Update truefoundry helm chart version to 0.115.2.
v0.114.1
2026-01-19

New: AI Gateway Settings

You can now configure global AI Gateway settings from Controls. These settings include Retry, Logging Mode, and Additional OTEL Config.

More Updates

  • Added support for Files API in Anthropic.
  • Bugfix: fixed Prompt tokens calculation for Google Vertex Anthropic models.
  • Added cached tokens info in Groq response.
  • Added support for Anthropic models in Azure Foundry.
  • Change: AI Gateway > Controls > OTEL Config tab is merged to AI Gateway > Controls > Settings
  • Change: Settings > Default Data Location section is moved to AI Gateway > Controls > Data Routing tab.
  • Behavior Change: For workloads with Auto-Shutdown enabled along with On-Demand nodes, TrueFoundry now automatically adds a pod disruption budget for max availability of 25%. This reduces disruption of these workloads in case of node consolidation.

Release instructions

  • Update truefoundry helm chart version to 0.114.1.
v0.113.2
2026-01-16

New: TrueFoundry Agent Hub

Centralized platform for building, registering, discovering, and orchestrating AI agents within an organization. This will support building complex agents as well as registering and using pre-existing agents. Read more

More Updates

  • Added support of Real-time SSE streaming for streamable-http transport in MCP Gateway.
  • Update: username and password fields are now optional for SMTP integration.

Release instructions

  • Update truefoundry helm chart version to 0.113.2.
v0.112.1
2026-01-14

Improvement: Simplified management for MCP Servers

MCP Server integrations are now standalone, top-level resources with their own permissions and simplified management.
Breaking Change: We are removing MCP Server Groups. MCP Servers will no longer be nested under groups. Read more

More Updates

  • Added support for Google Gemini image models
  • Tiered pricing in Google Vertex AI models
  • Behavior Change: Guardrails now run in parallel to reduce latency in AI Gateway. Read more
  • Added support for certificate based auth in Azure OpenAI Integration
  • Added support for External Identity in MCP Gateway

Release instructions

  • Update truefoundry helm chart version to 0.112.1.
v0.111.2
2026-01-12

Create your own custom roles and assign them to users

You can now create custom tenant-level roles and assign them to users. Read more

More Updates

  • Bugfix - fixed SLA cutoff in priority based routing config
  • Added support for xAI model provider in AI Gateway
  • Added Request Failure metrics for tools in MCP metrics
  • Behavior Change: For production GPU workloads, TrueFoundry now automatically adds a pod disruption budget for max availability of 25%. This reduces disruption of GPU workloads in case of node consolidation.

Release instructions

  • Update truefoundry helm chart version to 0.111.2.
v0.110.3
2026-01-05

New: OAuth Inbound Auth for MCP Gateway

When using MCP Servers in Cursor/VSCode, you can now authenticate with OAuth without needing to hardcode the token in mcp.json. Read more

More Updates

  • Behavior Change: In AWS Parameter Store, secrets are now stored as the SecureString parameter type instead of String. Read more
  • A request body size limit has been added to AI Gateway requests (default: 50 MB)
  • Bugfix - Fixed Assumed Role based auth for AWS Bedrock Guardrails
  • Bugfix - Removed default request timeout of 5 min within AI Gateway

Release instructions

  • Update truefoundry helm chart version to 0.110.3.
v0.109.3
2025-12-22

New: Support for SCIM

TrueFoundry now supports SCIM for SAML-based SSO. SCIM enables automatic user/team management using IdP users/groups. Read more
Enable SCIM checkbox in SAML SSO configuration

Improved Rate Limit Config

  • Rule IDs must be static (no {} placeholders). Use rate_limit_applies_per to create per-entity rate limit instead of dynamic rule IDs. Read more

More Updates

  • Added support for API key based auth in AWS Bedrock model integration.
  • Behavior Change: Tenant Admin can now access all entities(Models, MCP Server, Guardrail, Agent) in AI Gateway.

Release instructions

  • Update truefoundry helm chart version to 0.109.3.
Breaking Change: Removed support for Dynamic Rule ID in Rate Limit config. Read more
v0.108.2
2025-12-17

New: Use git tags to deploy your code

You can now use tags to deploy your code to have better version control of your releases.

More Updates

  • Bug fixes and improvements

Release instructions

  • Update truefoundry helm chart version to 0.108.2.
v0.107.1
2025-12-15

New: Request Caching

You can now use both exact-match and semantic caching in AI Gateway requests. Read more

More Updates

  • Error message improvement in Self-Hosted models response via AI Gateway.
  • Added Embedding model support in Cloudera.

Release instructions

  • Update truefoundry helm chart version to 0.107.1.
v0.106.2
2025-12-10

New: Support for External Identity

You can now use externally vended JWT tokens to authenticate to TrueFoundry. Read more

More Updates

  • Fixed Gemini 3 Pro Model usage in Agent Response.
  • Added support for custom Slug in Model integrations.
  • Added TrueFoundry integration with Goose. Read more

Release instructions

  • Update truefoundry helm chart version to 0.106.2.
v0.105.2
2025-12-08

New: Budget Usage

You can now monitor the usage metrics for each Budget rule added. Read more

More Updates

  • Added support for Deepseek model in Vertex AI.

Release instructions

  • Update truefoundry helm chart version to 0.105.2.
v0.104.2
2025-12-02

New: Configure the location where your AI Gateway Requests and Metrics are stored

We have added support for configuring the location in which AI Gateway Requests and Metrics are stored. This helps in complying with local data residency laws and privacy policies.
This feature is only available in SaaS TrueFoundry AI Gateway

Updates and Bug Fixes

  • Added support for Finetune API in Vertex Model as well. Read more
  • Added support for thought_signature in Google Gemini and Vertex model responses

Release instructions

  • Update truefoundry helm chart version to 0.104.2.
v0.103.1
2025-11-28

Updates and Bug Fixes

  • Added 404 in status codes used for default Fallback.
  • Added support for GCP workload identity authentication for Vertex models. Read more
  • Added support for Media resolution in Vertex Models. Read more
  • Added support for none as value in reasoning_effort. Read more

Release instructions

  • Update truefoundry helm chart version to 0.103.1.
v0.102.5
2025-11-26

New: we now support Virtual Models

Create reusable virtual models with intelligent routing configurations to distribute requests across multiple model providers. Read more

More Updates

  • Updates on Prisma Guardrail:
    • We now pass tfy.request.conversation_id and traceId with every request, so that messages can be grouped.
    • We slice the payload to a maximum size of 1.5 MB when sending it to Prisma.
  • New Routing Metrics: visualize effect of different AI Gateway configs like Rate limit, Budget limit, Routing, Load Balancing or Fallback.
  • New Metrics introduced:
    • Latency per output token
    • Model usage per user and per model
  • Added basic tracing support for requests via Gemini CLI.
  • Added support for AWS IAM role based auth in AWS SQS based async service. Read more

Release instructions

  • Update truefoundry helm chart version to 0.102.5.
v0.101.2
2025-11-21

Improved Budget Limit Config

  • Added support for Budget per week.
  • Budget usage windows are now fixed: a day starts at 00:00 UTC, a week starts on Monday, and a month starts on the 1st. Read more
  • Added support for setting Alerts to get warned before a limit is reached. Read more
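
To make the fixed windows concrete, the sketch below computes the start of the current day, week, and month window for a given timestamp. It is not the gateway's implementation. The function name is hypothetical, and it assumes that week and month boundaries also fall at 00:00 UTC, which the notes state explicitly only for days.

```python
from datetime import datetime, timedelta, timezone


def window_start(now: datetime, period: str) -> datetime:
    """Return the start of the fixed budget window containing `now` (timezone-aware)."""
    midnight = now.astimezone(timezone.utc).replace(hour=0, minute=0, second=0, microsecond=0)
    if period == "day":
        return midnight
    if period == "week":
        return midnight - timedelta(days=midnight.weekday())  # Monday is weekday 0
    if period == "month":
        return midnight.replace(day=1)
    raise ValueError(f"unsupported period: {period!r}")


now = datetime(2025, 11, 21, 15, 30, tzinfo=timezone.utc)  # a Friday
for period in ("day", "week", "month"):
    print(period, window_start(now, period).date())
# prints:
# day 2025-11-21
# week 2025-11-17
# month 2025-11-01
```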

More Updates

  • Behavior Change: Rate limit and Budget limit rules are now evaluated against every rule in the config, and the first matching rule in order is applied. This lets you set the priority of rules by adjusting their order in the config.
  • We now support AWS IAM role based auth when adding AWS bedrock integrations in TrueFoundry SaaS Gateway. Read more
  • Bug fix: fixed Patronus AI guardrail validation, fixed adding ‘Origin’ header to API request options in Promptfoo guardrail.
  • Added support for configuring multiple AI Gateways based on BU, region, etc. Read more
    Breaking Change: Removed support for Dynamic Rule ID in Budget config. Read more
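
The rule ordering behavior described above can be pictured as a first-match lookup. The Rule class, its when field, and the rule IDs below are hypothetical and stand in for the real rate limit and budget config schema; the sketch only shows how config order sets priority.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Rule:
    # Hypothetical rule shape: `when` holds the conditions a request must satisfy.
    id: str
    limit: int
    when: dict[str, str] = field(default_factory=dict)


def select_rule(rules: list[Rule], request: dict[str, str]) -> Rule | None:
    """Evaluate every rule, then apply the first match in config order."""
    matching = [
        r for r in rules
        if all(request.get(k) == v for k, v in r.when.items())
    ]
    return matching[0] if matching else None


rules = [
    Rule("alice-limit", limit=100, when={"user": "alice"}),  # listed first, so higher priority
    Rule("default-limit", limit=10),                         # no conditions: matches everything
]

print(select_rule(rules, {"user": "alice"}).id)  # prints alice-limit
print(select_rule(rules, {"user": "bob"}).id)    # prints default-limit
```

Swapping the two entries would make default-limit apply to every request, which is why more specific rules belong earlier in the list.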

Release instructions

  • Update truefoundry helm chart version to 0.101.2.
v0.100.2
2025-11-18

Override prompts when re-routing/falling back to different model

More Updates

  • Added support for MiniMax model in Vertex.
  • Bug fix: resolved issue of index number handling in case of parallel tool calls with Anthropic models

Release instructions

  • Update truefoundry helm chart version to 0.100.2.