True foundry research companion

satya - 5/31/2026, 8:41:23 PM

what is it

  1. started in 2021
  2. As an MLops tool in cloud and onprem
  3. Pivoted to LLM Ops (LLM gateway, deploy, finetune, rag, prompt lib)
  4. Further pivot to MCP and Agentic gateways and governance in the cloud and on prem

satya - 5/31/2026, 8:43:05 PM

how does True Foundry compares to MLflow

how does True Foundry compares to MLflow

Search for: how does True Foundry compares to MLflow

satya - 5/31/2026, 8:53:25 PM

MLFlow and True Foundry

  1. MLflow
  2. ****************
  3. MLflow is opensource and widely used for running and evaluating models during training
  4. Also expanded into AI space such as prompt effectiveness, evals, and cost control of LLMs
  5. Free but managing and hosting is on enterprises
  6. ---------------
  7. True Foundry
  8. ***********************
  9. Managed
  10. Commercial only
  11. LLM and Agentic shift seem strong
  12. What an enterprise has been implementing internally over the last couple of years using LangChain ecosystem and libraries and utilities are now offered by True Foundry
  13. ----------------------
  14. Caution
  15. ************************
  16. QUestion to ask is the licensing cost
  17. Vendor lock in
  18. Probably good for the bottom 60% of the LLM capabilites
  19. However if one is to go to agentic core or similar cloud based platform these may be out of the box
  20. The "agent run time" is likely to be managed and controlled by frontier companies
  21. So it is not clear how it will integrate into that 40% space

satya - 5/31/2026, 8:57:47 PM

So questions to ask are from the above

  1. Cost factors
  2. vendor lock
  3. How does it compare to native aws and azure offerings
  4. How can it help in "operationalizing" ai in enterprises, even at mid and lower levels
  5. What would an enterprise want in its "operational ai platform"?

satya - 5/31/2026, 9:06:16 PM

Another detailed question

  1. If one is to use say Bedrock, what is the overlap of this functionality?
  2. what will one be missing from that that True Foundry still provides?
  3. Can you list the "absolute" requirements of an ai operational platform that one would want?

satya - 5/31/2026, 9:47:09 PM

Key features interested in licensing

  1. AI Gateway: Routes LLM calls, enforces rate limits and quotas, meters usage per team, applies semantic caching and guardrails
  2. MCP Gateway: Central registry for all MCP servers with per-server RBAC, OAuth 2.0, and environment grouping (dev/staging/prod)
  3. Agent Gateway: Governs multi-agent workflows, traces agent-to-tool calls, enforces agent-level access policies, supports human-in-the-loop approvals
  4. Prompt Management: Versioned prompt templates stored centrally and shared across teams
  5. Control Plane UI: Single dashboard for administering all three gateways

satya - 5/31/2026, 9:52:54 PM

Hyper scaler like aws vs True Foundry

  1. First of all very similar features in hyper scalers
  2. Both offer agent registries, mcp registries, rbac, administration etc.
  3. True Foundry may be more focused, at a higher cost

satya - 5/31/2026, 10:11:17 PM

Ok, true competitors

  1. Only hyper scalers honestly
  2. There are many LLM gateways but none for MCP and Agentic layers along with rbac and control planes
  3. Likely the later will be done by Frontier labs likely

satya - 5/31/2026, 10:13:30 PM

Summary

  1. if adopts aws agentcore for example, it has most of the functionality. so just use it
  2. Or use frontier agentic run times: Copilot, agentcore, Google, etc
  3. Space is still evolving

satya - 6/1/2026, 5:07:01 PM

Devops capabilities of True foundry summarizied

  1. Deploy AI assets, LLM Inference Servers, MCPs, and Agents in their runtime environments
  2. Dashboards to manage the assets and deploy
  3. Observability dashboards
  4. Logging dashboards
  5. Access control of asset dashboards
  6. CI/CD pipelines for DevOps automation via Git and other tools
  7. Environment promotion ? promote MCP servers or agents from dev ? staging ? prod with approval gates
  8. Version management and rollbacks ? track deployed versions, roll back to a prior version on failure
  9. Health checks and alerting ? liveness/readiness probes on deployed assets with failure alerts
  10. Cost attribution per asset ? track which agent, MCP server, or team is driving LLM spend
  11. Rate limiting and quota enforcement per asset ? cap consumption per agent or MCP server
  12. Secret and credential management ? centralized storage of API keys, OAuth tokens, and credentials without hardcoding in agent or MCP server code

satya - 6/1/2026, 5:13:56 PM

Security features

  1. Authentication & Identity: every agent, user, and service account has a verified identity before touching any resource; no anonymous access to MCP servers or LLM endpoints
  2. Authorization & RBAC: fine-grained control over who can invoke which agent, which MCP tool, and which LLM ? enforced at the gateway, not the application
  3. Secret & Credential Management: API keys, OAuth tokens, and service credentials never hardcoded; centrally stored, rotated, and audited
  4. Audit Logging & Non-repudiation: immutable, tamper-evident logs of every LLM call, tool invocation, and agent action ? who did what, when, and with what result
  5. Data Residency & Sovereignty: control over which regions or infrastructure handle AI traffic and data, especially for HIPAA, ITAR, and EU AI Act compliance
  6. Input/Output Guardrails: PII detection, prompt injection detection, and content filtering applied at the gateway before reaching the model or tool
  7. Prompt Injection Defense: agents that call external tools or ingest external content are vulnerable to malicious instructions embedded in tool outputs; the gateway needs to detect and block these
  8. Blast Radius Containment: rate limits, quotas, and environment isolation (dev/staging/prod) so a compromised or runaway agent cannot escalate or exhaust resources across the enterprise
  9. MCP Server Trust Verification: ensuring agents only connect to registered, verified MCP servers and not rogue or shadow tool endpoints
  10. Zero Trust Networking: gateway-to-tool and agent-to-agent traffic is authenticated and encrypted in transit; no implicit trust based on network location

satya - 6/1/2026, 5:33:49 PM

Some quoted competitors

  1. AWS Bedrock + AgentCore
  2. Azure AI Foundry + Azure API Management
  3. Kong AI Gateway
  4. Cloudflare AI Gateway
  5. TrueFoundry
  6. Portkey
  7. LiteLLM

satya - 6/1/2026, 6:45:05 PM

Few architecture pictures

Few architecture pictures

satya - 6/1/2026, 6:45:45 PM

LLM gateway arch

satya - 6/1/2026, 6:46:56 PM

MCP Gateway arch

satya - 6/1/2026, 6:48:43 PM

Skills registry

satya - 6/1/2026, 6:49:52 PM

True Foundry architectural drawings

Show images for: True Foundry architectural drawings

satya - 6/1/2026, 6:51:59 PM

Control plane architecture

satya - 6/1/2026, 6:52:29 PM

Architecture page

Architecture page

satya - 6/1/2026, 6:53:27 PM

Control plane

satya - 6/1/2026, 6:54:29 PM

gateway plane

satya - 6/1/2026, 6:55:51 PM

Compute plane

satya - 6/1/2026, 6:56:23 PM

Security management overview

Security management overview

satya - 6/1/2026, 6:57:10 PM

Security model

satya - 6/1/2026, 6:58:27 PM

working with IDPs