Evaluation and Governance
How to know enterprise AI works, and how to ship it safely. Operational practice, not slogans.
Why enterprise AI programs succeed or stall, and how to tell which is happening to yours.
The patterns that distinguish production AI from demos.
Two disciplines determine whether enterprise AI earns operational trust: evaluation, the practice of measuring whether a system actually works in production; and governance, the delivery of policy as code, controls, and accountable workflows. Both remain underspecified. Evaluation in many organizat
Two enterprises with comparable AI ambition, similar vendor stacks, and similar talent pools routinely produce materially different results. The divergence almost never traces to the technology choice. It traces to the operating model: the design authority, build capacity, governance regime, run mo
By 2026, enterprise AI systems are no longer differentiated primarily by which large language model they use. The frontier models (Anthropic's Claude Opus 4.7, OpenAI's GPT-5.2, Google's Gemini 3 Pro) are converging in capability for the median enterprise workload. What separates production-grade s
The most consequential layer of the AI buildout is not the foundation models themselves but what sits between them and the organizations that deploy them: architecture, integration, evaluation, and governance. The public record has clarified the picture rather than settled it. The applied layer is
Morgan Stanley shipped two assistants in eighteen months. The visible artefact in both cases was the model. The invisible artefact, the part that decided whether the rollouts compounded, was the evaluation harness underneath.
In February 2024 Klarna announced an AI assistant doing the work of 700 agents. Eighteen months later it was rehiring humans. The numbers in between are a teaching case for the applied layer.
The editorial spine of the publication. What production AI actually looks like, and how the discipline matures over time.
This is Applied Layer AI, a new site by Muneeb.