Archive - The AI Runtime

Architecting the Auto-Updating Enterprise AI Data Loop

Tackling the AI Information Silos problem

23 hrs ago • The AI Runtime

6:07

AI Made Individuals Faster. Why Are Teams Still Slow?

The Next Bottleneck in AI Systems Is Context Transfer

Jun 20 • The AI Runtime

16:40

State Machines for AI Agents: A field guide from Forward-Deployed Engineering

Agents reason. State machines remember.

Jun 18 • The AI Runtime

The "Self-Improving" AI Myth (And What 60 Production Deployments Actually Do)

Sixty production deployments converged on a three-layer architecture where the eval surface, not the base model, is the moat.

Jun 15 • The AI Runtime

Dario Amodei’s “Policy on the AI Exponential” Describes a World Banking AI Already Lives In

His new essay asks regulators to build an FAA for AI models. If you ship AI inside a bank, you already work in that regime, and there is a five-minute…

Jun 11 • The AI Runtime

Harness Half-Life: A Field Playbook for Catching Agent Decay

The harness engineering discourse names what to build. The Model Reliability Engineering arc names how long the build lasts, what kills it, and what to…

Jun 8 • The AI Runtime

The AI Eval Gate Cheat Sheet

Most AI projects die in the gap between "works in the demo" and "works in production."

Jun 8 • The AI Runtime

Two Ways to Shrink an AI Model. Only One Keeps the Output.

Quantization changes the numbers. Lossless compression removes the wasted bits and keeps every output identical, for about 30% less memory.

Jun 7 • The AI Runtime

18:36

Two Ways to Shrink an AI Model. Only One Keeps the Output.

Quantization changes the numbers. Lossless compression removes the wasted bits and keeps every output identical, for about 30% less memory.

Jun 5 • The AI Runtime

The Anatomy of an AI Legal Agent

The leading AI legal research tools still hallucinate on up to a third of queries, so the production answer in law is not a better model but a harness…

Jun 3 • The AI Runtime

The Model Is the Smallest Part: A Free Field Guide to Production AI

Sixteen published deep-dives, four modules, one operating thesis. The harness around the model is the product. Free.

Jun 3 • The AI Runtime

Why Every Browser Harness Wrapper Is on Borrowed Time

Six hundred lines of code, no abstractions, and the argument that every wrapper around the LLM is on borrowed time.

Jun 1 • The AI Runtime

#nojs-banner { position: fixed; bottom: 0; left: 0; padding: 16px 16px 16px 32px; width: 100%; box-sizing: border-box; background: red; color: white; font-family: -apple-system, "Segoe UI", Roboto, Helvetica, Arial, sans-serif, "Apple Color Emoji", "Segoe UI Emoji", "Segoe UI Symbol"; font-size: 13px; line-height: 13px; } #nojs-banner a { color: inherit; text-decoration: underline; } This site requires JavaScript to run correctly. Please turn on JavaScript or unblock scripts