We ran our governance scanner against 21 of the most popular AI agent frameworks, ML libraries, and AI SDKs. The average score was 53/100. Only 2 repos are on track for EU AI Act readiness. Here are the full results.
75% of AI coding models introduce regressions under sustained maintenance. The fix is not better prompts -- it is structural enforcement at five levels, from conversation to pre-commit hooks.
Your auditor will ask how you govern AI systems. A monitoring dashboard is not the answer. Here is the compliance evidence framework that maps to SOC 2, EU AI Act, and Colorado AI Act requirements.
NIST AI Risk Management Framework defines four functions: Govern, Map, Measure, Manage. Here is how structural enforcement maps to each function -- with a concrete crosswalk table for compliance teams.
Andrej Karpathy asked for an agent command center. We had already built it -- plus the governance layer he didn't ask for. Here's the direct mapping from his tweet to our production system managing 6 AI agents.
Pedro Domingos says LLM reasoning is fake. He's right. And that's the strongest argument for structural enforcement -- not better prompts, not bigger models, but verification layers that catch what reasoning misses.
4,768 violations detected. 18 promoted to structural enforcement. That 265:1 ratio is the real bottleneck in AI self-improvement -- and most teams don't even measure it.
Six funded companies detect AI agent violations at runtime. None prevent them structurally. Here's why the detection paradigm has a ceiling -- and what prevent-by-construction looks like in production.
Four AI labs independently built the same agent architecture. None of them built the governance layer. The enforcement ladder is the missing piece that turns 75% regression rates into less than 5%.
The EU AI Act takes effect August 2, 2026. Static checklists and dashboards cannot meet the 'continuous iterative' standard. Learn what structural enforcement means and why it matters.
Every long-running AI agent hits context compression. Your system prompts, project rules, and behavioral constraints get silently dropped. Here's a production-proven hook that flushes critical knowledge to persistent storage before compression hits.
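The flush-before-compression pattern can be sketched in a few lines. This is a minimal illustration, not the production hook itself: the hook API, the `CRITICAL_KEYS` names, and the `agent_memory.json` path are all hypothetical stand-ins for whatever your framework exposes to pre-compaction hooks.

```python
import json
from pathlib import Path

# Persistent store that survives context compression; path is illustrative.
STORE = Path("agent_memory.json")

# Hypothetical keys marking the entries that must never be silently dropped.
CRITICAL_KEYS = ("system_rules", "project_constraints")


def flush_critical_context(context: dict) -> dict:
    """Persist must-not-lose entries before the context window is compacted."""
    saved = {k: context[k] for k in CRITICAL_KEYS if k in context}
    existing = json.loads(STORE.read_text()) if STORE.exists() else {}
    existing.update(saved)
    STORE.write_text(json.dumps(existing, indent=2))
    return saved


def restore_critical_context(context: dict) -> dict:
    """Re-inject persisted entries into a freshly compressed context."""
    if STORE.exists():
        context.update(json.loads(STORE.read_text()))
    return context
```

The key design point: the flush runs *before* compression and the restore runs *after*, so the critical entries round-trip through durable storage instead of depending on the compressor to preserve them.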
When 6 agents share context without consistency guarantees, they diverge silently. Here's what we learned from running a production multi-agent system with cross-agent signal routing.
Anthropic published their context engineering guide. Their 'Right Altitude' framework maps directly to the enforcement ladder we've been running in production for 6 months. Here's the technical mapping -- and the layer they left out.
4,768 violations across 6 autonomous agents exposed 4 context failure modes. Here's what poisoned context looks like in production and how structural enforcement prevents it.
Token Security, a non-human identity (NHI) security startup backed by $28M from Notable Capital, was selected as an RSAC 2026 Innovation Sandbox finalist. Their identity-first approach to AI agent security addresses who agents are -- but not what they do. Here is the identity-behavioral gap enterprises need to close.
Okta announced 'Okta for AI Agents' at Showcase 2026, extending enterprise IAM to non-human identities. Here is what it covers, what it does not, and what the identity-behavioral governance gap means for teams building AI agent systems.
Arthur AI ships middleware guardrails and model monitoring. Structural enforcement prevents violations permanently. Two AI governance philosophies compared.
Invariant Labs (acquired by Snyk) analyzes agent traces to detect security issues. Structural enforcement prevents them permanently. Two approaches compared.
Lasso Security detects behavioral drift at sub-50ms. Structural enforcement eliminates the drift permanently. Two approaches to AI agent governance compared.
Enterprise AI governance platforms charge $50K-$200K annually for monitoring dashboards. Here is what you are actually paying for, what you are not getting, and what a structural alternative costs.
Token fungibility, the inverted 80/20, and 'clarity precedes execution': three frameworks from Nate Jones' convergence thesis that explain why 94% of AI agent projects never reach production.
Karpathy proved autoresearch works with crude hill climbing and 700 iterations. Production-grade autoresearch requires three missing pieces: enforcement, convergence verification, and skill accumulation.
Show your project's AI governance posture with a shields.io-style badge. Copy one line of markdown, paste it into your README, done. Free, always up to date, links to a full scan.
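The one-line pattern looks like this. The shields.io static-badge syntax is real, but the score, color, and scan link below are placeholders, not the actual service endpoint:

```markdown
[![AI Governance Score](https://img.shields.io/badge/governance-53%2F100-orange)](https://example.com/scan/your-repo)
```

Dropping it near the top of a README gives reviewers a governance signal at the same glance where they already check build and coverage badges.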
Your AI agent forgets its most important rules every 45 minutes. One L5 hook -- 12 lines of Python -- prevents it permanently. Here's the pattern and why the community is adopting it.
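The rule-reinjection pattern behind that hook can be sketched as follows. This is an illustrative sketch of the pattern, not the article's actual 12-line hook: `CRITICAL_RULES`, the 45-minute window, and the message shape are assumptions standing in for your agent's real configuration.

```python
import time

# Illustrative rules; in practice these come from your project's rule files.
CRITICAL_RULES = ["Never commit secrets.", "Run tests before merging."]
REINJECT_EVERY = 45 * 60  # seconds; matches the drift window described above
_last_injection = 0.0


def reinject_rules(messages: list) -> list:
    """Append the critical rules as a system message once per window."""
    global _last_injection
    now = time.time()
    if now - _last_injection >= REINJECT_EVERY:
        messages.append({"role": "system", "content": "\n".join(CRITICAL_RULES)})
        _last_injection = now
    return messages
```

Because the hook runs on every turn but injects at most once per window, the rules survive context churn without bloating every request.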
Early governance signals (CLAUDE.md, AGENTS.md) show awareness, but 68 potential secrets, 1,303 TODOs, and zero enforcement hooks reveal that awareness has not yet translated into structural enforcement.
The most deployed Python web framework has 1,995 test files but zero enforcement hooks and no AI agent instructions, leaving governance to manual review alone.
The foundational ML library has zero hardcoded secrets (best in our portfolio) but zero enforcement hooks and embedded test structure that hides coverage from governance tools.
The leading multi-agent framework scores lowest in our portfolio -- zero test files at root, 56 potential secrets, and no AI agent instructions in the very infrastructure designed to orchestrate AI agents.
Early governance signals (CLAUDE.md, AGENTS.md) exist but zero enforcement hooks, 25 potential hardcoded secrets, and monorepo complexity create significant gaps.
Strong test coverage (583 test files) is undermined by zero automated enforcement hooks and no AI agent instructions, leaving the project vulnerable to governance drift.