Cleanlab's 2025 production survey puts the number at 5%. That's the share of AI agents in production with what anyone would call mature monitoring. Meanwhile, CB Insights ranks agent observability as the most commercially dynamic category in generative AI, with 75-plus startups competing for the space. So capital is flooding in. The question worth sitting with: flooding in toward what, exactly?
Gartner reports 67% of enterprises see measurable model degradation within twelve months. KPMG finds 75% of leaders prioritize security, compliance, and auditability. Both stats describe organizations watching whether the system stays inside the lines. Neither describes anyone watching whether the system is pointed at the right target. Process health and behavioral correctness are different problems. Almost everything being built right now instruments the first one.
That 5% figure doesn't just say monitoring is immature. It suggests the industry hasn't yet agreed on what mature monitoring would even measure.
