Agent Observability Has a 37-Point Blind Spot Between "Did It Run?" and "Did It Work?"