Foundations

Foundations

Where Intelligence Earns Its Keep in Browser Automation

A browser automation workflow runs for the hundredth time. One open-source framework barely touches a language model. The other sends the full page state to one at every step, same as it did on run number one. Same task, same browser. Wildly different bills at the end of the month.
Both frameworks recently rewrote their architectures from the ground up. Both migrated to the same low-level browser protocol. Then they diverged on a question that has no obvious right answer, one with consequences reaching well beyond browsers into any workflow where intelligence could, in theory, touch every step.

Where Intelligence Earns Its Keep in Browser Automation
Abrowser automation workflow runs for the hundredth time. One open-source framework barely touches a language model. The other sends the full page state to one at every step, same as it did on run number one. Same task, same browser. Wildly different bills at the end of the month.
Both frameworks recently rewrote their architectures from the ground up. Both migrated to the same low-level browser protocol. Then they diverged on a question that has no obvious right answer, one with consequences reaching well beyond browsers into any workflow where intelligence could, in theory, touch every step.
The Flight Simulator Problem

More than 35 companies are now building practice environments for AI agents. Anthropic has discussed spending over a billion dollars on them. The pitch is intuitive: you wouldn't let a pilot fly without simulator time. Deeptune just raised $43 million from a16z on exactly this logic, building pixel-perfect replicas of Slack, Salesforce, and ticketing tools where models can rehearse. But flight simulators work because physics holds still. The live web does not. And there's a subtler problem inside these training environments, one that lives deeper than fidelity.
The Flight Simulator Problem
More than 35 companies are now building practice environments for AI agents. Anthropic has discussed spending over a billion dollars on them. The pitch is intuitive: you wouldn't let a pilot fly without simulator time. Deeptune just raised $43 million from a16z on exactly this logic, building pixel-perfect replicas of Slack, Salesforce, and ticketing tools where models can rehearse. But flight simulators work because physics holds still. The live web does not. And there's a subtler problem inside these training environments, one that lives deeper than fidelity.

Worth Reading





