CURRENT | Echoes

The Testing Substrate

The Thread That Runs Through the Browser

By Rina Takahashi— June 19, 2026

Feature image for article: The Thread That Runs Through the Browser

In 2004, a developer at ThoughtWorks wrote a JavaScript tool to stop his colleagues from manually clicking through a billing app. He named it Selenium, as a joke. Twenty-two years later, AI agents open browsers using protocols descended from that tool, reading page structures originally built for screen readers, deciding what to do on sites they've never encountered. Every layer of browser automation was a reasonable response to the problem directly in front of it. The problem in front of it kept quietly changing.

The Testing Substrate

The Thread That Runs Through the Browser

By Rina Takahashi— June 19, 2026

In 2004, a developer at ThoughtWorks wrote a JavaScript tool to stop his colleagues from manually clicking through a billing app. He named it Selenium, as a joke. Twenty-two years later, AI agents open browsers using protocols descended from that tool, reading page structures originally built for screen readers, deciding what to do on sites they've never encountered. Every layer of browser automation was a reasonable response to the problem directly in front of it. The problem in front of it kept quietly changing.

Before the Click

Four Checks Before a Click

Before an agent can decide what to click, something has to confirm the click will actually land. Playwright runs four actionability checks on every single click action. Visible. Stable. Receives events. Enabled. All four must pass simultaneously, or the action doesn't fire.

The third check is worth pausing on. An element can be fully rendered, motionless, and enabled, yet a transparent overlay sitting above it will silently capture the click. Playwright calculates the exact pointer coordinates and asks: at this pixel, which element would actually receive the event? The documentation concedes that intentional overlay behavior is "indistinguishable from a bug." The infrastructure can't always tell the difference.

These checks formalized what earlier automation left to guesswork and sleep timers. They're also what agents inherit. Playwright MCP exposes this same machinery to LLMs. When an agent clicks a button, these four checks still run underneath. Agent reasoning sits on top of a layer that's still working out whether the action is physically possible.

Before the Click

Four Checks Before a Click

Before an agent can decide what to click, something has to confirm the click will actually land. Playwright runs four actionability checks on every single click action. Visible. Stable. Receives events. Enabled. All four must pass simultaneously, or the action doesn't fire.

The third check is worth pausing on. An element can be fully rendered, motionless, and enabled, yet a transparent overlay sitting above it will silently capture the click. Playwright calculates the exact pointer coordinates and asks: at this pixel, which element would actually receive the event? The documentation concedes that intentional overlay behavior is "indistinguishable from a bug." The infrastructure can't always tell the difference.

These checks formalized what earlier automation left to guesswork and sleep timers. They're also what agents inherit. Playwright MCP exposes this same machinery to LLMs. When an agent clicks a button, these four checks still run underneath. Agent reasoning sits on top of a layer that's still working out whether the action is physically possible.

TAKE NOTE

Visible doesn't mean perceptible. An element at opacity:0 passes. One set to display:none does not. The check is about rendering geometry, not what a human would notice.

Stable means not animating. A button mid-CSS transition fails, even if it's fully visible at its current position. The click might land where the element was, not where it ends up.

Receives events performs a hit-test at the exact click coordinates. If a cookie banner, modal backdrop, or loading overlay would intercept the pointer at that pixel, the check fails.

Enabled catches disabled form elements that still look clickable. A grayed-out submit button renders fine; Playwright won't touch it.

If any check fails, Playwright retries the full sequence until timeout. It doesn't skip ahead.

force: true bypasses all of this. It exists because sometimes the infrastructure's judgment is wrong and the developer knows better.

The Familiar Breakage

A QA Engineer Watches History Repeat Itself

The Familiar Breakage

A QA Engineer Watches History Repeat Itself

The Standard and the Cycle

The W3C Moment

In June 2018, the W3C published a spec describing software that "emulates the actions of a real person using the browser." They meant testing tools. That sentence now reads like a product description for AI agents. WebDriver's authors knew they were building something more general than a test harness. They just scoped it as one. The word "primarily" left room for everything that followed.

The Standard and the Cycle

The Screen Scraping Cycle

In 1986, IBM shipped a tool that let programs read characters off a terminal screen because mainframe applications had no APIs. The assumption was that this was temporary. Forty years and several technology generations later, browser agents automate the web using the same basic logic: parsing whatever the human sees. Each generation believed it was building a workaround. Each time, the workaround became infrastructure.

The Standard and the Cycle

The W3C Moment

In June 2018, the W3C published a spec describing software that "emulates the actions of a real person using the browser." They meant testing tools. That sentence now reads like a product description for AI agents. WebDriver's authors knew they were building something more general than a test harness. They just scoped it as one. The word "primarily" left room for everything that followed.

The Standard and the Cycle

The Screen Scraping Cycle

In 1986, IBM shipped a tool that let programs read characters off a terminal screen because mainframe applications had no APIs. The assumption was that this was temporary. Forty years and several technology generations later, browser agents automate the web using the same basic logic: parsing whatever the human sees. Each generation believed it was building a workaround. Each time, the workaround became infrastructure.

The Standard and the Cycle

The W3C Moment

In June 2018, the W3C published a spec describing software that "emulates the actions of a real person using the browser." They meant testing tools. That sentence now reads like a product description for AI agents. WebDriver's authors knew they were building something more general than a test harness. They just scoped it as one. The word "primarily" left room for everything that followed.

The Standard and the Cycle

The Screen Scraping Cycle

In 1986, IBM shipped a tool that let programs read characters off a terminal screen because mainframe applications had no APIs. The assumption was that this was temporary. Forty years and several technology generations later, browser agents automate the web using the same basic logic: parsing whatever the human sees. Each generation believed it was building a workaround. Each time, the workaround became infrastructure.