Benchmarks, behavioral comparisons, and field findings from real-world browser automation at scale.
CLI vs MCP benchmark across 99 QA practice sites — timing, behavioral diffs, anomalies, and cost analysis.
Test suite findings for the JavaScript, Python, and Java API clients — pass rates, confirmed bugs, and regression coverage.