Open Source QA Research

Two interfaces.
One question.

99 QA practice sites across 5 categories — tested with Vibium CLI and Vibium MCP. Which is faster, cheaper, and more capable?

vibium v26.3.18 Chrome 148 claude-sonnet-4-6 Intel i9-10910 · 64 GB macOS 26.3.1 Apr–May 2026
99
Sites Tested
5.0×
CLI Faster (avg)
5–9×
CLI Cheaper
~10s
Typical MCP Time
— Preview
At a Glance
Tap left/right to navigate · auto-advances every few seconds
— Results
Category Breakdown
Total wall-clock time across all sites in each category (lower is better)
CLI
MCP
Category Sites CLI Total MCP Total Speed Ratio Cost
General Practice 28
108s
864s
8.0× ~5× cheaper
Automation Testing 41
256s
1,066s
4.2× ~7× cheaper
Security Testing 7 / 11
21s
109s
5.2× ~2.4× cheaper
API Testing 16
51s
167s
3.3× ~6× cheaper
Performance Testing 3
14s
58s
4.1× ~6× cheaper
All Sites 95 / 99 450s total 2,263s total 5.0× 5–9× cheaper

Bar widths relative to longest MCP total (General Practice 864s). 4 Security Testing sites require local Docker or account — excluded from timing.

— Community
Built by testers, for testers
Practice sites created by the testing community — featured in this benchmark
Jason Arbon
59-category testing checklist index covering WCAG A/AA/AAA, ARIA, security, privacy, code quality, i18n, GenAI, DevOps, and more. Each category links to a full interactive checklist.

CLI 1,704ms · MCP 7,939ms · 4.7×
Jason Huggins
Test Track — 15 structured modules Basic → Expert: buttons, inputs, modals, alerts, drag & drop, canvas, 3D chess. CLI 2,251ms · MCP 4,732ms · 2.1×

var.parts — Vibium-branded robot parts shop with 12 products and a working cart. CLI 2,199ms · MCP 8,271ms · 3.8×
Paul Grossman
UK testing sandbox with county picker, contact form, social links, reCAPTCHA, and more. 54 interactive MCP elements. One of the narrowest General Practice ratios — heavy page content compresses the gap.

CLI 5,720ms · MCP 10,979ms · 1.9×
— Insights
Key Findings
Patterns and surprises from 95 timed site runs
Speed

CLI tracks complexity — MCP doesn't

MCP time is uniformly ~7–50s per site regardless of complexity. CLI time scales with site: from 383ms (ATM Practice) to 87s (Expand Testing timeout). Simplest sites have the widest ratios.

Widest Ratio

Automation Bookstore: 36×

CLI completes in 491ms — the fastest single-site time in the entire dataset. A simple filter-only SPA with no waits exposes MCP's full per-tool-call overhead.

Narrowest Ratio

Evil Tester: 2.6×

Alert pre-stubbing via eval adds equivalent overhead to both interfaces, nearly equalizing them. Sleep floors and complex setup collapse the gap across the dataset.

Behavioral Difference

OWASP Juice Shop: MCP loads, CLI fails

CLI receives an Angular SSR Application Error. MCP fully loads the app with 15 products. The most significant cross-interface divergence — same URL, completely different outcomes.

Behavioral Difference

SPAs: MCP maps where CLI sees nothing

On React/Vue SPAs like TestDino (216 refs), Lambdatest Playground (309 refs), and GreenKart (126 refs), browser_map enumerates all elements. CLI vibium map returns nothing — requiring eval-only workflows.

Behavioral Difference

HTTP sites: MCP follows redirect, CLI fails

On http:// origins (Global SQA Demo, Travel Agileway), MCP silently navigates while CLI throws a BiDi "unknown error". MCP is more permissive with insecure origins.

Anomaly

SpaceTraders: MCP was faster (0.7×)

The only site where MCP beat CLI. CLI cold-started at 14.9s on this CDN-heavy docs site; MCP hit a warmer cache at 10.2s. Excluding it, the API batch ratio is 4.8×.

Cost

CLI is 5–9× cheaper per site

Each MCP browser_navigate + browser_map pair generates its own LLM turn. CLI dispatches via bash and the model barely participates in execution. For pure automation, CLI wins on cost by a wide margin.

— Environment
Test Machine
All timings recorded on this machine — reference for cross-machine comparison
CPUIntel Core i9-10910 @ 3.60GHz · 10 cores / 20 threads
RAM64 GB
OSmacOS 26.3.1 (Build 25D771280a)
Nodev25.8.0
vibiumv26.3.18
Chrome148.0.7778.168
Modelclaude-sonnet-4-6
NetworkWi-Fi · Router 10.0.0.1
Dates2026-04-22 to 2026-05-20