Skip to main content
⌘K
News
Our Tools
中文工具
AI Directory
News
Our Tools
中文工具
AI Directory
Home
/
Tags
/
Benchmark
Tag
Benchmark
1
article
Advertisement
Articles
1
AI & ML
4 min
Claude Sonnet Leads Real-World Web Agent Benchmark — But Only Completes 1 in 3 Tasks
A new benchmark called ClawBench tests AI agents on 153 real tasks across 144 live production websites — booki...
RE
RECATOOLS Editorial
1 May
Advertisement