Benchmarks developed to evaluate an agent's ability to browse the web and synthesize information from multiple sources.
benchmark, agentic

Notes

Date approximate.