SOTAVerified

Autonomous Web Navigation

Evaluating agents on the task of navigating on the Web to solve a user given task/instruction

Papers

Showing 19 of 9 papers

TitleStatusHype
Magma: A Foundation Model for Multimodal AI AgentsCode5
Agent-E: From Autonomous Web Navigation to Foundational Design Principles in Agentic SystemsCode5
REAL: Benchmarking Autonomous Agents on Deterministic Simulations of Real WebsitesCode3
ST-WebAgentBench: A Benchmark for Evaluating Safety and Trustworthiness in Web AgentsCode1
Multimodal Web Navigation with Instruction-Finetuned Foundation Models0
Context manipulation attacks : Web agents are susceptible to corrupted memory0
Understanding HTML with Large Language Models0
WEPO: Web Element Preference Optimization for LLM-based Web Navigation0
"What's important here?": Opportunities and Challenges of Using LLMs in Retrieving Information from Web Interfaces0
Show:102550

No leaderboard results yet.