SOTAVerified

Autonomous Web Navigation

Evaluating agents on the task of navigating on the Web to solve a user given task/instruction

Papers

Showing 19 of 9 papers

TitleStatusHype
Context manipulation attacks : Web agents are susceptible to corrupted memory0
REAL: Benchmarking Autonomous Agents on Deterministic Simulations of Real WebsitesCode3
Magma: A Foundation Model for Multimodal AI AgentsCode5
WEPO: Web Element Preference Optimization for LLM-based Web Navigation0
ST-WebAgentBench: A Benchmark for Evaluating Safety and Trustworthiness in Web AgentsCode1
Agent-E: From Autonomous Web Navigation to Foundational Design Principles in Agentic SystemsCode5
"What's important here?": Opportunities and Challenges of Using LLMs in Retrieving Information from Web Interfaces0
Multimodal Web Navigation with Instruction-Finetuned Foundation Models0
Understanding HTML with Large Language Models0
Show:102550

No leaderboard results yet.