SOTAVerified

Conversational Web Navigation

The problem of conversational web navigation is described as follow: a digital agent controls a web browser and follows user instructions to solve real-world tasks in a multi-turn dialogue fashion. It was introduced alongside the WebLINX benchmark (Lù, Kasner, Reddy, 2024), and complements tasks such as Autonomous Web Navigation. It is one of many problems tackled by generalist (web) agents.

Papers

Showing 13 of 3 papers

TitleStatusHype
From Context to Action: Analysis of the Impact of State Representation and Context on the Generalization of Multi-Turn Web Navigation Agents0
On the Multi-turn Instruction Following for Conversational Web AgentsCode1
WebLINX: Real-World Website Navigation with Multi-Turn DialogueCode5
Show:102550

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Llama-2-13BOverall score25.21Unverified
2S-LLaMA-2.7BOverall score25.02Unverified
3Llama-2-7BOverall score24.57Unverified
4Flan-T5-3BOverall score23.77Unverified
5S-LLaMA-1.3BOverall score23.73Unverified
6GPT-3.5FOverall score21.22Unverified
7MindAct-3BOverall score20.94Unverified
8Fuyu-8BOverall score19.97Unverified
9Flan-T5-780MOverall score17.27Unverified
10Pix2Act-1.3BOverall score16.88Unverified