SOTAVerified

Conversational Web Navigation

The problem of conversational web navigation is described as follow: a digital agent controls a web browser and follows user instructions to solve real-world tasks in a multi-turn dialogue fashion. It was introduced alongside the WebLINX benchmark (Lù, Kasner, Reddy, 2024), and complements tasks such as Autonomous Web Navigation. It is one of many problems tackled by generalist (web) agents.

Papers

No papers found.

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Llama-2-13BOverall score25.21Unverified
2S-LLaMA-2.7BOverall score25.02Unverified
3Llama-2-7BOverall score24.57Unverified
4Flan-T5-3BOverall score23.77Unverified
5S-LLaMA-1.3BOverall score23.73Unverified
6GPT-3.5FOverall score21.22Unverified
7MindAct-3BOverall score20.94Unverified
8Fuyu-8BOverall score19.97Unverified
9Flan-T5-780MOverall score17.27Unverified
10Pix2Act-1.3BOverall score16.88Unverified