Benchmarking

Papers

Showing 4041–4050 of 5548 papers

Title	Date	Tasks	Status	Hype
Dual Task Framework for Improving Persona-grounded Dialogue Dataset	Feb 11, 2022	Benchmarking	Unverified	0
High Fidelity RF Clutter Modeling and Simulation	Feb 10, 2022	Benchmarking, Vocal Bursts Intensity Prediction	Unverified	0
Lightweight Jet Reconstruction and Identification as an Object Detection Task	Feb 9, 2022	Benchmarking, object-detection	Unverified	0
BIQ2021: A Large-Scale Blind Image Quality Assessment Database	Feb 8, 2022	Benchmarking, Blind Image Quality Assessment	Unverified	0
ECRECer: Enzyme Commission Number Recommendation and Benchmarking based on Multiagent Dual-core Learning	Feb 8, 2022	Benchmarking, Language Modelling	Code Available	1
Comparative Study Between Distance Measures On Supervised Optimum-Path Forest Classification	Feb 8, 2022	Anomaly Detection, Benchmarking	Code Available	0
What are the best systems? New perspectives on NLP Benchmarking	Feb 8, 2022	Benchmarking	Code Available	1
RECOVER: sequential model optimization platform for combination drug repurposing identifies novel synergistic compounds in vitro	Feb 7, 2022	Benchmarking, Model Optimization	Code Available	1
Theory-inspired Parameter Control Benchmarks for Dynamic Algorithm Configuration	Feb 7, 2022	Benchmarking, Evolutionary Algorithms	Code Available	0
Benchmarking and Analyzing Point Cloud Classification under Corruptions	Feb 7, 2022	Benchmarking, Classification	Code Available	1

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified