SOTAVerified

Benchmarking

Papers

Showing 201210 of 5548 papers

TitleStatusHype
Authorship Obfuscation in Multilingual Machine-Generated Text DetectionCode2
GenoTEX: An LLM Agent Benchmark for Automated Gene Expression Data AnalysisCode2
Evaluating Large-Vocabulary Object Detectors: The Devil is in the DetailsCode2
GlobalGeoTree: A Multi-Granular Vision-Language Dataset for Global Tree Species ClassificationCode2
A Dynamic Points Removal Benchmark in Point Cloud MapsCode2
Griffin: Aerial-Ground Cooperative Detection and Tracking Dataset and BenchmarkCode2
Aria Digital Twin: A New Benchmark Dataset for Egocentric 3D Machine PerceptionCode2
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision TasksCode2
Event-Based Motion MagnificationCode2
Exponentially Faster Language ModellingCode2
Show:102550
← PrevPage 21 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified