SOTAVerified

Benchmarking

Papers

Showing 2771-2780 of 5548 papers

Title | Status | Hype
Arena 4.0: A Comprehensive ROS2 Development and Benchmarking Platform for Human-centric Navigation Using Generative-Model-based Environment Generation | - | 0
MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines | - | 0
Efficient Performance Tracking: Leveraging Large Language Models for Automated Construction of Scientific Leaderboards | Code | 0
ASR Benchmarking: Need for a More Representative Conversational Dataset | Code | 0
Efficacy of Synthetic Data as a Benchmark | - | 0
Hard-Label Cryptanalytic Extraction of Neural Network Models | Code | 0
PARAPHRASUS : A Comprehensive Benchmark for Evaluating Paraphrase Detection Models | Code | 0
Improve Machine Learning carbon footprint using Parquet dataset format and Mixed Precision training for regression models -- Part II | Code | 0
WER We Stand: Benchmarking Urdu ASR Models | - | 0
The Sounds of Home: A Speech-Removed Residential Audio Dataset for Sound Event Detection | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | - | Unverified