SOTAVerified

Benchmarking

Papers

Showing 871880 of 5548 papers

TitleStatusHype
AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of Text-to-Video Generation with LMMCode1
Benchmarking Bias Mitigation Algorithms in Representation Learning through Fairness MetricsCode1
JaxRobotarium: Training and Deploying Multi-Robot Policies in 10 MinutesCode1
Job-SDF: A Multi-Granularity Dataset for Job Skill Demand Forecasting and BenchmarkingCode1
Benchmarking Low-Shot Robustness to Natural Distribution ShiftsCode1
Jojajovai: A Parallel Guarani-Spanish Corpus for MT BenchmarkingCode1
ClearPose: Large-scale Transparent Object Dataset and BenchmarkCode1
EgoPlan-Bench: Benchmarking Multimodal Large Language Models for Human-Level PlanningCode1
Benchmarking and scaling of deep learning models for land cover image classificationCode1
Benchmarking Local Robustness of High-Accuracy Binary Neural Networks for Enhanced Traffic Sign RecognitionCode1
Show:102550
← PrevPage 88 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified