SOTAVerified

Benchmarking

Papers

Showing 1381-1390 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| LIBMoE: A Library for comprehensive benchmarking Mixture of Experts in Large Language Models | Code | 1 |
| LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators | Code | 2 |
| LLM4Mat-Bench: Benchmarking Large Language Models for Materials Property Prediction | Code | 1 |
| IdeaBench: Benchmarking Large Language Models for Research Idea Generation | Code | 0 |
| Pedestrian Trajectory Prediction with Missing Data: Datasets, Imputation, and Benchmarking | Code | 1 |
| Benchmark Data Repositories for Better Benchmarking | | 0 |
| XRDSLAM: A Flexible and Modular Framework for Deep Learning based SLAM | Code | 3 |
| EMGBench: Benchmarking Out-of-Distribution Generalization and Adaptation for Electromyography | Code | 1 |
| AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents | Code | 3 |
| AllClear: A Comprehensive Dataset and Benchmark for Cloud Removal in Satellite Imagery | Code | 1 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |