SOTAVerified

Benchmarking

Papers

Showing 1461-1470 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| Just Rank: Rethinking Evaluation with Word and Sentence Similarities | Code | 1 |
| Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning | Code | 1 |
| Benchmarking Graph Neural Networks on Dynamic Link Prediction | Code | 1 |
| Amharic LLaMA and LLaVA: Multimodal LLMs for Low Resource Languages | Code | 1 |
| Explainable Global Wildfire Prediction Models using Graph Neural Networks | Code | 1 |
| Exploring Graph Tasks with Pure LLMs: A Comprehensive Benchmark and Investigation | Code | 1 |
| Benchmarking Graph Neural Networks for FMRI analysis | Code | 1 |
| RobustPointSet: A Dataset for Benchmarking Robustness of Point Cloud Classifiers | Code | 1 |
| Deluca -- A Differentiable Control Library: Environments, Methods, and Benchmarking | Code | 1 |
| BiCo-Net: Regress Globally, Match Locally for Robust 6D Pose Estimation | Code | 1 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |