SOTAVerified

Benchmarking

Papers

Showing 751760 of 5548 papers

TitleStatusHype
Geometric Deep Learning for Structure-Based Drug Design: A SurveyCode1
A Comprehensive Study of the Robustness for LiDAR-based 3D Object Detectors against Adversarial AttacksCode1
A Multifaceted Benchmarking of Synthetic Electronic Health Record Generation ModelsCode1
BeHonest: Benchmarking Honesty in Large Language ModelsCode1
Benchmarking and Analyzing 3D Human Pose and Shape Estimation Beyond AlgorithmsCode1
AdaPool: Exponential Adaptive Pooling for Information-Retaining DownsamplingCode1
EgoToM: Benchmarking Theory of Mind Reasoning from Egocentric VideosCode1
CIPCaD-Bench: Continuous Industrial Process datasets for benchmarking Causal Discovery methodsCode1
Bench4KE: Benchmarking Automated Competency Question GenerationCode1
ClimART: A Benchmark Dataset for Emulating Atmospheric Radiative Transfer in Weather and Climate ModelsCode1
Show:102550
← PrevPage 76 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified