SOTAVerified

Benchmarking

Papers

Showing 381390 of 5548 papers

TitleStatusHype
PlantSeg: A Large-Scale In-the-wild Dataset for Plant Disease SegmentationCode2
PocketVina Enables Scalable and Highly Accurate Physically Valid Docking through Multi-Pocket ConditioningCode2
ClimateLearn: Benchmarking Machine Learning for Weather and Climate ModelingCode2
Are large language models superhuman chemists?Code2
Class-incremental Learning for Time Series: Benchmark and EvaluationCode2
COALA: A Practical and Vision-Centric Federated Learning PlatformCode2
Challenges and Opportunities in Offline Reinforcement Learning from Visual ObservationsCode2
CausalGym: Benchmarking causal interpretability methods on linguistic tasksCode2
Authorship Obfuscation in Multilingual Machine-Generated Text DetectionCode2
Event-Based Motion MagnificationCode2
Show:102550
← PrevPage 39 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified