SOTAVerified

Benchmarking

Papers

Showing 3091–3100 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning | Code | 1 |
| Decoding the Enigma: Benchmarking Humans and AIs on the Many Facets of Working Memory | Code | 1 |
| The Extractive-Abstractive Axis: Measuring Content "Borrowing" in Generative Language Models | | 0 |
| SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models | Code | 1 |
| Benchmarking Potential Based Rewards for Learning Humanoid Locomotion | Code | 2 |
| On the Real-Time Semantic Segmentation of Aphid Clusters in the Wild | | 0 |
| Efficient Prediction of Peptide Self-assembly through Sequential and Graphical Encoding | Code | 1 |
| Examining the Effects of Degree Distribution and Homophily in Graph Learning Models | Code | 1 |
| Towards Heterogeneous Long-tailed Learning: Benchmarking, Metrics, and Toolbox | Code | 1 |
| Approaches for benchmarking single-cell gene regulatory network inference methods | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |