SOTAVerified

Benchmarking

Papers

Showing 27012710 of 5548 papers

TitleStatusHype
Benchmarking Multi-Domain Active Learning on Image Classification0
Benchmarking and Enhancing Disentanglement in Concept-Residual Models0
A Video is Worth 10,000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval0
Event-based Continuous Color Video Decompression from Single Frames0
Enhancing Ligand Pose Sampling for Molecular DockingCode1
LucidDreaming: Controllable Object-Centric 3D Generation0
Controlgym: Large-Scale Control Environments for Benchmarking Reinforcement Learning AlgorithmsCode1
Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy EvaluationCode1
Seg2Reg: Differentiable 2D Segmentation to 1D Regression Rendering for 360 Room Layout Reconstruction0
TaskBench: Benchmarking Large Language Models for Task AutomationCode6
Show:102550
← PrevPage 271 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified