SOTAVerified

Benchmarking

Papers

Showing 231240 of 5548 papers

TitleStatusHype
WayveScenes101: A Dataset and Benchmark for Novel View Synthesis in Autonomous DrivingCode2
InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph PriorCode2
HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible GuidanceCode2
SH17: A Dataset for Human Safety and Personal Protective Equipment Detection in Manufacturing IndustryCode2
Benchmarking Complex Instruction-Following with Multiple Constraints CompositionCode2
Craftium: An Extensible Framework for Creating Reinforcement Learning EnvironmentsCode2
CoIR: A Comprehensive Benchmark for Code Information Retrieval ModelsCode2
Benchmarking Predictive Coding Networks -- Made SimpleCode2
FairMedFM: Fairness Benchmarking for Medical Imaging Foundation ModelsCode2
MMLongBench-Doc: Benchmarking Long-context Document Understanding with VisualizationsCode2
Show:102550
← PrevPage 24 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified