SOTAVerified

Benchmarking

Papers

Showing 351375 of 5548 papers

TitleStatusHype
POPGym: Benchmarking Partially Observable Reinforcement LearningCode2
Fortuna: A Library for Uncertainty Quantification in Deep LearningCode2
Desbordante: from benchmarking suite to high-performance science-intensive data profiler (preprint)Code2
Benchmarking the Robustness of LiDAR Semantic Segmentation ModelsCode2
Ultra-High-Definition Low-Light Image Enhancement: A Benchmark and Transformer-Based MethodCode2
PyPop7: A Pure-Python Library for Population-Based Black-Box OptimizationCode2
Why do tree-based models still outperform deep learning on typical tabular data?Code2
Immersive Neural Graphics PrimitivesCode2
LaMAR: Benchmarking Localization and Mapping for Augmented RealityCode2
rPPG-Toolbox: Deep Remote PPG ToolboxCode2
State-specific protein-ligand complex structure prediction with a multi-scale deep generative modelCode2
Building Normalizing Flows with Stochastic InterpolantsCode2
MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code GenerationCode2
Panoptic Scene Graph GenerationCode2
Why do tree-based models still outperform deep learning on tabular data?Code2
VMAS: A Vectorized Multi-Agent Simulator for Collective Robot LearningCode2
Understanding Performance of Long-Document Ranking Models through Comprehensive Evaluation and LeaderboardingCode2
The ArtBench Dataset: Benchmarking Generative Models with ArtworksCode2
DaisyRec 2.0: Benchmarking Recommendation for Rigorous EvaluationCode2
Challenges and Opportunities in Offline Reinforcement Learning from Visual ObservationsCode2
Fast Vision Transformers with HiLo AttentionCode2
BARS: Towards Open Benchmarking for Recommender SystemsCode2
K-LITE: Learning Transferable Visual Models with External KnowledgeCode2
Deep Visual Geo-localization BenchmarkCode2
Multi-Class Road User Detection With 3+1D Radar in the View-of-Delft DatasetCode2
Show:102550
← PrevPage 15 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified