| POPGym: Benchmarking Partially Observable Reinforcement Learning | Mar 3, 2023 | BenchmarkingGPU | CodeCode Available | 2 |
| Fortuna: A Library for Uncertainty Quantification in Deep Learning | Feb 8, 2023 | Bayesian InferenceBenchmarking | CodeCode Available | 2 |
| Desbordante: from benchmarking suite to high-performance science-intensive data profiler (preprint) | Jan 14, 2023 | Benchmarking | CodeCode Available | 2 |
| Benchmarking the Robustness of LiDAR Semantic Segmentation Models | Jan 3, 2023 | Autonomous DrivingBenchmarking | CodeCode Available | 2 |
| Ultra-High-Definition Low-Light Image Enhancement: A Benchmark and Transformer-Based Method | Dec 22, 2022 | 4k8k | CodeCode Available | 2 |
| PyPop7: A Pure-Python Library for Population-Based Black-Box Optimization | Dec 12, 2022 | BenchmarkingEvolutionary Algorithms | CodeCode Available | 2 |
| Why do tree-based models still outperform deep learning on typical tabular data? | Nov 28, 2022 | Benchmarking | CodeCode Available | 2 |
| Immersive Neural Graphics Primitives | Nov 24, 2022 | BenchmarkingNeRF | CodeCode Available | 2 |
| LaMAR: Benchmarking Localization and Mapping for Augmented Reality | Oct 19, 2022 | BenchmarkingDiversity | CodeCode Available | 2 |
| rPPG-Toolbox: Deep Remote PPG Toolbox | Oct 3, 2022 | BenchmarkingData Augmentation | CodeCode Available | 2 |
| State-specific protein-ligand complex structure prediction with a multi-scale deep generative model | Sep 30, 2022 | BenchmarkingBlind Docking | CodeCode Available | 2 |
| Building Normalizing Flows with Stochastic Interpolants | Sep 30, 2022 | BenchmarkingDensity Estimation | CodeCode Available | 2 |
| MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code Generation | Aug 17, 2022 | BenchmarkingCode Generation | CodeCode Available | 2 |
| Panoptic Scene Graph Generation | Jul 22, 2022 | BenchmarkingPanoptic Scene Graph Generation | CodeCode Available | 2 |
| Why do tree-based models still outperform deep learning on tabular data? | Jul 18, 2022 | Benchmarking | CodeCode Available | 2 |
| VMAS: A Vectorized Multi-Agent Simulator for Collective Robot Learning | Jul 7, 2022 | BenchmarkingMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| Understanding Performance of Long-Document Ranking Models through Comprehensive Evaluation and Leaderboarding | Jul 4, 2022 | BenchmarkingDocument Ranking | CodeCode Available | 2 |
| The ArtBench Dataset: Benchmarking Generative Models with Artworks | Jun 22, 2022 | BenchmarkingConditional Image Generation | CodeCode Available | 2 |
| DaisyRec 2.0: Benchmarking Recommendation for Rigorous Evaluation | Jun 22, 2022 | BenchmarkingRecommendation Systems | CodeCode Available | 2 |
| Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations | Jun 9, 2022 | Benchmarkingcontinuous-control | CodeCode Available | 2 |
| Fast Vision Transformers with HiLo Attention | May 26, 2022 | BenchmarkingEfficient ViTs | CodeCode Available | 2 |
| BARS: Towards Open Benchmarking for Recommender Systems | May 19, 2022 | BenchmarkingClick-Through Rate Prediction | CodeCode Available | 2 |
| K-LITE: Learning Transferable Visual Models with External Knowledge | Apr 20, 2022 | BenchmarkingDescriptive | CodeCode Available | 2 |
| Deep Visual Geo-localization Benchmark | Apr 7, 2022 | BenchmarkingData Augmentation | CodeCode Available | 2 |
| Multi-Class Road User Detection With 3+1D Radar in the View-of-Delft Dataset | Apr 1, 2022 | 3D Object DetectionBenchmarking | CodeCode Available | 2 |