| NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results | Apr 22, 2024 | 4kImage Enhancement | CodeCode Available | 5 |
| MobileSAMv2: Faster Segment Anything to Everything | Dec 15, 2023 | DecoderKnowledge Distillation | CodeCode Available | 5 |
| PVUW 2024 Challenge on Complex Video Understanding: Methods and Results | Jun 24, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 4 |
| FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation | Mar 19, 2024 | Translationvalid | CodeCode Available | 4 |
| The Tenth NTIRE 2025 Efficient Super-Resolution Challenge Report | Apr 14, 2025 | Super-Resolutionvalid | CodeCode Available | 3 |
| The OpenLAM Challenges | Jan 20, 2025 | valid | CodeCode Available | 3 |
| TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation | Apr 25, 2024 | 3D Human Pose EstimationHuman Mesh Recovery | CodeCode Available | 3 |
| SynCode: LLM Generation with Grammar Augmentation | Mar 3, 2024 | Code Generationvalid | CodeCode Available | 3 |
| FinanceBench: A New Benchmark for Financial Question Answering | Nov 20, 2023 | How to refund a wrong transaction in PhonePeQuestion Answering | CodeCode Available | 3 |
| Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-Thought | Oct 3, 2022 | Mathematical ReasoningQuestion Answering | CodeCode Available | 3 |
| Rectified Flow: A Marginal Preserving Approach to Optimal Transport | Sep 29, 2022 | valid | CodeCode Available | 3 |
| DoWhy: An End-to-End Library for Causal Inference | Nov 9, 2020 | Causal Inferencevalid | CodeCode Available | 3 |
| PocketVina Enables Scalable and Highly Accurate Physically Valid Docking through Multi-Pocket Conditioning | Jun 24, 2025 | BenchmarkingDrug Discovery | CodeCode Available | 2 |
| SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks | Jun 12, 2025 | GitHub issue resolutionvalid | CodeCode Available | 2 |
| NTIRE 2025 Challenge on Image Super-Resolution (4): Methods and Results | Apr 20, 2025 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 2 |
| NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results | Apr 17, 2025 | Raindrop RemovalRain Removal | CodeCode Available | 2 |
| NTIRE 2025 Challenge on Cross-Domain Few-Shot Object Detection: Methods and Results | Apr 14, 2025 | Cross-Domain Few-ShotCross-Domain Few-Shot Object Detection | CodeCode Available | 2 |
| Modifying Large Language Model Post-Training for Diverse Creative Writing | Mar 21, 2025 | DiversityLanguage Modeling | CodeCode Available | 2 |
| DTGBrepGen: A Novel B-rep Generative Model through Decoupling Topology and Geometry | Mar 17, 2025 | valid | CodeCode Available | 2 |
| Tests for model misspecification in simulation-based inference: from local distortions to global model checks | Dec 19, 2024 | Anomaly Detectionmodel | CodeCode Available | 2 |
| M^3-20M: A Large-Scale Multi-Modal Molecule Dataset for AI-driven Drug Design and Discovery | Dec 8, 2024 | Drug DesignMolecular Property Prediction | CodeCode Available | 2 |
| Towards Generative Ray Path Sampling for Faster Point-to-Point Ray Tracing | Oct 31, 2024 | valid | CodeCode Available | 2 |
| MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses | Oct 9, 2024 | scientific discoveryvalid | CodeCode Available | 2 |
| Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent | Jul 31, 2024 | Translationvalid | CodeCode Available | 2 |
| Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages | Jul 3, 2024 | Language Modellingvalid | CodeCode Available | 2 |