| RM-R1: Reward Modeling as Reasoning | May 5, 2025 | Math, Reinforcement Learning (RL) | Code Available | 2 |
| OBELiX: A Curated Dataset of Crystal Structures and Experimentally Measured Ionic Conductivities for Lithium Solid-State Electrolytes | Feb 20, 2025 | | Code Available | 2 |
| pyKT: A Python Library to Benchmark Deep Learning based Knowledge Tracing Models | Jun 23, 2022 | Knowledge Tracing, valid | Code Available | 2 |
| Lemur: Harmonizing Natural Language and Code for Language Agents | Oct 10, 2023 | | Code Available | 2 |
| FewJoint: A Few-shot Learning Benchmark for Joint Language Understanding | Sep 17, 2020 | Few-Shot Learning | Code Available | 2 |
| ForesightNav: Learning Scene Imagination for Efficient Exploration | Apr 22, 2025 | Efficient Exploration, Navigate | Code Available | 2 |
| MARFT: Multi-Agent Reinforcement Fine-Tuning | Apr 21, 2025 | | Code Available | 2 |
| DiSA: Diffusion Step Annealing in Autoregressive Image Generation | May 26, 2025 | Denoising, Image Generation | Code Available | 2 |
| Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models | May 27, 2025 | Concept Alignment, object-detection | Code Available | 2 |
| TaskCraft: Automated Generation of Agentic Tasks | Jun 11, 2025 | | Code Available | 2 |
| Audio synthesizer inversion in symmetric parameter spaces with approximately equivariant flow matching | Jun 8, 2025 | | Code Available | 2 |
| LeanExplore: A search engine for Lean 4 declarations | Jun 4, 2025 | Automated Theorem Proving | Code Available | 2 |
| Improving spliced alignment by modeling splice sites with deep learning | Jun 15, 2025 | | Code Available | 2 |
| any4: Learned 4-bit Numeric Representation for LLMs | Jul 7, 2025 | GPU, GSM8K | Code Available | 2 |
| Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task | Sep 24, 2018 | Semantic Parsing, Text to SQL | Code Available | 2 |
| Session-based Social Recommendation via Dynamic Graph Attention Networks | Feb 25, 2019 | Graph Attention, Recommendation Systems | Code Available | 2 |
| Bag of Tricks and A Strong Baseline for Deep Person Re-identification | Mar 17, 2019 | Person Re-Identification | Code Available | 2 |
| Measuring Coding Challenge Competence With APPS | May 20, 2021 | BIG-bench Machine Learning, Code Generation | Code Available | 2 |
| Learning Semantic Segmentation of Large-Scale Point Clouds with Random Sampling | Jul 6, 2021 | Segmentation, Semantic Segmentation | Code Available | 2 |
| Learning To Describe Player Form in The MLB | Sep 11, 2021 | Contrastive Learning, Form | Code Available | 2 |
| Learning Efficient Online 3D Bin Packing on Packing Configuration Trees | Sep 29, 2021 | 3D Bin Packing, Deep Reinforcement Learning | Code Available | 2 |
| DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing | Nov 18, 2021 | Language Modeling, Language Modelling | Code Available | 2 |
| NeRF in the Dark: High Dynamic Range View Synthesis from Noisy Raw Images | Nov 26, 2021 | NeRF, Novel View Synthesis | Code Available | 2 |
| UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models | Jan 16, 2022 | Few-Shot Learning, Question Answering | Code Available | 2 |
| ERS: a novel comprehensive endoscopy image dataset for machine learning, compliant with the MST 3.0 specification | Jan 21, 2022 | BIG-bench Machine Learning, image-classification | Code Available | 2 |
| Derm1M: A Million-scale Vision-Language Dataset Aligned with Clinical Ontology Knowledge for Dermatology | Mar 19, 2025 | Cross-Modal Retrieval, Diagnostic | Code Available | 2 |
| Cedille: A large autoregressive French language model | Feb 7, 2022 | Few-Shot Learning, Language Modeling | Code Available | 2 |
| Iterative Corresponding Geometry: Fusing Region and Depth for Highly Efficient 3D Tracking of Textureless Objects | Mar 10, 2022 | 3D Object Tracking, 6D Pose Estimation | Code Available | 2 |
| ChartQA: A Benchmark for Question Answering about Charts with Visual and Logical Reasoning | Mar 19, 2022 | Chart Question Answering, Logical Reasoning | Code Available | 2 |
| scikit-fda: A Python Package for Functional Data Analysis | Nov 4, 2022 | Model Selection | Code Available | 2 |
| TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation | Apr 12, 2022 | Segmentation, Semantic Segmentation | Code Available | 2 |
| Perturbation Augmentation for Fairer NLP | May 25, 2022 | Fairness | Code Available | 2 |
| HaGRID - HAnd Gesture Recognition Image Dataset | Jun 16, 2022 | Gesture Recognition, Hand Detection | Code Available | 2 |
| Unsupervised High-Resolution Portrait Gaze Correction and Animation | Jul 1, 2022 | Image Inpainting, Vocal Bursts Intensity Prediction | Code Available | 2 |
| A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning | Aug 16, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 2 |
| Visual Prompting via Image Inpainting | Sep 1, 2022 | Colorization, Edge Detection | Code Available | 2 |
| Scalable SoftGroup for 3D Instance Segmentation on Point Clouds | Sep 17, 2022 | 3D Instance Segmentation, Instance Segmentation | Code Available | 2 |
| SinDiffusion: Learning a Diffusion Model from a Single Natural Image | Nov 22, 2022 | Denoising, Diversity | Code Available | 2 |
| Diffusion Probabilistic Models beat GANs on Medical Images | Dec 14, 2022 | Denoising, Diversity | Code Available | 2 |
| Physics-Informed Neural Networks for Prognostics and Health Management of Lithium-Ion Batteries | Jan 2, 2023 | Management | Code Available | 2 |
| Human-in-the-loop Embodied Intelligence with Interactive Simulation Environment for Surgical Robot Learning | Jan 1, 2023 | | Code Available | 2 |
| Robust Dynamic Radiance Fields | Jan 5, 2023 | | Code Available | 2 |
| ClimaX: A foundation model for weather and climate | Jan 24, 2023 | model, Self-Supervised Learning | Code Available | 2 |
| SceneDreamer: Unbounded 3D Scene Generation from 2D Image Collections | Feb 2, 2023 | Scene Generation | Code Available | 2 |
| EdgeYOLO: An Edge-Real-Time Object Detector | Feb 15, 2023 | Data Augmentation, Edge-computing | Code Available | 2 |
| DIRE for Diffusion-Generated Image Detection | Mar 16, 2023 | | Code Available | 2 |
| A Dynamic Multi-Scale Voxel Flow Network for Video Prediction | Mar 17, 2023 | Video Prediction | Code Available | 2 |
| Leapfrog Diffusion Model for Stochastic Trajectory Prediction | Mar 20, 2023 | Denoising, model | Code Available | 2 |
| Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection | Mar 21, 2023 | 3D Multi-Object Tracking, 3D Object Detection | Code Available | 2 |
| DATR: Unsupervised Domain Adaptive Detection Transformer with Dataset-Level Adaptation and Prototypical Alignment | May 20, 2024 | Contrastive Learning, Domain Adaptation | Code Available | 2 |