| EMOv2: Pushing 5M Vision Model Frontier | Dec 9, 2024 | Image Generationmodel | CodeCode Available | 2 |
| PSP-HDRI+: A Synthetic Dataset Generator for Pre-Training of Human-Centric Computer Vision Models | Jul 11, 2022 | Keypoint Estimation | CodeCode Available | 2 |
| OpenBox: A Python Toolkit for Generalized Black-box Optimization | Apr 26, 2023 | Experimental Design | CodeCode Available | 2 |
| When Attention Meets Fast Recurrence: Training Language Models with Reduced Compute | Feb 24, 2021 | GPULanguage Modeling | CodeCode Available | 2 |
| ICML 2023 Topological Deep Learning Challenge : Design and Results | Sep 26, 2023 | Deep Learning | CodeCode Available | 2 |
| Longhorn: State Space Models are Amortized Online Learners | Jul 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| CCPL: Contrastive Coherence Preserving Loss for Versatile Style Transfer | Jul 11, 2022 | Image-to-Image TranslationStyle Transfer | CodeCode Available | 2 |
| A mmWave Software-Defined Array Platform for Wireless Experimentation at 24-29.5 GHz | Sep 17, 2024 | | CodeCode Available | 2 |
| Empirical Asset Pricing with Large Language Model Agents | Sep 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements | Jun 27, 2025 | | CodeCode Available | 2 |
| DCoM: Active Learning for All Learners | Jul 1, 2024 | Active LearningAll | CodeCode Available | 2 |
| Foundation Models for Remote Sensing and Earth Observation: A Survey | Oct 22, 2024 | Earth ObservationHumanitarian | CodeCode Available | 2 |
| PMC-LLaMA: Towards Building Open-source Language Models for Medicine | Apr 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| SWE-bench Goes Live! | May 29, 2025 | | CodeCode Available | 2 |
| Uncertainty-Informed Deep Learning Models Enable High-Confidence Predictions for Digital Histopathology | Apr 9, 2022 | AttributeUncertainty Quantification | CodeCode Available | 2 |
| Accelerated Policy Learning with Parallel Differentiable Simulation | Apr 14, 2022 | Deep Reinforcement Learning | CodeCode Available | 2 |
| SimVP: Simpler yet Better Video Prediction | Jun 9, 2022 | PredictionVideo Prediction | CodeCode Available | 2 |
| Rethinking Imitation-based Planner for Autonomous Driving | Sep 19, 2023 | Autonomous DrivingData Augmentation | CodeCode Available | 2 |
| Contrastive Flow Matching | Jun 5, 2025 | | CodeCode Available | 2 |
| Conformal prediction interval for dynamic time-series | Oct 18, 2020 | Conformal PredictionEnsemble Learning | CodeCode Available | 2 |
| ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale | Mar 24, 2023 | | CodeCode Available | 2 |
| video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models | Jun 18, 2025 | Audio captioningLarge Language Model | CodeCode Available | 2 |
| InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition | May 21, 2025 | Earth ObservationObject | CodeCode Available | 2 |
| DiscoveryBench: Towards Data-Driven Discovery with Large Language Models | Jul 1, 2024 | Code GenerationSociology | CodeCode Available | 2 |
| Investigating image-based fallow weed detection performance on Raphanus sativus and Avena sativa at speeds up to 30 km h^-1 | May 17, 2023 | | CodeCode Available | 2 |
| Training Socially Aligned Language Models on Simulated Social Interactions | May 26, 2023 | | CodeCode Available | 2 |
| Stabilizing Transformer Training by Preventing Attention Entropy Collapse | Mar 11, 2023 | Automatic Speech Recognitionimage-classification | CodeCode Available | 2 |
| End-to-End Vectorized HD-map Construction with Piecewise Bezier Curve | Jun 16, 2023 | 3D geometryAutonomous Driving | CodeCode Available | 2 |
| Solving Data Quality Problems with Desbordante: a Demo | Jul 27, 2023 | Anomaly DetectionDescriptive | CodeCode Available | 2 |
| Dense Text-to-Image Generation with Attention Modulation | Aug 24, 2023 | Image GenerationText to Image Generation | CodeCode Available | 2 |
| DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models | Sep 7, 2023 | TruthfulQA | CodeCode Available | 2 |
| PyGraft: Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips | Sep 7, 2023 | BenchmarkingKnowledge Graphs | CodeCode Available | 2 |
| MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model | Nov 25, 2024 | Novel View Synthesis | CodeCode Available | 2 |
| Joint Audio and Speech Understanding | Sep 25, 2023 | | CodeCode Available | 2 |
| AdaLomo: Low-memory Optimization with Adaptive Learning Rate | Oct 16, 2023 | | CodeCode Available | 2 |
| Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows? | Jul 15, 2024 | Code Generation | CodeCode Available | 2 |
| Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting | Oct 16, 2023 | | CodeCode Available | 2 |
| Learning for CasADi: Data-driven Models in Numerical Optimization | Dec 10, 2023 | | CodeCode Available | 2 |
| Tokenize Anything via Prompting | Dec 14, 2023 | DecoderVisual Prompting | CodeCode Available | 2 |
| Diffusion Models without Classifier-free Guidance | Feb 17, 2025 | Conditional Image GenerationImage Generation | CodeCode Available | 2 |
| FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal Reasoning | Apr 1, 2025 | Audio-visual Question AnsweringAudio-Visual Question Answering (AVQA) | CodeCode Available | 2 |
| BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models | Jan 20, 2024 | Backdoor Attack | CodeCode Available | 2 |
| General Flow as Foundation Affordance for Scalable Robot Learning | Jan 21, 2024 | Prediction | CodeCode Available | 2 |
| VOLoc: Visual Place Recognition by Querying Compressed Lidar Map | Feb 25, 2024 | Pose EstimationTransfer Learning | CodeCode Available | 2 |
| DBConformer: Dual-Branch Convolutional Transformer for EEG Decoding | Jun 26, 2025 | EEGEeg Decoding | CodeCode Available | 2 |
| CLIP-EBC: CLIP Can Count Accurately through Enhanced Blockwise Classification | Mar 14, 2024 | ClassificationCrowd Counting | CodeCode Available | 2 |
| GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction | Apr 18, 2024 | Graph structure learningJoint Entity and Relation Extraction | CodeCode Available | 2 |
| RRHF: Rank Responses to Align Language Models with Human Feedback without tears | Apr 11, 2023 | Language ModellingLarge Language Model | CodeCode Available | 2 |
| GSGAN: Adversarial Learning for Hierarchical Generation of 3D Gaussian Splats | Jun 5, 2024 | 3D-Aware Image Synthesis3D Generation | CodeCode Available | 2 |
| CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for Task-Aware Parameter-Efficient Fine-tuning | Jun 7, 2024 | Instruction FollowingMath | CodeCode Available | 2 |