| MVMoE: Multi-Task Vehicle Routing Solver with Mixture-of-Experts | May 2, 2024 | Combinatorial Optimization, Mixture-of-Experts | Code Available | 3 |
| On the use of deep learning for phase recovery | Aug 2, 2023 | Deep Learning | Code Available | 3 |
| Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models | Mar 19, 2024 | Hallucination | Code Available | 3 |
| NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer | May 24, 2024 | Novel View Synthesis | Code Available | 3 |
| Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model | Jan 28, 2022 | Few-Shot Learning, Language Modeling | Code Available | 3 |
| MAPIE: an open-source library for distribution-free uncertainty quantification | Jul 25, 2022 | Conformal Prediction, Multi-class Classification | Code Available | 3 |
| PhysX: Physical-Grounded 3D Asset Generation | Jul 16, 2025 | 3D Generation, Image to 3D | Code Available | 3 |
| Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation | Apr 5, 2024 | Decoder, Mamba | Code Available | 3 |
| HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale | Sep 9, 2024 | Code Generation, Fault localization | Code Available | 3 |
| X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages | May 7, 2023 | Attribute, Instruction Following | Code Available | 3 |
| DeSiRe-GS: 4D Street Gaussians for Static-Dynamic Decomposition and Surface Reconstruction for Urban Driving Scenes | Nov 18, 2024 | Autonomous Driving, Surface Reconstruction | Code Available | 3 |
| Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning | Jan 26, 2023 | Benchmarking, Deep Reinforcement Learning | Code Available | 3 |
| LLM4CP: Adapting Large Language Models for Channel Prediction | Jun 20, 2024 | Prediction, Time Series Analysis | Code Available | 3 |
| Universal Actions for Enhanced Embodied Foundation Models | Jan 17, 2025 | | Code Available | 3 |
| ChatRex: Taming Multimodal LLM for Joint Perception and Understanding | Nov 27, 2024 | | Code Available | 3 |
| DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting | Nov 26, 2024 | Camera Calibration, Depth Estimation | Code Available | 3 |
| OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection | Feb 27, 2025 | Action Detection, Benchmarking | Code Available | 3 |
| Relaxing Accurate Initialization Constraint for 3D Gaussian Splatting | Mar 14, 2024 | 3DGS, 3D Reconstruction | Code Available | 3 |
| MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition | Apr 26, 2024 | Emotion Recognition, Multi-Label Learning | Code Available | 3 |
| DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders | Dec 22, 2022 | Colorization, Decoder | Code Available | 3 |
| PCDCNet: A Surrogate Model for Air Quality Forecasting with Physical-Chemical Dynamics and Constraints | May 26, 2025 | Deep Learning | Code Available | 3 |
| MACE: Mass Concept Erasure in Diffusion Models | Mar 10, 2024 | Text-to-Image Generation | Code Available | 3 |
| MAPE-PPI: Towards Effective and Efficient Protein-Protein Interaction Prediction via Microenvironment-Aware Protein Embedding | Feb 22, 2024 | Computational Efficiency, Prediction | Code Available | 3 |
| TopoTune : A Framework for Generalized Combinatorial Complex Neural Networks | Oct 9, 2024 | Graph Neural Network | Code Available | 3 |
| FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations | Nov 16, 2024 | Visual Storytelling | Code Available | 3 |
| DoWhy: An End-to-End Library for Causal Inference | Nov 9, 2020 | Causal Inference, valid | Code Available | 3 |
| Relative Pose Estimation through Affine Corrections of Monocular Depth Priors | Jan 9, 2025 | Depth Estimation, Monocular Depth Estimation | Code Available | 3 |
| DistiLLM: Towards Streamlined Distillation for Large Language Models | Feb 6, 2024 | Instruction Following, Knowledge Distillation | Code Available | 3 |
| TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos | Apr 24, 2025 | MME, Video MME | Code Available | 3 |
| R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization | Mar 17, 2025 | | Code Available | 3 |
| Music2Latent: Consistency Autoencoders for Latent Audio Compression | Aug 12, 2024 | Audio Compression, Information Retrieval | Code Available | 3 |
| Advanced Video Inpainting Using Optical Flow-Guided Efficient Diffusion | Dec 1, 2024 | Denoising, Optical Flow Estimation | Code Available | 3 |
| MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection | Apr 9, 2024 | Anomaly Detection, Decoder | Code Available | 3 |
| A Survey on the Memory Mechanism of Large Language Model based Agents | Apr 21, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| ACEGEN: Reinforcement learning of generative chemical agents for drug discovery | May 7, 2024 | Benchmarking, Decision Making | Code Available | 3 |
| Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning | Oct 11, 2022 | reinforcement-learning, Reinforcement Learning | Code Available | 3 |
| RiNALMo: General-Purpose RNA Language Models Can Generalize Well on Structure Prediction Tasks | Feb 29, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| Embodied Understanding of Driving Scenarios | Mar 7, 2024 | Autonomous Driving, Language Modeling | Code Available | 3 |
| Personalized Image Generation with Deep Generative Models: A Decade Survey | Feb 18, 2025 | Image Generation, Personalized Image Generation | Code Available | 3 |
| R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO | May 22, 2025 | Reinforcement Learning (RL) | Code Available | 3 |
| Datasheet for the Pile | Jan 13, 2022 | Language Modeling, Language Modelling | Code Available | 3 |
| UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition | Apr 23, 2024 | Decoder, Diversity | Code Available | 3 |
| imitation: Clean Imitation Learning Implementations | Nov 22, 2022 | Imitation Learning, reinforcement-learning | Code Available | 3 |
| Efficient Video Action Detection with Token Dropout and Context Refinement | Apr 17, 2023 | Action Detection, Decoder | Code Available | 3 |
| Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization | May 23, 2024 | | Code Available | 3 |
| LLM-Pruner: On the Structural Pruning of Large Language Models | May 19, 2023 | Text Generation, zero-shot-classification | Code Available | 3 |
| BTLM-3B-8K: 7B Parameter Performance in a 3B Parameter Model | Sep 20, 2023 | 8k, Language Modeling | Code Available | 3 |
| HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene Reconstruction | Nov 27, 2024 | 3DGS | Code Available | 3 |
| EfficientTrain++: Generalized Curriculum Learning for Efficient Visual Backbone Training | May 14, 2024 | Data Augmentation, Self-Supervised Learning | Code Available | 3 |
| White-Box Transformers via Sparse Rate Reduction | Jun 1, 2023 | Representation Learning | Code Available | 3 |