| ResumeFlow: An LLM-facilitated Pipeline for Personalized Resume Generation and Refinement | Feb 9, 2024 | HallucinationLanguage Modelling | CodeCode Available | 3 |
| SpecInfer: Accelerating Generative Large Language Model Serving with Tree-based Speculative Inference and Verification | May 16, 2023 | DecoderLanguage Modeling | CodeCode Available | 3 |
| Detecting As Labeling: Rethinking LiDAR-camera Fusion in 3D Object Detection | Nov 13, 2023 | 3D Object Detectionobject-detection | CodeCode Available | 3 |
| VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration | Apr 12, 2022 | Speech DenoisingSpeech Enhancement | CodeCode Available | 3 |
| OmniGenBench: A Modular Platform for Reproducible Genomic Foundation Models Benchmarking | May 20, 2025 | Benchmarking | CodeCode Available | 3 |
| FlashDMoE: Fast Distributed MoE in a Single Kernel | Jun 5, 2025 | 16kCPU | CodeCode Available | 3 |
| RoHM: Robust Human Motion Reconstruction via Diffusion | Jan 16, 2024 | Denoising | CodeCode Available | 3 |
| Secrets of RLHF in Large Language Models Part I: PPO | Jul 11, 2023 | | CodeCode Available | 3 |
| CoMotion: Concurrent Multi-person 3D Motion | Apr 16, 2025 | 3D Pose EstimationPose Estimation | CodeCode Available | 3 |
| FinRL-DeepSeek: LLM-Infused Risk-Sensitive Reinforcement Learning for Trading Agents | Feb 11, 2025 | | CodeCode Available | 3 |
| AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation | Jun 12, 2025 | Video Generation | CodeCode Available | 3 |
| Learning Neural PDE Solvers with Parameter-Guided Channel Attention | Apr 27, 2023 | PDE Surrogate ModelingWeather Forecasting | CodeCode Available | 3 |
| Is Mamba Effective for Time Series Forecasting? | Mar 17, 2024 | Computational EfficiencyMamba | CodeCode Available | 3 |
| Objaverse-XL: A Universe of 10M+ 3D Objects | Jul 11, 2023 | DiversityNovel View Synthesis | CodeCode Available | 3 |
| Data Poisoning in LLMs: Jailbreak-Tuning and Scaling Laws | Aug 6, 2024 | Data Poisoning | CodeCode Available | 3 |
| StopThePop: Sorted Gaussian Splatting for View-Consistent Real-time Rendering | Feb 1, 2024 | Novel View Synthesis | CodeCode Available | 3 |
| Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model | May 11, 2022 | Packet Loss ConcealmentSpeech Enhancement | CodeCode Available | 3 |
| Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models | Apr 19, 2023 | Logical Reasoning | CodeCode Available | 3 |
| Deep learning tools for the measurement of animal behavior in neuroscience | Sep 30, 2019 | Deep LearningPose Estimation | CodeCode Available | 3 |
| Fast Sampling of Diffusion Models with Exponential Integrator | Apr 29, 2022 | GPU | CodeCode Available | 3 |
| Video4DGen: Enhancing Video and 4D Generation through Mutual Optimization | Apr 5, 2025 | 3D GenerationVideo Alignment | CodeCode Available | 3 |
| High-Fidelity Audio Compression with Improved RVQGAN | Jun 11, 2023 | Audio CompressionAudio Generation | CodeCode Available | 3 |
| BigVGAN: A Universal Neural Vocoder with Large-Scale Training | Jun 9, 2022 | Audio GenerationAudio Synthesis | CodeCode Available | 3 |
| Accelerating Scientific Discovery with Generative Knowledge Extraction, Graph-Based Representation, and Multimodal Intelligent Graph Reasoning | Mar 18, 2024 | Graph SamplingKnowledge Graphs | CodeCode Available | 3 |
| Swin3D++: Effective Multi-Source Pretraining for 3D Indoor Scene Understanding | Feb 22, 2024 | DiversityScene Understanding | CodeCode Available | 3 |
| Towards Visual Grounding: A Survey | Dec 28, 2024 | Phrase GroundingReferring Expression | CodeCode Available | 3 |
| G-Memory: Tracing Hierarchical Memory for Multi-Agent Systems | Jun 9, 2025 | Large Language Model | CodeCode Available | 3 |
| Digitizing Touch with an Artificial Multimodal Fingertip | Nov 4, 2024 | ARC | CodeCode Available | 3 |
| DrivAerNet++: A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks | Jun 13, 2024 | Benchmarking | CodeCode Available | 3 |
| CORL: Research-oriented Deep Offline Reinforcement Learning Library | Oct 13, 2022 | BenchmarkingD4RL | CodeCode Available | 3 |
| Data Filtering Networks | Sep 29, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| FastMap: Revisiting Dense and Scalable Structure from Motion | May 7, 2025 | GPU | CodeCode Available | 3 |
| ToRL: Scaling Tool-Integrated RL | Mar 30, 2025 | Mathreinforcement-learning | CodeCode Available | 3 |
| Safety of Multimodal Large Language Models on Images and Texts | Feb 1, 2024 | Survey | CodeCode Available | 3 |
| Low-Rank Few-Shot Adaptation of Vision-Language Models | May 28, 2024 | Few-Shot Learningparameter-efficient fine-tuning | CodeCode Available | 3 |
| OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text | Jun 12, 2024 | In-Context Learning | CodeCode Available | 3 |
| Large Spatial Model: End-to-end Unposed Images to Semantic 3D | Oct 24, 2024 | 3D ReconstructionAttribute | CodeCode Available | 3 |
| FlatQuant: Flatness Matters for LLM Quantization | Oct 12, 2024 | Quantization | CodeCode Available | 3 |
| Optimal Stepsize for Diffusion Sampling | Mar 27, 2025 | DenoisingImage Generation | CodeCode Available | 3 |
| DOGS: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus | May 22, 2024 | 3DGS3D Reconstruction | CodeCode Available | 3 |
| MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model | Apr 30, 2024 | Motion GenerationMotion Synthesis | CodeCode Available | 3 |
| Benchmarking LLMs via Uncertainty Quantification | Jan 23, 2024 | BenchmarkingUncertainty Quantification | CodeCode Available | 3 |
| Olympus: A Universal Task Router for Computer Vision Tasks | Dec 12, 2024 | | CodeCode Available | 3 |
| A guide to convolution arithmetic for deep learning | Mar 23, 2016 | Deep Learning | CodeCode Available | 3 |
| ARC Prize 2024: Technical Report | Dec 5, 2024 | ARCProgram Synthesis | CodeCode Available | 3 |
| Tokenization, Fusion, and Augmentation: Towards Fine-grained Multi-modal Entity Representation | Apr 15, 2024 | Contrastive LearningDescriptive | CodeCode Available | 3 |
| LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis | Apr 3, 2024 | 3D Reconstruction4D reconstruction | CodeCode Available | 3 |
| Defeating Prompt Injections by Design | Mar 24, 2025 | | CodeCode Available | 3 |
| SA-Med2D-20M Dataset: Segment Anything in 2D Medical Imaging with 20 Million masks | Nov 20, 2023 | DiversityImage Segmentation | CodeCode Available | 3 |
| MeshXL: Neural Coordinate Field for Generative 3D Foundation Models | May 31, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |