| DrivAerNet++: A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks | Jun 13, 2024 | Benchmarking | CodeCode Available | 3 |
| CORL: Research-oriented Deep Offline Reinforcement Learning Library | Oct 13, 2022 | BenchmarkingD4RL | CodeCode Available | 3 |
| Data Filtering Networks | Sep 29, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| FastMap: Revisiting Dense and Scalable Structure from Motion | May 7, 2025 | GPU | CodeCode Available | 3 |
| ToRL: Scaling Tool-Integrated RL | Mar 30, 2025 | Mathreinforcement-learning | CodeCode Available | 3 |
| Safety of Multimodal Large Language Models on Images and Texts | Feb 1, 2024 | Survey | CodeCode Available | 3 |
| Low-Rank Few-Shot Adaptation of Vision-Language Models | May 28, 2024 | Few-Shot Learningparameter-efficient fine-tuning | CodeCode Available | 3 |
| OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text | Jun 12, 2024 | In-Context Learning | CodeCode Available | 3 |
| Large Spatial Model: End-to-end Unposed Images to Semantic 3D | Oct 24, 2024 | 3D ReconstructionAttribute | CodeCode Available | 3 |
| FlatQuant: Flatness Matters for LLM Quantization | Oct 12, 2024 | Quantization | CodeCode Available | 3 |
| Optimal Stepsize for Diffusion Sampling | Mar 27, 2025 | DenoisingImage Generation | CodeCode Available | 3 |
| DOGS: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus | May 22, 2024 | 3DGS3D Reconstruction | CodeCode Available | 3 |
| MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model | Apr 30, 2024 | Motion GenerationMotion Synthesis | CodeCode Available | 3 |
| Benchmarking LLMs via Uncertainty Quantification | Jan 23, 2024 | BenchmarkingUncertainty Quantification | CodeCode Available | 3 |
| Olympus: A Universal Task Router for Computer Vision Tasks | Dec 12, 2024 | | CodeCode Available | 3 |
| A guide to convolution arithmetic for deep learning | Mar 23, 2016 | Deep Learning | CodeCode Available | 3 |
| ARC Prize 2024: Technical Report | Dec 5, 2024 | ARCProgram Synthesis | CodeCode Available | 3 |
| Tokenization, Fusion, and Augmentation: Towards Fine-grained Multi-modal Entity Representation | Apr 15, 2024 | Contrastive LearningDescriptive | CodeCode Available | 3 |
| LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis | Apr 3, 2024 | 3D Reconstruction4D reconstruction | CodeCode Available | 3 |
| Defeating Prompt Injections by Design | Mar 24, 2025 | | CodeCode Available | 3 |
| SA-Med2D-20M Dataset: Segment Anything in 2D Medical Imaging with 20 Million masks | Nov 20, 2023 | DiversityImage Segmentation | CodeCode Available | 3 |
| MeshXL: Neural Coordinate Field for Generative 3D Foundation Models | May 31, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Faithful Logical Reasoning via Symbolic Chain-of-Thought | May 28, 2024 | Logical Reasoning | CodeCode Available | 3 |
| Multimodal Table Understanding | Jun 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| KV-Edit: Training-Free Image Editing for Precise Background Preservation | Feb 24, 2025 | Text-based Image Editing | CodeCode Available | 3 |
| DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving | Aug 1, 2024 | | CodeCode Available | 3 |
| VideoGen-Eval: Agent-based System for Video Generation Evaluation | Mar 30, 2025 | DiversityVideo Generation | CodeCode Available | 3 |
| LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis | May 5, 2025 | ChatbotDecoder | CodeCode Available | 3 |
| JAFAR: Jack up Any Feature at Any Resolution | Jun 10, 2025 | Feature Upsampling | CodeCode Available | 3 |
| VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation | Dec 30, 2024 | Video GenerationVideo Quality Assessment | CodeCode Available | 3 |
| GENERator: A Long-Context Generative Genomic Foundation Model | Feb 11, 2025 | model | CodeCode Available | 3 |
| EVEv2: Improved Baselines for Encoder-Free Vision-Language Models | Feb 10, 2025 | Decoder | CodeCode Available | 3 |
| SemiKong: Curating, Training, and Evaluating A Semiconductor Industry-Specific Large Language Model | Nov 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Half-Inverse Gradients for Physical Deep Learning | Mar 18, 2022 | Deep Learning | CodeCode Available | 3 |
| pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction | Dec 19, 2023 | 3D ReconstructionGeneralizable Novel View Synthesis | CodeCode Available | 3 |
| OptiMUS: Scalable Optimization Modeling with (MI)LP Solvers and Large Language Models | Feb 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| DisCo: Disentangled Control for Realistic Human Dance Generation | Jun 30, 2023 | Attribute | CodeCode Available | 3 |
| ^2DFT: A Universal Quantum Chemistry Dataset of Drug-Like Molecules and a Benchmark for Neural Network Potentials | Jun 20, 2024 | Drug DiscoveryMolecular Property Prediction | CodeCode Available | 3 |
| DARWIN 1.5: Large Language Models as Materials Science Adapted Learners | Dec 16, 2024 | Large Language ModelMulti-Task Learning | CodeCode Available | 3 |
| NeuralOM: Neural Ocean Model for Subseasonal-to-Seasonal Simulation | May 27, 2025 | Computational EfficiencyGraph Neural Network | CodeCode Available | 3 |
| A Comprehensive Survey on Segment Anything Model for Vision and Beyond | May 14, 2023 | | CodeCode Available | 3 |
| HLOB -- Information Persistence and Structure in Limit Order Books | May 29, 2024 | Deep Learning | CodeCode Available | 3 |
| Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving | Aug 14, 2024 | 3D Object Detection3D Object Tracking | CodeCode Available | 3 |
| Cold-Start Recommendation towards the Era of Large Language Models (LLMs): A Comprehensive Survey and Roadmap | Jan 3, 2025 | Recommendation SystemsWorld Knowledge | CodeCode Available | 3 |
| Mini-Splatting: Representing Scenes with a Constrained Number of Gaussians | Mar 21, 2024 | Binarization | CodeCode Available | 3 |
| Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow | Oct 9, 2024 | | CodeCode Available | 3 |
| Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams | Jun 12, 2024 | cross-modal alignmentLanguage Modelling | CodeCode Available | 3 |
| Opportunities and Risks of LLMs for Scalable Deliberation with Polis | Jun 20, 2023 | | CodeCode Available | 3 |
| RePlay: a Recommendation Framework for Experimentation and Production Use | Sep 11, 2024 | Recommendation Systems | CodeCode Available | 3 |
| Deep Reinforcement Learning | Oct 15, 2018 | Deep Reinforcement LearningManagement | CodeCode Available | 3 |