| Very high resolution canopy height maps from RGB imagery using self-supervised vision transformer and convolutional decoder trained on Aerial Lidar | Apr 14, 2023 | DecoderSelf-Supervised Learning | CodeCode Available | 2 |
| SceneTex: High-Quality Texture Synthesis for Indoor Scenes via Diffusion Priors | Nov 28, 2023 | DecoderTexture Synthesis | CodeCode Available | 2 |
| InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph Prior | Jul 10, 2024 | BenchmarkingDecoder | CodeCode Available | 2 |
| GAIA-1: A Generative World Model for Autonomous Driving | Sep 29, 2023 | Autonomous Driving | CodeCode Available | 2 |
| NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers | Apr 18, 2023 | In-Context LearningSpeech Synthesis | CodeCode Available | 2 |
| Efficient compilation of expressive problem space specifications to neural network solvers | Jan 24, 2024 | | CodeCode Available | 2 |
| 3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting | Dec 14, 2023 | 3DGSImage Generation | CodeCode Available | 2 |
| Simple diffusion: End-to-end diffusion for high resolution images | Jan 26, 2023 | Conditional Image GenerationDenoising | CodeCode Available | 2 |
| A Generalist Neural Algorithmic Learner | Sep 22, 2022 | Graph Neural NetworkLearning to Execute | CodeCode Available | 2 |
| AdFlush: A Real-World Deployable Machine Learning Solution for Effective Advertisement and Web Tracker Prevention | May 13, 2024 | BlockingCPU | CodeCode Available | 2 |
| AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion | Mar 10, 2025 | Video Generation | CodeCode Available | 2 |
| Can Go AIs be adversarially robust? | Jun 18, 2024 | Diversity | CodeCode Available | 2 |
| Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement | Aug 17, 2023 | Bandwidth ExtensionDecoder | CodeCode Available | 2 |
| Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition | Nov 22, 2022 | NeRFTalking Face Generation | CodeCode Available | 2 |
| Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models | Jan 30, 2023 | Audio GenerationText-to-Video Generation | CodeCode Available | 2 |
| Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models | Apr 5, 2024 | Data Augmentation | CodeCode Available | 2 |
| The Long Tail of Context: Does it Exist and Matter? | Oct 3, 2022 | Recommendation Systems | CodeCode Available | 2 |
| OneFormer3D: One Transformer for Unified Point Cloud Segmentation | Nov 24, 2023 | 3D Instance Segmentation3D Object Detection | CodeCode Available | 2 |
| GENIUS: A Generative Framework for Universal Multimodal Search | Mar 25, 2025 | Information RetrievalQuantization | CodeCode Available | 2 |
| Addressing Concept Shift in Online Time Series Forecasting: Detect-then-Adapt | Mar 22, 2024 | Data AugmentationTime Series | CodeCode Available | 2 |
| Designing Inherently Interpretable Machine Learning Models | Nov 2, 2021 | BIG-bench Machine LearningInterpretable Machine Learning | CodeCode Available | 2 |
| Prompt Injection attack against LLM-integrated Applications | Jun 8, 2023 | | CodeCode Available | 2 |
| From Flatland to Space: Teaching Vision-Language Models to Perceive and Reason in 3D | Mar 29, 2025 | Spatial Reasoning | CodeCode Available | 2 |
| Autonomous clustering by fast find of mass and distance peaks | May 13, 2024 | AstronomyClustering | CodeCode Available | 2 |
| Reconstructive Visual Instruction Tuning | Oct 12, 2024 | Denoising | CodeCode Available | 2 |
| Risk-Aware Off-Road Navigation via a Learned Speed Distribution Map | Mar 25, 2022 | Motion PlanningUnity | CodeCode Available | 2 |
| Risk-mediated dynamic regulation of effective contacts de-synchronizes outbreaks in metapopulation epidemic models | Feb 20, 2025 | | CodeCode Available | 2 |
| Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding | Sep 5, 2024 | Question AnsweringScene Understanding | CodeCode Available | 2 |
| FreeTumor: Advance Tumor Segmentation via Large-Scale Tumor Synthesis | Jun 3, 2024 | SegmentationTumor Segmentation | CodeCode Available | 2 |
| From implicit learning to explicit representations | Apr 5, 2022 | | CodeCode Available | 2 |
| Efficient Alignment of Unconditioned Action Prior for Language-conditioned Pick and Place in Clutter | Mar 12, 2025 | Zero-shot Generalization | CodeCode Available | 2 |
| Video Probabilistic Diffusion Models in Projected Latent Space | Feb 15, 2023 | Video Generation | CodeCode Available | 2 |
| PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment | Jun 27, 2023 | Camera Pose EstimationPose Estimation | CodeCode Available | 2 |
| Safety of Sampled-Data Systems with Control Barrier Functions via Approximate Discrete Time Models | Mar 22, 2022 | | CodeCode Available | 2 |
| λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space | Feb 7, 2024 | Concept AlignmentGPU | CodeCode Available | 2 |
| Disruptive Autoencoders: Leveraging Low-level features for 3D Medical Image Pre-training | Jul 31, 2023 | Organ SegmentationRepresentation Learning | CodeCode Available | 2 |
| MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare | Dec 13, 2022 | 3D Object Detection6D Pose Estimation | CodeCode Available | 2 |
| Quantifying Memorization Across Neural Language Models | Feb 15, 2022 | FairnessMemorization | CodeCode Available | 2 |
| A General Language Assistant as a Laboratory for Alignment | Dec 1, 2021 | Imitation Learning | CodeCode Available | 2 |
| Wukong: Towards a Scaling Law for Large-Scale Recommendation | Mar 4, 2024 | Language ModellingLarge Language Model | CodeCode Available | 2 |
| Fine-Grained Face Swapping via Regional GAN Inversion | Nov 25, 2022 | DisentanglementFace Swapping | CodeCode Available | 2 |
| 3DGen: Triplane Latent Diffusion for Textured Mesh Generation | Mar 9, 2023 | DiversityGPU | CodeCode Available | 2 |
| Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks | Jun 19, 2024 | DecoderLanguage Modeling | CodeCode Available | 2 |
| Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models | May 29, 2023 | Attribute | CodeCode Available | 2 |
| Situational Graphs for Robot Navigation in Structured Indoor Environments | Feb 24, 2022 | Pose EstimationRobot Navigation | CodeCode Available | 2 |
| Neural Kernel Surface Reconstruction | May 31, 2023 | Surface Reconstruction | CodeCode Available | 2 |
| MachMap: End-to-End Vectorized Solution for Compact HD-Map Construction | Jun 17, 2023 | Autonomous DrivingDecoder | CodeCode Available | 2 |
| First Place Solution of KDD Cup 2021 & OGB Large-Scale Challenge Graph Prediction Track | Jun 15, 2021 | | CodeCode Available | 2 |
| MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark | Oct 24, 2024 | | CodeCode Available | 2 |
| VidChapters-7M: Video Chapters at Scale | Sep 25, 2023 | Dense Video CaptioningNavigate | CodeCode Available | 2 |