| Identity-aware Graph Neural Networks | Jan 25, 2021 | Graph Classification, Graph Property Prediction | Code Available | 2 |
| Real-World Super-Resolution via Kernel Estimation and Noise Injection | Jun 19, 2020 | Image Super-Resolution, Super-Resolution | Code Available | 2 |
| What Matters in Learning from Offline Human Demonstrations for Robot Manipulation | Aug 6, 2021 | Imitation Learning, reinforcement-learning | Code Available | 2 |
| Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement Learning | Sep 24, 2021 | Deep Reinforcement Learning, GPU | Code Available | 2 |
| S3IM: Stochastic Structural SIMilarity and Its Unreasonable Effectiveness for Neural Fields | Aug 14, 2023 | NeRF, Novel View Synthesis | Code Available | 2 |
| LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models | Dec 8, 2022 | | Code Available | 2 |
| A Survey on Deep Learning based Time Series Analysis with Frequency Transformation | Feb 4, 2023 | Deep Learning, Time Series | Code Available | 2 |
| Retrieval Augmented Generation Evaluation in the Era of Large Language Models: A Comprehensive Survey | Apr 21, 2025 | Computational Efficiency, Information Retrieval | Code Available | 2 |
| ShortGPT: Layers in Large Language Models are More Redundant Than You Expect | Mar 6, 2024 | Quantization | Code Available | 2 |
| Movie101v2: Improved Movie Narration Benchmark | Apr 20, 2024 | Video Captioning | Code Available | 2 |
| JailBreakV: A Benchmark for Assessing the Robustness of MultiModal Large Language Models against Jailbreak Attacks | Apr 3, 2024 | LLM Jailbreak | Code Available | 2 |
| Visual Language Maps for Robot Navigation | Oct 11, 2022 | 3D Reconstruction, Image Captioning | Code Available | 2 |
| Fast Online Object Tracking and Segmentation: A Unifying Approach | Dec 12, 2018 | Object, Object Tracking | Code Available | 2 |
| Autonomous Driving on Curvy Roads Without Reliance on Frenet Frame: A Cartesian-Based Trajectory Planning Method | Feb 3, 2022 | Autonomous Driving, Collision Avoidance | Code Available | 2 |
| PLAPT: Protein-Ligand Binding Affinity Prediction Using Pretrained Transformers | Feb 8, 2024 | Drug Discovery, Prediction | Code Available | 2 |
| EmoSphere-SER: Enhancing Speech Emotion Recognition Through Spherical Representation with Auxiliary Classification | May 26, 2025 | Emotion Recognition, regression | Code Available | 2 |
| Residual and bidirectional LSTM for epileptic seizure detection | Jun 17, 2024 | EEG, Electroencephalogram (EEG) | Code Available | 2 |
| ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training | Feb 28, 2024 | In-Context Learning, Language Modeling | Code Available | 2 |
| On-Device Training Under 256KB Memory | Jun 30, 2022 | Lifelong learning, Quantization | Code Available | 2 |
| Procedure-Aware Surgical Video-language Pretraining with Hierarchical Knowledge Augmentation | Sep 30, 2024 | Cross-Modal Retrieval, Dynamic Time Warping | Code Available | 2 |
| DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing | Dec 12, 2023 | Image Generation, Image Morphing | Code Available | 2 |
| HairStep: Transfer Synthetic to Real Using Strand and Depth Maps for Single-View 3D Hair Modeling | Mar 5, 2023 | | Code Available | 2 |
| MonoSDF: Exploring Monocular Geometric Cues for Neural Implicit Surface Reconstruction | Jun 1, 2022 | 3D Reconstruction, Multi-View 3D Reconstruction | Code Available | 2 |
| Joint Discriminative and Generative Learning for Person Re-identification | Apr 15, 2019 | Image Generation, Image-to-Image Translation | Code Available | 2 |
| Thought Anchors: Which LLM Reasoning Steps Matter? | Jun 23, 2025 | counterfactual, Sentence | Code Available | 2 |
| Using Unreliable Pseudo-Labels for Label-Efficient Semantic Segmentation | Jun 4, 2023 | Semantic Segmentation | Code Available | 2 |
| Inference-Friendly Models With MixAttention | Sep 23, 2024 | | Code Available | 2 |
| FRAME: A Modular Framework for Autonomous Map Merging: Advancements in the Field | Apr 27, 2024 | Point Cloud Registration | Code Available | 2 |
| Very high resolution canopy height maps from RGB imagery using self-supervised vision transformer and convolutional decoder trained on Aerial Lidar | Apr 14, 2023 | Decoder, Self-Supervised Learning | Code Available | 2 |
| SceneTex: High-Quality Texture Synthesis for Indoor Scenes via Diffusion Priors | Nov 28, 2023 | Decoder, Texture Synthesis | Code Available | 2 |
| InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph Prior | Jul 10, 2024 | Benchmarking, Decoder | Code Available | 2 |
| GAIA-1: A Generative World Model for Autonomous Driving | Sep 29, 2023 | Autonomous Driving | Code Available | 2 |
| NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers | Apr 18, 2023 | In-Context Learning, Speech Synthesis | Code Available | 2 |
| Efficient compilation of expressive problem space specifications to neural network solvers | Jan 24, 2024 | | Code Available | 2 |
| 3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting | Dec 14, 2023 | 3DGS, Image Generation | Code Available | 2 |
| Simple diffusion: End-to-end diffusion for high resolution images | Jan 26, 2023 | Conditional Image Generation, Denoising | Code Available | 2 |
| A Generalist Neural Algorithmic Learner | Sep 22, 2022 | Graph Neural Network, Learning to Execute | Code Available | 2 |
| AdFlush: A Real-World Deployable Machine Learning Solution for Effective Advertisement and Web Tracker Prevention | May 13, 2024 | Blocking, CPU | Code Available | 2 |
| AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion | Mar 10, 2025 | Video Generation | Code Available | 2 |
| Can Go AIs be adversarially robust? | Jun 18, 2024 | Diversity | Code Available | 2 |
| Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement | Aug 17, 2023 | Bandwidth Extension, Decoder | Code Available | 2 |
| Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition | Nov 22, 2022 | NeRF, Talking Face Generation | Code Available | 2 |
| Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models | Jan 30, 2023 | Audio Generation, Text-to-Video Generation | Code Available | 2 |
| Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models | Apr 5, 2024 | Data Augmentation | Code Available | 2 |
| The Long Tail of Context: Does it Exist and Matter? | Oct 3, 2022 | Recommendation Systems | Code Available | 2 |
| OneFormer3D: One Transformer for Unified Point Cloud Segmentation | Nov 24, 2023 | 3D Instance Segmentation, 3D Object Detection | Code Available | 2 |
| GENIUS: A Generative Framework for Universal Multimodal Search | Mar 25, 2025 | Information Retrieval, Quantization | Code Available | 2 |
| Addressing Concept Shift in Online Time Series Forecasting: Detect-then-Adapt | Mar 22, 2024 | Data Augmentation, Time Series | Code Available | 2 |
| Designing Inherently Interpretable Machine Learning Models | Nov 2, 2021 | BIG-bench Machine Learning, Interpretable Machine Learning | Code Available | 2 |
| Prompt Injection attack against LLM-integrated Applications | Jun 8, 2023 | | Code Available | 2 |