| TransBTSV2: Towards Better and More Efficient Volumetric Segmentation of Medical Images | Jan 30, 2022 | Brain Tumor Segmentation, Image Segmentation | Code Available | 2 |
| Free-form language-based robotic reasoning and grasping | Mar 17, 2025 | Form, Robotic Grasping | Code Available | 2 |
| Assessment of Reinforcement Learning for Macro Placement | Feb 21, 2023 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 2 |
| SAD: Segment Any RGBD | May 23, 2023 | 3D Panoptic Segmentation, Open Vocabulary Semantic Segmentation | Code Available | 2 |
| Hybrid Convolutional and Attention Network for Hyperspectral Image Denoising | Mar 15, 2024 | Denoising, Hyperspectral Image Denoising | Code Available | 2 |
| VOS: Learning What You Don't Know by Virtual Outlier Synthesis | Feb 2, 2022 | image-classification, Image Classification | Code Available | 2 |
| A Survey of Reasoning with Foundation Models | Dec 17, 2023 | Medical Diagnosis, Survey | Code Available | 2 |
| CompletionFormer: Depth Completion with Convolutions and Vision Transformers | Apr 25, 2023 | Depth Completion, Depth Estimation | Code Available | 2 |
| Can Vehicle Motion Planning Generalize to Realistic Long-tail Scenarios? | Apr 11, 2024 | Autonomous Driving, Motion Planning | Code Available | 2 |
| Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments | Jan 23, 2022 | geo-localization, Metric Learning | Code Available | 2 |
| Pix2NeRF: Unsupervised Conditional π-GAN for Single Image to Neural Radiance Fields Translation | Feb 26, 2022 | 3D-Aware Image Synthesis, Image Generation | Code Available | 2 |
| A Physics-informed Diffusion Model for High-fidelity Flow Field Reconstruction | Nov 26, 2022 | Vocal Bursts Intensity Prediction | Code Available | 2 |
| A-Bench: Are LMMs Masters at Evaluating AI-generated Images? | Jun 5, 2024 | | Code Available | 2 |
| Inserting Anybody in Diffusion Models via Celeb Basis | Jun 1, 2023 | | Code Available | 2 |
| Sparse4D v3: Advancing End-to-End 3D Detection and Tracking | Nov 20, 2023 | Autonomous Driving, Denoising | Code Available | 2 |
| GPU Performance Portability needs Autotuning | Apr 30, 2025 | GPU | Code Available | 2 |
| Bicubic++: Slim, Slimmer, Slimmest -- Designing an Industry-Grade Super-Resolution Network | May 3, 2023 | 4k, Image Super-Resolution | Code Available | 2 |
| LimSim: A Long-term Interactive Multi-scenario Traffic Simulator | Jul 13, 2023 | Autonomous Driving | Code Available | 2 |
| LEDNet: Joint Low-light Enhancement and Deblurring in the Dark | Feb 7, 2022 | Deblurring, Low-light Image Deblurring and Enhancement | Code Available | 2 |
| DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences | Jun 5, 2024 | Autonomous Driving, Autonomous Vehicles | Code Available | 2 |
| Securing AI Agents with Information-Flow Control | May 29, 2025 | | Code Available | 2 |
| xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token | May 22, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Automated Bioinformatics Analysis via AutoBA | Sep 6, 2023 | AI Agent, Language Modeling | Code Available | 2 |
| All-In-One Metrical And Functional Structure Analysis With Neighborhood Attentions on Demixed Audio | Jul 31, 2023 | All, Downbeat Tracking | Code Available | 2 |
| DataDream: Few-shot Guided Dataset Generation | Jul 15, 2024 | Classification, Dataset Generation | Code Available | 2 |
| UDC: A Unified Neural Divide-and-Conquer Framework for Large-Scale Combinatorial Optimization Problems | Jun 29, 2024 | Combinatorial Optimization, Graph Neural Network | Code Available | 2 |
| Partial-to-Partial Shape Matching with Geometric Consistency | Apr 18, 2024 | | Code Available | 2 |
| Learning to Reason for Long-Form Story Generation | Mar 28, 2025 | Form, Math | Code Available | 2 |
| Post-Training Sparse Attention with Double Sparsity | Aug 11, 2024 | | Code Available | 2 |
| Group Robust Preference Optimization in Reward-free RLHF | May 30, 2024 | | Code Available | 2 |
| FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height Plugin | Nov 18, 2023 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| VolumetricSMPL: A Neural Volumetric Body Model for Efficient Interactions, Contacts, and Collisions | Jun 29, 2025 | Computational Efficiency, GPU | Code Available | 2 |
| Neighborhood Attention Transformer | Apr 14, 2022 | image-classification, Image Classification | Code Available | 2 |
| The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs | Jul 15, 2025 | Code Generation, Safety Alignment | Code Available | 2 |
| CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis | Jul 18, 2024 | Decision Making, Diagnostic | Code Available | 2 |
| Exploring Radar Data Representations in Autonomous Driving: A Comprehensive Review | Dec 8, 2023 | Autonomous Driving, Retrieval | Code Available | 2 |
| mFollowIR: a Multilingual Benchmark for Instruction Following in Retrieval | Jan 31, 2025 | Instruction Following, Retrieval | Code Available | 2 |
| GRID: A Platform for General Robot Intelligence Development | Oct 2, 2023 | | Code Available | 2 |
| 4D-fy: Text-to-4D Generation Using Hybrid Score Distillation Sampling | Nov 29, 2023 | | Code Available | 2 |
| Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning | Jun 26, 2023 | Hallucination, Visual Question Answering | Code Available | 2 |
| Deep Portrait Quality Assessment. A NTIRE 2024 Challenge Survey | Apr 17, 2024 | Survey | Code Available | 2 |
| OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization | Oct 25, 2024 | Imitation Learning | Code Available | 2 |
| LiDAR Snowfall Simulation for Robust 3D Object Detection | Mar 28, 2022 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| Accelerated Hierarchical Density Clustering | May 20, 2017 | Clustering | Code Available | 2 |
| EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech | Jun 12, 2024 | Emotional Speech Synthesis, text-to-speech | Code Available | 2 |
| UniTEX: Universal High Fidelity Generative Texturing for 3D Shapes | May 29, 2025 | Texture Synthesis | Code Available | 2 |
| Compressing Large Language Models using Low Rank and Low Precision Decomposition | May 29, 2024 | Quantization | Code Available | 2 |
| DIN-SQL: Decomposed In-Context Learning of Text-to-SQL with Self-Correction | Apr 21, 2023 | In-Context Learning, Text to SQL | Code Available | 2 |
| Towards Robust Multimodal Sentiment Analysis with Incomplete Data | Sep 30, 2024 | Multimodal Sentiment Analysis, Sentiment Analysis | Code Available | 2 |
| Repo2Run: Automated Building Executable Environment for Code Repository at Scale | Feb 19, 2025 | | Code Available | 2 |