| Graphs Meet AI Agents: Taxonomy, Progress, and Future Opportunities | Jun 22, 2025 | Reinforcement Learning (RL) | CodeCode Available | 2 |
| Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation | Feb 28, 2024 | Semantic SegmentationTAG | CodeCode Available | 2 |
| Three New Validators and a Large-Scale Benchmark Ranking for Unsupervised Domain Adaptation | Aug 15, 2022 | Domain AdaptationUnsupervised Domain Adaptation | CodeCode Available | 2 |
| LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents | Feb 13, 2024 | BenchmarkingModel Selection | CodeCode Available | 2 |
| Learning from All Vehicles | Mar 22, 2022 | AllAutonomous Driving | CodeCode Available | 2 |
| LambdaNetworks: Modeling Long-Range Interactions Without Attention | Feb 17, 2021 | image-classificationImage Classification | CodeCode Available | 2 |
| Next Patch Prediction for Autoregressive Visual Generation | Dec 19, 2024 | Image GenerationPrediction | CodeCode Available | 2 |
| The Stable Artist: Steering Semantics in Diffusion Latent Space | Dec 12, 2022 | Image Generation | CodeCode Available | 2 |
| LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters | May 27, 2024 | BenchmarkingGSM8K | CodeCode Available | 2 |
| PA-LLaVA: A Large Language-Vision Assistant for Human Pathology Image Understanding | Aug 18, 2024 | Language ModellingQuestion Answering | CodeCode Available | 2 |
| SegViTv2: Exploring Efficient and Continual Semantic Segmentation with Plain Vision Transformers | Jun 9, 2023 | Continual LearningContinual Semantic Segmentation | CodeCode Available | 2 |
| CroCo: Self-Supervised Pre-training for 3D Vision Tasks by Cross-View Completion | Oct 19, 2022 | Camera Pose EstimationDepth Estimation | CodeCode Available | 2 |
| Optimization Methods for Personalizing Large Language Models through Retrieval Augmentation | Apr 9, 2024 | Knowledge DistillationLanguage Modeling | CodeCode Available | 2 |
| Active Generalized Category Discovery | Mar 7, 2024 | Active Learningimbalanced classification | CodeCode Available | 2 |
| COLD: A Benchmark for Chinese Offensive Language Detection | Jan 16, 2022 | | CodeCode Available | 2 |
| Accurate and Efficient Stereo Matching via Attention Concatenation Volume | Sep 23, 2022 | Stereo Matching | CodeCode Available | 2 |
| Griffin: Aerial-Ground Cooperative Detection and Tracking Dataset and Benchmark | Mar 10, 2025 | Autonomous DrivingBenchmarking | CodeCode Available | 2 |
| PFGM++: Unlocking the Potential of Physics-Inspired Generative Models | Feb 8, 2023 | Image GenerationPlaying the Game of 2048 | CodeCode Available | 2 |
| SocialCircle+: Learning the Angle-based Conditioned Interaction Representation for Pedestrian Trajectory Prediction | Sep 23, 2024 | counterfactualPedestrian Trajectory Prediction | CodeCode Available | 2 |
| CAMAv2: A Vision-Centric Approach for Static Map Element Annotation | Jul 31, 2024 | | CodeCode Available | 2 |
| Depth Information Assisted Collaborative Mutual Promotion Network for Single Image Dehazing | Mar 2, 2024 | Depth EstimationImage Dehazing | CodeCode Available | 2 |
| Text Image Inpainting via Global Structure-Guided Diffusion Models | Jan 26, 2024 | Image InpaintingScene Text Recognition | CodeCode Available | 2 |
| Robot Trajectron: Trajectory Prediction-based Shared Control for Robot Manipulation | Feb 4, 2024 | PositionRobot Manipulation | CodeCode Available | 2 |
| RoboDepth: Robust Out-of-Distribution Depth Estimation under Corruptions | Oct 23, 2023 | Autonomous DrivingDepth Estimation | CodeCode Available | 2 |
| A Survey on Deep Neural Network Pruning-Taxonomy, Comparison, Analysis, and Recommendations | Aug 13, 2023 | Adversarial RobustnessNetwork Pruning | CodeCode Available | 2 |
| Scaling Laws for Galaxy Images | Apr 3, 2024 | Domain Adaptation | CodeCode Available | 2 |
| UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase | Sep 11, 2023 | 3D Semantic SegmentationLIDAR Semantic Segmentation | CodeCode Available | 2 |
| Facial Appearance Capture at Home with Patch-Level Reflectance Prior | Jun 4, 2025 | | CodeCode Available | 2 |
| GeoDrive: 3D Geometry-Informed Driving World Model with Precise Action Control | May 28, 2025 | 3D geometryAutonomous Driving | CodeCode Available | 2 |
| Explicitly Guided Information Interaction Network for Cross-modal Point Cloud Completion | Jul 3, 2024 | Point Cloud Completion | CodeCode Available | 2 |
| CodePDE: An Inference Framework for LLM-driven PDE Solver Generation | May 13, 2025 | Code Generation | CodeCode Available | 2 |
| Pan-Mamba: Effective pan-sharpening with State Space Model | Feb 19, 2024 | MambaPansharpening | CodeCode Available | 2 |
| Deform3DGS: Flexible Deformation for Fast Surgical Scene Reconstruction with Gaussian Splatting | May 28, 2024 | | CodeCode Available | 2 |
| HoTPP Benchmark: Are We Good at the Long Horizon Events Forecasting? | Jun 20, 2024 | BenchmarkingPoint Processes | CodeCode Available | 2 |
| Correspondence-Free Non-Rigid Point Set Registration Using Unsupervised Clustering Analysis | Jun 27, 2024 | Clustering | CodeCode Available | 2 |
| A generalizable 3D framework and model for self-supervised learning in medical imaging | Jan 20, 2025 | Medical Image SegmentationSelf-Supervised Learning | CodeCode Available | 2 |
| Point Cloud Forecasting as a Proxy for 4D Occupancy Forecasting | Feb 25, 2023 | Motion Planning | CodeCode Available | 2 |
| On the State of NLP Approaches to Modeling Depression in Social Media: A Post-COVID-19 Outlook | Oct 11, 2024 | EthicsFairness | CodeCode Available | 2 |
| On the Feasibility of Using LLMs to Autonomously Execute Multi-host Network Attacks | Jan 27, 2025 | | CodeCode Available | 2 |
| All-In-One Medical Image Restoration via Task-Adaptive Routing | May 30, 2024 | AllDenoising | CodeCode Available | 2 |
| One Net to Rule Them All: Domain Randomization in Quadcopter Racing Across Different Platforms | Apr 30, 2025 | All | CodeCode Available | 2 |
| FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation | Jul 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models | May 20, 2025 | Video CompressionVideo Understanding | CodeCode Available | 2 |
| Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning | May 9, 2024 | parameter-efficient fine-tuningVisual Prompting | CodeCode Available | 2 |
| PET-SQL: A Prompt-Enhanced Two-Round Refinement of Text-to-SQL with Cross-consistency | Mar 13, 2024 | In-Context LearningText to SQL | CodeCode Available | 2 |
| Depth-Aware Generative Adversarial Network for Talking Head Video Generation | Mar 13, 2022 | 3D geometryGenerative Adversarial Network | CodeCode Available | 2 |
| CreatiPoster: Towards Editable and Controllable Multi-Layer Graphic Design Generation | Jun 12, 2025 | | CodeCode Available | 2 |
| Reverse Forward Curriculum Learning for Extreme Sample and Demonstration Efficiency in Reinforcement Learning | May 6, 2024 | Reinforcement Learning (RL) | CodeCode Available | 2 |
| MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions | Aug 16, 2023 | Motion Expressions Guided Video SegmentationObject | CodeCode Available | 2 |
| SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer | Apr 4, 2024 | motion predictionNeRF | CodeCode Available | 2 |