| Test-Time Domain Generalization via Universe Learning: A Multi-Graph Matching Approach for Medical Image Segmentation | Mar 17, 2025 | Domain Adaptation, Domain Generalization | Code Available | 2 |
| DTGBrepGen: A Novel B-rep Generative Model through Decoupling Topology and Geometry | Mar 17, 2025 | valid | Code Available | 2 |
| Learning to Detect Multi-class Anomalies with Just One Normal Image Prompt | May 14, 2025 | Anomaly Detection, Anomaly Segmentation | Code Available | 2 |
| Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models | May 15, 2025 | Math, reinforcement-learning | Code Available | 2 |
| Relational Graph Transformer | May 16, 2025 | Graph Neural Network | Code Available | 2 |
| AdaptThink: Reasoning Models Can Learn When to Think | May 19, 2025 | Math | Code Available | 2 |
| AD-AGENT: A Multi-agent Framework for End-to-end Anomaly Detection | May 19, 2025 | Anomaly Detection, Code Generation | Code Available | 2 |
| FlightGPT: Towards Generalizable and Interpretable UAV Vision-and-Language Navigation with Vision-Language Models | May 19, 2025 | Disaster Response, Vision and Language Navigation | Code Available | 2 |
| GUI-explorer: Autonomous Exploration and Mining of Transition-aware Knowledge for GUI Agent | May 22, 2025 | | Code Available | 2 |
| Ranked Entropy Minimization for Continual Test-Time Adaptation | May 22, 2025 | Test-time Adaptation | Code Available | 2 |
| Training Long-Context LLMs Efficiently via Chunk-wise Optimization | May 22, 2025 | 16k, GPU | Code Available | 2 |
| Training-Free Multi-Step Audio Source Separation | May 26, 2025 | Audio Source Separation, Denoising | Code Available | 2 |
| Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning | May 26, 2025 | Decision Making, Hierarchical Reinforcement Learning | Code Available | 2 |
| WeatherEdit: Controllable Weather Editing with 4D Gaussian Field | May 26, 2025 | 3D Generation, 3DGS | Code Available | 2 |
| HyperMotion: DiT-Based Pose-Guided Human Image Animation of Complex Motions | May 29, 2025 | Image Animation, Video Generation | Code Available | 2 |
| Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization and Temporal Motion Modulation | May 29, 2025 | Portrait Animation, Video Alignment | Code Available | 2 |
| TC-GS: A Faster Gaussian Splatting Module Utilizing Tensor Cores | May 30, 2025 | 3DGS | Code Available | 2 |
| When Large Multimodal Models Confront Evolving Knowledge: Challenges and Pathways | May 30, 2025 | Continual Learning, Image Augmentation | Code Available | 2 |
| ViStoryBench: Comprehensive Benchmark Suite for Story Visualization | May 30, 2025 | Story Visualization | Code Available | 2 |
| Hogwild! Inference: Parallel LLM Generation via Concurrent Attention | Apr 8, 2025 | | Code Available | 2 |
| DualMap: Online Open-Vocabulary Semantic Mapping for Natural Language Navigation in Dynamic Changing Scenes | Jun 2, 2025 | Natural Language Queries, Navigate | Code Available | 2 |
| Savage-Dickey density ratio estimation with normalizing flows for Bayesian model comparison | Jun 4, 2025 | Density Ratio Estimation | Code Available | 2 |
| VideoMolmo: Spatio-Temporal Grounding Meets Pointing | Jun 5, 2025 | Autonomous Driving, Autonomous Navigation | Code Available | 2 |
| ORV: 4D Occupancy-centric Robot Video Generation | Jun 3, 2025 | Video Generation | Code Available | 2 |
| Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better | Jun 10, 2025 | Image Generation | Code Available | 2 |
| Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction | Jun 9, 2025 | Reinforcement Learning (RL) | Code Available | 2 |
| Urban1960SatSeg: Unsupervised Semantic Segmentation of Mid-20th century Urban Landscapes with Satellite Imageries | Jun 11, 2025 | Segmentation, Self-Supervised Learning | Code Available | 2 |
| UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting | Jun 11, 2025 | Diversity, Representation Learning | Code Available | 2 |
| CausalVQA: A Physically Grounded Causal Reasoning Benchmark for Video Models | Jun 11, 2025 | counterfactual, Descriptive | Code Available | 2 |
| Language Modeling by Language Models | Jun 25, 2025 | Code Generation, Language Modeling | Code Available | 2 |
| PocketVina Enables Scalable and Highly Accurate Physically Valid Docking through Multi-Pocket Conditioning | Jun 24, 2025 | Benchmarking, Drug Discovery | Code Available | 2 |
| LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS | Nov 28, 2023 | Knowledge Distillation, NeRF | Code Available | 2 |
| RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking | Jun 20, 2025 | 6D Pose Estimation, Object | Code Available | 2 |
| SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning | Jun 18, 2025 | Caption Generation, Descriptive | Code Available | 2 |
| AnyCalib: On-Manifold Learning for Model-Agnostic Single-View Camera Calibration | Mar 16, 2025 | Camera Calibration | Code Available | 2 |
| Learning to See in the Extremely Dark | Jun 26, 2025 | Denoising, Exposure Correction | Code Available | 2 |
| Closed-form Continuous-time Neural Models | Jun 25, 2021 | Form, Sentiment Analysis | Code Available | 2 |
| Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning | Jan 25, 2025 | Answer Generation, Multi-agent Reinforcement Learning | Code Available | 2 |
| When Language Model Meets Private Library | Oct 31, 2022 | Code Generation, Language Modeling | Code Available | 2 |
| H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models | Jun 24, 2023 | GPU | Code Available | 2 |
| MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning | Apr 14, 2025 | Machine Translation, Reinforcement Learning (RL) | Code Available | 2 |
| Visual Reinforcement Learning with Imagined Goals | Jul 12, 2018 | reinforcement-learning, Reinforcement Learning | Code Available | 2 |
| Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics | Nov 18, 2024 | Vision-Language-Action | Code Available | 2 |
| The Replica Dataset: A Digital Replica of Indoor Spaces | Jun 13, 2019 | 3D Scene Reconstruction, Instruction Following | Code Available | 2 |
| Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language Model | Mar 8, 2025 | Image Quality Assessment, Language Modeling | Code Available | 2 |
| Multi-Objective Molecule Generation using Interpretable Substructures | Feb 8, 2020 | Diversity, Drug Design | Code Available | 2 |
| Neural Network Compression Framework for fast model inference | Feb 20, 2020 | Binarization, CPU | Code Available | 2 |
| Towards Backdoor Attacks and Defense in Robust Machine Learning Models | Feb 25, 2020 | BIG-bench Machine Learning, Clustering | Code Available | 2 |
| Adversarial Attacks and Defenses on Graphs: A Review, A Tool and Empirical Studies | Mar 2, 2020 | Adversarial Attack | Code Available | 2 |
| On the Planning Abilities of Large Language Models - A Critical Investigation | Sep 21, 2023 | | Code Available | 2 |