| MuSEAgent: A Multimodal Reasoning Agent with Stateful Experiences | Mar 29, 2026 | | —Unverified | 0 |
| Benchmarking Multi-View BEV Object Detection with Mixed Pinhole and Fisheye Cameras | Mar 29, 2026 | | —Unverified | 0 |
| Spatial Orthogonal Refinement for Robust RGB-Event Visual Object Tracking | Mar 29, 2026 | | —Unverified | 0 |
| Can LLMs Beat Classical Hyperparameter Optimization Algorithms? A Study on autoresearch | Mar 29, 2026 | | —Unverified | 0 |
| BCMDA: Bidirectional Correlation Maps Domain Adaptation for Mixed Domain Semi-Supervised Medical Image Segmentation | Mar 29, 2026 | | —Unverified | 0 |
| SGS-Intrinsic: Semantic-Invariant Gaussian Splatting for Sparse-View Indoor Inverse Rendering | Mar 29, 2026 | | —Unverified | 0 |
| SPROUT: A Scalable Diffusion Foundation Model for Agricultural Vision | Mar 29, 2026 | | —Unverified | 0 |
| OmniColor: A Unified Framework for Multi-modal Lineart Colorization | Mar 29, 2026 | | —Unverified | 0 |
| LongCat-Next: Lexicalizing Modalities as Discrete Tokens | Mar 29, 2026 | | —Unverified | 0 |
| FlowRL: A Taxonomy and Modular Framework for Reinforcement Learning with Diffusion Policies | Mar 29, 2026 | | —Unverified | 0 |
| Emergent Social Intelligence Risks in Generative Multi-Agent Systems | Mar 29, 2026 | | —Unverified | 0 |
| Dual-Path Learning based on Frequency Structural Decoupling and Regional-Aware Fusion for Low-Light Image Super-Resolution | Mar 28, 2026 | | —Unverified | 0 |
| EpochX: Building the Infrastructure for an Emergent Agent Civilization | Mar 28, 2026 | | —Unverified | 0 |
| HMPDM: A Diffusion Model for Driving Video Prediction with Historical Motion Priors | Mar 28, 2026 | | —Unverified | 0 |
| Decompose, Mix, Adapt: A Unified Framework for Parameter-Efficient Neural Network Recombination and Compression | Mar 28, 2026 | | —Unverified | 0 |
| Diagnosing Non-Markovian Observations in Reinforcement Learning via Prediction-Based Violation Scoring | Mar 28, 2026 | | —Unverified | 0 |
| Inference-Time Structural Reasoning for Compositional Vision-Language Understanding | Mar 28, 2026 | | —Unverified | 0 |
| NimbusGS: Unified 3D Scene Reconstruction under Hybrid Weather | Mar 28, 2026 | | —Unverified | 0 |
| TrackMAE: Video Representation Learning via Track Mask and Predict | Mar 28, 2026 | | —Unverified | 0 |
| Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models | Mar 28, 2026 | | —Unverified | 0 |
| Text Data Integration | Mar 28, 2026 | | —Unverified | 0 |
| Structural Graph Probing of Vision-Language Models | Mar 28, 2026 | | —Unverified | 0 |
| LightCtrl: Training-free Controllable Video Relighting | Mar 28, 2026 | | —Unverified | 0 |
| Reasoning-Driven Anomaly Detection and Localization with Image-Level Supervision | Mar 28, 2026 | | —Unverified | 0 |
| Communicating about Space: Language-Mediated Spatial Integration Across Partial Views | Mar 28, 2026 | | —Unverified | 0 |
| Understanding and Mitigating Hallucinations in Multimodal Chain-of-Thought Models | Mar 28, 2026 | | —Unverified | 0 |
| VIRST: Video-Instructed Reasoning Assistant for SpatioTemporal Segmentation | Mar 28, 2026 | | —Unverified | 0 |
| RailVQA: A Benchmark and Framework for Efficient Interpretable Visual Cognition in Automatic Train Operation | Mar 28, 2026 | | —Unverified | 0 |
| DiffSoup: Direct Differentiable Rasterization of Triangle Soup for Extreme Radiance Field Simplification | Mar 28, 2026 | | —Unverified | 0 |
| DRUM: Diffusion-based Raydrop-aware Unpaired Mapping for Sim2Real LiDAR Segmentation | Mar 27, 2026 | | —Unverified | 0 |
| FairLLaVA: Fairness-Aware Parameter-Efficient Fine-Tuning for Large Vision-Language Assistants | Mar 27, 2026 | | —Unverified | 0 |
| Seeing Like Radiologists: Context- and Gaze-Guided Vision-Language Pretraining for Chest X-rays | Mar 27, 2026 | | —Unverified | 0 |
| Provably Contractive and High-Quality Denoisers for Convergent Restoration | Mar 27, 2026 | | —Unverified | 0 |
| Consistency Beyond Contrast: Enhancing Open-Vocabulary Object Detection Robustness via Contextual Consistency Learning | Mar 27, 2026 | | —Unverified | 0 |
| DUGAE: Unified Geometry and Attribute Enhancement via Spatiotemporal Correlations for G-PCC Compressed Dynamic Point Clouds | Mar 27, 2026 | | —Unverified | 0 |
| Topology-Aware Graph Reinforcement Learning for Energy Storage Systems Optimal Dispatch in Distribution Networks | Mar 27, 2026 | | —Unverified | 0 |
| Reflect to Inform: Boosting Multimodal Reasoning via Information-Gain-Driven Verification | Mar 27, 2026 | | —Unverified | 0 |
| Conditional Diffusion for 3D CT Volume Reconstruction from 2D X-rays | Mar 27, 2026 | | —Unverified | 0 |
| Beyond MACs: Hardware Efficient Architecture Design for Vision Backbones | Mar 27, 2026 | | —Unverified | 0 |
| From Synthetic Data to Real Restorations: Diffusion Model for Patient-specific Dental Crown Completion | Mar 27, 2026 | | —Unverified | 0 |
| Zero-Shot Depth from Defocus | Mar 27, 2026 | | —Unverified | 0 |
| Dual-branch Graph Domain Adaptation for Cross-scenario Multi-modal Emotion Recognition | Mar 27, 2026 | | —Unverified | 0 |
| VAN-AD: Visual Masked Autoencoder with Normalizing Flow For Time Series Anomaly Detection | Mar 27, 2026 | | —Unverified | 0 |
| TTE-CAM: Built-in Class Activation Maps for Test-Time Explainability in Pretrained Black-Box CNNs | Mar 27, 2026 | | —Unverified | 0 |
| A Provable Energy-Guided Test-Time Defense Boosting Adversarial Robustness of Large Vision-Language Models | Mar 27, 2026 | | —Unverified | 0 |
| GUIDED: Granular Understanding via Identification, Detection, and Discrimination for Fine-Grained Open-Vocabulary Object Detection | Mar 27, 2026 | | —Unverified | 0 |
| TAPS: Task Aware Proposal Distributions for Speculative Sampling | Mar 27, 2026 | | —Unverified | 0 |
| MOOZY: A Patient-First Foundation Model for Computational Pathology | Mar 27, 2026 | | —Unverified | 0 |
| mSFT: Addressing Dataset Mixtures Overfitting Heterogeneously in Multi-task SFT | Mar 27, 2026 | | —Unverified | 0 |
| A Human-Inspired Decoupled Architecture for Efficient Audio Representation Learning | Mar 27, 2026 | | —Unverified | 0 |