| Explaining CLIP Zero-shot Predictions Through Concepts | Mar 30, 2026 | | —Unverified | 0 |
| RAWIC: Bit-Depth Adaptive Lossless Raw Image Compression | Mar 30, 2026 | | —Unverified | 0 |
| MedLoc-R1: Performance-Aware Curriculum Reward Scheduling for GRPO-Based Medical Visual Grounding | Mar 30, 2026 | | —Unverified | 0 |
| A Closer Look at Cross-Domain Few-Shot Object Detection: Fine-Tuning Matters and Parallel Decoder Helps | Mar 30, 2026 | | —Unverified | 0 |
| CPUBone: Efficient Vision Backbone Design for Devices with Low Parallelization Capabilities | Mar 30, 2026 | | —Unverified | 0 |
| FlashSign: Pose-Free Guidance for Efficient Sign Language Video Generation | Mar 30, 2026 | | —Unverified | 0 |
| ForestSim: A Synthetic Benchmark for Intelligent Vehicle Perception in Unstructured Forest Environments | Mar 30, 2026 | | —Unverified | 0 |
| EnsemJudge: Enhancing Reliability in Chinese LLM-Generated Text Detection through Diverse Model Ensembles | Mar 30, 2026 | | —Unverified | 0 |
| Hg-I2P: Bridging Modalities for Generalizable Image-to-Point-Cloud Registration via Heterogeneous Graphs | Mar 30, 2026 | | —Unverified | 0 |
| Drift-AR: Single-Step Visual Autoregressive Generation via Anti-Symmetric Drifting | Mar 30, 2026 | | —Unverified | 0 |
| InkDrop: Invisible Backdoor Attacks Against Dataset Condensation | Mar 30, 2026 | | —Unverified | 0 |
| AutoDrive-P^3: Unified Chain of Perception-Prediction-Planning Thought via Reinforcement Fine-Tuning | Mar 30, 2026 | | —Unverified | 0 |
| MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios | Mar 30, 2026 | | —Unverified | 0 |
| Robust Remote Sensing Image-Text Retrieval with Noisy Correspondence | Mar 30, 2026 | | —Unverified | 0 |
| TwinMixing: A Shuffle-Aware Feature Interaction Model for Multi-Task Segmentation | Mar 30, 2026 | | —Unverified | 0 |
| Reasoning as Energy Minimization over Structured Latent Trajectories | Mar 30, 2026 | | —Unverified | 0 |
| Prototype-Enhanced Multi-View Learning for Thyroid Nodule Ultrasound Classification | Mar 30, 2026 | | —Unverified | 0 |
| FairGC: Fairness-aware Graph Condensation | Mar 30, 2026 | | —Unverified | 0 |
| INSID3: Training-Free In-Context Segmentation with DINOv3 | Mar 30, 2026 | | —Unverified | 0 |
| GEditBench v2: A Human-Aligned Benchmark for General Image Editing | Mar 30, 2026 | | —Unverified | 0 |
| ResAdapt: Adaptive Resolution for Efficient Multimodal Reasoning | Mar 30, 2026 | | —Unverified | 0 |
| TGIF2: Extended Text-Guided Inpainting Forgery Dataset & Benchmark | Mar 30, 2026 | | —Unverified | 0 |
| AdaptToken: Entropy-based Adaptive Token Selection for MLLM Long Video Understanding | Mar 30, 2026 | | —Unverified | 0 |
| DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing | Mar 30, 2026 | | —Unverified | 0 |
| ParaSpeechCLAP: A Dual-Encoder Speech-Text Model for Rich Stylistic Language-Audio Pretraining | Mar 30, 2026 | | —Unverified | 0 |
| Rethinking Language Model Scaling under Transferable Hypersphere Optimization | Mar 30, 2026 | | —Unverified | 0 |
| Adaptive Block-Scaled Data Types | Mar 30, 2026 | | —Unverified | 0 |
| HandX: Scaling Bimanual Motion and Interaction Generation | Mar 30, 2026 | | —Unverified | 0 |
| Gen-Searcher: Reinforcing Agentic Search for Image Generation | Mar 30, 2026 | | —Unverified | 0 |
| NeiGAD: Augmenting Graph Anomaly Detection via Spectral Neighbor Information | Mar 30, 2026 | | —Unverified | 0 |
| LIBERO-Para: A Diagnostic Benchmark and Metrics for Paraphrase Robustness in VLA Models | Mar 30, 2026 | | —Unverified | 0 |
| Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization | Mar 30, 2026 | | —Unverified | 0 |
| Courtroom-Style Multi-Agent Debate with Progressive RAG and Role-Switching for Controversial Claim Verification | Mar 30, 2026 | | —Unverified | 0 |
| GraphWalker: Agentic Knowledge Graph Question Answering via Synthetic Trajectory Curriculum | Mar 30, 2026 | | —Unverified | 0 |
| ORSIFlow: Saliency-Guided Rectified Flow for Optical Remote Sensing Salient Object Detection | Mar 30, 2026 | | —Unverified | 0 |
| ELViS: Efficient Visual Similarity from Local Descriptors that Generalizes Across Domains | Mar 30, 2026 | | —Unverified | 0 |
| Industrial3D: A Terrestrial LiDAR Point Cloud Dataset and CrossParadigm Benchmark for Industrial Infrastructure | Mar 30, 2026 | | —Unverified | 0 |
| WAFT-Stereo: Warping-Alone Field Transforms for Stereo Matching | Mar 30, 2026 | | —Unverified | 0 |
| Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models | Mar 30, 2026 | | —Unverified | 0 |
| CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence | Mar 30, 2026 | | —Unverified | 0 |
| ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks | Mar 29, 2026 | | —Unverified | 0 |
| RSR-core: A High-Performance Engine for Low-Bit Matrix-Vector Multiplication | Mar 29, 2026 | | —Unverified | 0 |
| KV Cache Quantization for Self-Forcing Video Generation: A 33-Method Empirical Study | Mar 29, 2026 | | —Unverified | 0 |
| Learning to Focus and Precise Cropping: A Reinforcement Learning Framework with Information Gaps and Grounding Loss for MLLMs | Mar 29, 2026 | | —Unverified | 0 |
| Streamlined Open-Vocabulary Human-Object Interaction Detection | Mar 29, 2026 | | —Unverified | 0 |
| Q-BIOLAT: Binary Latent Protein Fitness Landscapes for QUBO-Based Optimization | Mar 29, 2026 | | —Unverified | 0 |
| OpenDPR: Open-Vocabulary Change Detection via Vision-Centric Diffusion-Guided Prototype Retrieval for Remote Sensing Imagery | Mar 29, 2026 | | —Unverified | 0 |
| PRBench: End-to-end Paper Reproduction in Physics Research | Mar 29, 2026 | | —Unverified | 0 |
| RHO: Robust Holistic OSM-Based Metric Cross-View Geo-Localization | Mar 29, 2026 | | —Unverified | 0 |
| GS3LAM: Gaussian Semantic Splatting SLAM | Mar 29, 2026 | | —Unverified | 0 |