| Can Large Multimodal Models Inspect Buildings? A Hierarchical Benchmark for Structural Pathology Reasoning | Mar 20, 2026 | | —Unverified | 0 |
| Improving Generalization on Cybersecurity Tasks with Multi-Modal Contrastive Learning | Mar 20, 2026 | | —Unverified | 0 |
| Enhancing Hyperspace Analogue to Language (HAL) Representations via Attention-Based Pooling for Text Classification | Mar 20, 2026 | | —Unverified | 0 |
| Design-OS: A Specification-Driven Framework for Engineering System Design with a Control-Systems Design Case | Mar 20, 2026 | | —Unverified | 0 |
| Beyond Single Tokens: Distilling Discrete Diffusion Models via Discrete MMD | Mar 20, 2026 | | —Unverified | 0 |
| Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models | Mar 20, 2026 | | —Unverified | 0 |
| Evaluating Evidence Grounding Under User Pressure in Instruction-Tuned Language Models | Mar 20, 2026 | | —Unverified | 0 |
| The Robot's Inner Critic: Self-Refinement of Social Behaviors through VLM-based Replanning | Mar 20, 2026 | | —Unverified | 0 |
| EgoForge: Goal-Directed Egocentric World Simulator | Mar 20, 2026 | | —Unverified | 0 |
| Learning Dynamic Belief Graphs for Theory-of-mind Reasoning | Mar 20, 2026 | | —Unverified | 0 |
| TinyML Enhances CubeSat Mission Capabilities | Mar 20, 2026 | | —Unverified | 0 |
| LagerNVS: Latent Geometry for Fully Neural Real-time Novel View Synthesis | Mar 20, 2026 | | —Unverified | 0 |
| AI Agents Can Already Autonomously Perform Experimental High Energy Physics | Mar 20, 2026 | | —Unverified | 0 |
| Adaptive Greedy Frame Selection for Long Video Understanding | Mar 20, 2026 | | —Unverified | 0 |
| VideoSeek: Long-Horizon Video Agent with Tool-Guided Seeking | Mar 20, 2026 | | —Unverified | 0 |
| Improving Image-to-Image Translation via a Rectified Flow Reformulation | Mar 20, 2026 | | —Unverified | 0 |
| MeanFlow Meets Control: Scaling Sampled-Data Control for Swarms | Mar 20, 2026 | | —Unverified | 0 |
| Deterministic Mode Proposals: An Efficient Alternative to Generative Sampling for Ambiguous Segmentation | Mar 20, 2026 | | —Unverified | 0 |
| LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation | Mar 20, 2026 | | —Unverified | 0 |
| MME-CoF-Pro: Evaluating Reasoning Coherence in Video Generative Models with Text and Visual Hints | Mar 20, 2026 | | —Unverified | 0 |
| Graph-Informed Adversarial Modeling: Infimal Subadditivity of Interpolative Divergences | Mar 20, 2026 | | —Unverified | 0 |
| Layered Quantum Architecture Search for 3D Point Cloud Classification | Mar 20, 2026 | | —Unverified | 0 |
| Enhancing Alignment for Unified Multimodal Models via Semantically-Grounded Supervision | Mar 20, 2026 | | —Unverified | 0 |
| A Super Fast K-means for Indexing Vector Embeddings | Mar 20, 2026 | | CodeCode Available | 1 |
| Dual Prompt-Driven Feature Encoding for Nighttime UAV Tracking | Mar 20, 2026 | | CodeCode Available | 0 |
| DynFlowDrive: Flow-Based Dynamic World Modeling for Autonomous Driving | Mar 20, 2026 | | CodeCode Available | 0 |
| Vision-Language Attribute Disentanglement and Reinforcement for Lifelong Person Re-Identification | Mar 20, 2026 | | CodeCode Available | 0 |
| Unbiased Dynamic Multimodal Fusion | Mar 20, 2026 | | CodeCode Available | 0 |
| Demographic-Aware Self-Supervised Anomaly Detection Pretraining for Equitable Rare Cardiac Diagnosis | Mar 20, 2026 | | CodeCode Available | 0 |
| BALM: A Model-Agnostic Framework for Balanced Multimodal Learning under Imbalanced Missing Rates | Mar 20, 2026 | | CodeCode Available | 0 |
| ReManNet: A Riemannian Manifold Network for Monocular 3D Lane Detection | Mar 20, 2026 | | CodeCode Available | 0 |
| IsoCLIP: Decomposing CLIP Projectors for Efficient Intra-modal Alignment | Mar 20, 2026 | | CodeCode Available | 0 |
| What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time | Mar 20, 2026 | | CodeCode Available | 0 |
| Deep Autocorrelation Modeling for Time-Series Forecasting: Progress and Prospects | Mar 20, 2026 | | CodeCode Available | 0 |
| Learning Like Humans: Analogical Concept Learning for Generalized Category Discovery | Mar 20, 2026 | | CodeCode Available | 0 |
| MedSPOT: A Workflow-Aware Sequential Grounding Benchmark for Clinical GUI | Mar 20, 2026 | | CodeCode Available | 0 |
| CFCML: A Coarse-to-Fine Crossmodal Learning Framework For Disease Diagnosis Using Multimodal Images and Tabular Data | Mar 20, 2026 | | CodeCode Available | 0 |
| Kolmogorov-Arnold causal generative models | Mar 20, 2026 | | CodeCode Available | 0 |
| MuSteerNet: Human Reaction Generation from Videos via Observation-Reaction Mutual Steering | Mar 20, 2026 | | CodeCode Available | 0 |
| Wildfire Spread Scenarios: Increasing Sample Diversity of Segmentation Diffusion Models with Training-Free Methods | Mar 20, 2026 | | CodeCode Available | 0 |
| CoVR-R:Reason-Aware Composed Video Retrieval | Mar 20, 2026 | | CodeCode Available | 0 |
| From Masks to Pixels and Meaning: A New Taxonomy, Benchmark, and Metrics for VLM Image Tampering | Mar 20, 2026 | | CodeCode Available | 0 |
| EvidenceRL: Reinforcing Evidence Consistency for Trustworthy Language Models | Mar 20, 2026 | | CodeCode Available | 0 |
| PFM-VEPAR: Prompting Foundation Models for RGB-Event Camera based Pedestrian Attribute Recognition | Mar 20, 2026 | | CodeCode Available | 0 |
| CurveStream: Boosting Streaming Video Understanding in MLLMs via Curvature-Aware Hierarchical Visual Memory Management | Mar 20, 2026 | | CodeCode Available | 0 |
| MagicSeg: Open-World Segmentation Pretraining via Counterfactural Diffusion-Based Auto-Generation | Mar 20, 2026 | | CodeCode Available | 0 |
| Semantic Audio-Visual Navigation in Continuous Environments | Mar 20, 2026 | | CodeCode Available | 0 |
| RouterKGQA: Specialized--General Model Routing for Constraint-Aware Knowledge Graph Question Answering | Mar 20, 2026 | | CodeCode Available | 0 |
| Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs | Mar 20, 2026 | | CodeCode Available | 0 |
| MFil-Mamba: Multi-Filter Scanning for Spatial Redundancy-Aware Visual State Space Models | Mar 20, 2026 | | CodeCode Available | 0 |