| DeepHistoViT: An Interpretable Vision Transformer Framework for Histopathological Cancer Classification | Mar 12, 2026 | | —Unverified | 0 |
| GPT4o-Receipt: A Dataset and Human Study for AI-Generated Document Forensics | Mar 12, 2026 | | —Unverified | 0 |
| MDS-VQA: Model-Informed Data Selection for Video Quality Assessment | Mar 12, 2026 | | —Unverified | 0 |
| CFD-HAR: User-controllable Privacy through Conditional Feature Disentanglement | Mar 12, 2026 | | —Unverified | 0 |
| MANSION: Multi-floor lANguage-to-3D Scene generatIOn for loNg-horizon tasks | Mar 12, 2026 | | —Unverified | 0 |
| Streaming Translation and Transcription Through Speech-to-Text Causal Alignment | Mar 12, 2026 | | —Unverified | 0 |
| EnTransformer: A Deep Generative Transformer for Multivariate Probabilistic Forecasting | Mar 12, 2026 | | —Unverified | 0 |
| InSpatio-WorldFM: An Open-Source Real-Time Generative Frame Model | Mar 12, 2026 | | —Unverified | 0 |
| ForensicZip: More Tokens are Better but Not Necessary in Forensic Vision-Language Models | Mar 12, 2026 | | —Unverified | 0 |
| RDNet: Region Proportion-Aware Dynamic Adaptive Salient Object Detection Network in Optical Remote Sensing Images | Mar 12, 2026 | | —Unverified | 0 |
| Triple X: A LLM-Based Multilingual Speech Recognition System for the INTERSPEECH2025 MLC-SLM Challenge | Mar 12, 2026 | | —Unverified | 0 |
| What do near-optimal learning rate schedules look like? | Mar 12, 2026 | | —Unverified | 0 |
| Global Evolutionary Steering: Refining Activation Steering Control via Cross-Layer Consistency | Mar 12, 2026 | | —Unverified | 0 |
| A Geometrically-Grounded Drive for MDL-Based Optimization in Deep Learning | Mar 12, 2026 | | —Unverified | 0 |
| VQQA: An Agentic Approach for Video Evaluation and Quality Improvement | Mar 12, 2026 | | —Unverified | 0 |
| Pruning-induced phases in fully-connected neural networks: the eumentia, the dementia, and the amentia | Mar 12, 2026 | | —Unverified | 0 |
| Maximum Entropy Exploration Without the Rollouts | Mar 12, 2026 | | —Unverified | 0 |
| Bridging the Gap Between Security Metrics and Key Risk Indicators: An Empirical Framework for Vulnerability Prioritization | Mar 12, 2026 | | —Unverified | 0 |
| Operationalising Cyber Risk Management Using AI: Connecting Cyber Incidents to MITRE ATT&CK Techniques, Security Controls, and Metrics | Mar 12, 2026 | | —Unverified | 0 |
| Learning Pore-scale Multiphase Flow from 4D Velocimetry | Mar 12, 2026 | | —Unverified | 0 |
| Delayed Backdoor Attacks: Exploring the Temporal Dimension as a New Attack Surface in Pre-Trained Models | Mar 12, 2026 | | —Unverified | 0 |
| FastLSQ: Solving PDEs in One Shot via Fourier Features with Exact Analytical Derivatives | Mar 12, 2026 | | Code Available | 0 |
| Evaluation and LLM-Guided Learning of ICD Coding Rationales | Mar 12, 2026 | | —Unverified | 0 |
| LifeSim: Long-Horizon User Life Simulator for Personalized Assistant Evaluation | Mar 12, 2026 | | —Unverified | 0 |
| Deep Incentive Design with Differentiable Equilibrium Blocks | Mar 12, 2026 | | —Unverified | 0 |
| Gen-Fab: A Variation-Aware Generative Model for Predicting Fabrication Variations in Nanophotonic Devices | Mar 12, 2026 | | —Unverified | 0 |
| FlexRec: Adapting LLM-based Recommenders for Flexible Needs via Reinforcement Learning | Mar 12, 2026 | | —Unverified | 0 |
| Compiling Temporal Numeric Planning into Discrete PDDL+: Extended Version | Mar 12, 2026 | | —Unverified | 0 |
| Llettuce: An Open Source Natural Language Processing Tool for the Translation of Medical Terms into Uniform Clinical Encoding | Mar 12, 2026 | | —Unverified | 0 |
| Head-wise Adaptive Rotary Positional Encoding for Fine-Grained Image Generation | Mar 12, 2026 | | —Unverified | 0 |
| Thermodynamics of Reinforcement Learning Curricula | Mar 12, 2026 | | —Unverified | 0 |
| Temporal Straightening for Latent Planning | Mar 12, 2026 | | —Unverified | 0 |
| MM-CondChain: A Programmatically Verified Benchmark for Visually Grounded Deep Compositional Reasoning | Mar 12, 2026 | | —Unverified | 0 |
| Generalizing Vision-Language Models with Dedicated Prompt Guidance | Mar 12, 2026 | | —Unverified | 0 |
| Entropy Guided Diversification and Preference Elicitation in Agentic Recommendation Systems | Mar 12, 2026 | | —Unverified | 0 |
| EvoFlows: Evolutionary Edit-Based Flow-Matching for Protein Engineering | Mar 12, 2026 | | —Unverified | 0 |
| Social, Legal, Ethical, Empathetic and Cultural Norm Operationalisation for AI Agents | Mar 12, 2026 | | —Unverified | 0 |
| Cornserve: A Distributed Serving System for Any-to-Any Multimodal Models | Mar 12, 2026 | | —Unverified | 0 |
| A Two-Stage Dual-Modality Model for Facial Emotional Expression Recognition | Mar 12, 2026 | | —Unverified | 0 |
| Causal Representation Learning with Optimal Compression under Complex Treatments | Mar 12, 2026 | | —Unverified | 0 |
| Exploiting Expertise of Non-Expert and Diverse Agents in Social Bandit Learning: A Free Energy Approach | Mar 12, 2026 | | —Unverified | 0 |
| CoMMET: To What Extent Can LLMs Perform Theory of Mind Tasks? | Mar 12, 2026 | | —Unverified | 0 |
| Real-World Point Tracking with Verifier-Guided Pseudo-Labeling | Mar 12, 2026 | | —Unverified | 0 |
| PreLoRA: Hybrid Pre-training of Vision Transformers with Full Training and Low-Rank Adapters | Mar 12, 2026 | | —Unverified | 0 |
| Video Streaming Thinking: VideoLLMs Can Watch and Think Simultaneously | Mar 12, 2026 | | Code Available | 0 |
| Evaluate-as-Action: Self-Evaluated Process Rewards for Retrieval-Augmented Agents | Mar 12, 2026 | | —Unverified | 0 |
| Hidden State Poisoning Attacks against Mamba-based Language Models | Mar 12, 2026 | | —Unverified | 0 |
| DRIFT: Dual-Representation Inter-Fusion Transformer for Automated Driving Perception with 4D Radar Point Clouds | Mar 12, 2026 | | —Unverified | 0 |
| SpectralGuard: Detecting Memory Collapse Attacks in State Space Models | Mar 12, 2026 | | —Unverified | 0 |
| LLM-Augmented Therapy Normalization and Aspect-Based Sentiment Analysis for Treatment-Resistant Depression on Reddit | Mar 12, 2026 | | —Unverified | 0 |