| Do Understanding and Generation Fight? A Diagnostic Study of DPO for Unified Multimodal Models | Mar 17, 2026 | | —Unverified | 0 |
| SpecMoE: Spectral Mixture-of-Experts Foundation Model for Cross-Species EEG Decoding | Mar 17, 2026 | | —Unverified | 0 |
| LUMINA: A Multi-Vendor Mammography Benchmark with Energy Harmonization Protocol | Mar 17, 2026 | | —Unverified | 0 |
| Anonymous-by-Construction: An LLM-Driven Framework for Privacy-Preserving Text | Mar 17, 2026 | | —Unverified | 0 |
| When the City Teaches the Car: Label-Free 3D Perception from Infrastructure | Mar 17, 2026 | | —Unverified | 0 |
| Automated identification of Ichneumonoidea wasps via YOLO-based deep learning: Integrating HiresCam for Explainable AI | Mar 17, 2026 | | —Unverified | 0 |
| Transformer-Encoder Trees for Efficient Multilingual Machine Translation and Speech Translation | Mar 17, 2026 | | —Unverified | 0 |
| A Scalable Approach to Solving Simulation-Based Network Security Games | Mar 17, 2026 | | —Unverified | 0 |
| Semantic One-Dimensional Tokenizer for Image Reconstruction and Generation | Mar 17, 2026 | | —Unverified | 0 |
| Over-the-air White-box Attack on the Wav2Vec Speech Recognition Neural Network | Mar 17, 2026 | | —Unverified | 0 |
| Edge-Efficient Two-Stream Multimodal Architecture for Non-Intrusive Bathroom Fall Detection | Mar 17, 2026 | | —Unverified | 0 |
| CircuitBuilder: From Polynomials to Circuits via Reinforcement Learning | Mar 17, 2026 | | —Unverified | 0 |
| ProgressiveAvatars: Progressive Animatable 3D Gaussian Avatars | Mar 17, 2026 | | —Unverified | 0 |
| Data-driven generalized perimeter control: Zürich case study | Mar 17, 2026 | | —Unverified | 0 |
| Can LLMs Detect Their Confabulations? Estimating Reliability in Uncertainty-Aware Language Models | Mar 17, 2026 | | —Unverified | 0 |
| Transformers can do Bayesian Clustering | Mar 17, 2026 | | —Unverified | 0 |
| Knowing What You Cannot Explain: Learning to Reject Low-Quality Explanations | Mar 17, 2026 | | —Unverified | 0 |
| EdiVal-Agent: An Object-Centric Framework for Automated, Fine-Grained Evaluation of Multi-Turn Editing | Mar 17, 2026 | | —Unverified | 0 |
| Accurate Shift Invariant Convolutional Neural Networks Using Gaussian-Hermite Moments | Mar 17, 2026 | | —Unverified | 0 |
| Patient4D: Temporally Consistent Patient Body Mesh Recovery from Monocular Operating Room Video | Mar 17, 2026 | | —Unverified | 0 |
| LLM-Powered Flood Depth Estimation from Social Media Imagery: A Vision-Language Model Framework with Mechanistic Interpretability for Transportation Resilience | Mar 17, 2026 | | —Unverified | 0 |
| Self-Regularized Learning Methods | Mar 17, 2026 | | —Unverified | 0 |
| Exploiting the English Grammar Profile for L2 grammatical analysis with LLMs | Mar 17, 2026 | | —Unverified | 0 |
| Generalist Multimodal LLMs Gain Biometric Expertise via Human Salience | Mar 17, 2026 | | —Unverified | 0 |
| CircuitLM: A Multi-Agent LLM-Aided Design Framework for Generating Circuit Schematics from Natural Language Prompts | Mar 17, 2026 | | —Unverified | 0 |
| Formal verification of tree-based machine learning models for lateral spreading | Mar 17, 2026 | | —Unverified | 0 |
| Integrating Inductive Biases in Transformers via Distillation for Financial Time Series Forecasting | Mar 17, 2026 | | —Unverified | 0 |
| Ensemble Self-Training for Unsupervised Machine Translation | Mar 17, 2026 | | —Unverified | 0 |
| SpokenUS: A Spoken User Simulator for Task-Oriented Dialogue | Mar 17, 2026 | | —Unverified | 0 |
| Block-Recurrent Dynamics in Vision Transformers | Mar 17, 2026 | | —Unverified | 1 |
| BAWSeg: A UAV Multispectral Benchmark for Barley Weed Segmentation | Mar 17, 2026 | | —Unverified | 0 |
| Self-Aware Markov Models for Discrete Reasoning | Mar 17, 2026 | | —Unverified | 0 |
| Linearized Bregman Iterations for Sparse Spiking Neural Networks | Mar 17, 2026 | | —Unverified | 0 |
| VideoVerse: Does Your T2V Generator Have World Model Capability to Synthesize Videos? | Mar 17, 2026 | | —Unverified | 0 |
| Readers Prefer Outputs of AI Trained on Copyrighted Books over Expert Human Writers | Mar 17, 2026 | | —Unverified | 0 |
| Connecting Jensen-Shannon and Kullback-Leibler Divergences: A New Bound for Representation Learning | Mar 17, 2026 | | —Unverified | 0 |
| Exploring Collatz Dynamics with Human-LLM Collaboration | Mar 17, 2026 | | —Unverified | 0 |
| AgriPath: A Systematic Exploration of Architectural Trade-offs for Crop Disease Classification | Mar 17, 2026 | | —Unverified | 0 |
| High-Fidelity Compression of Seismic Velocity Models via SIREN Auto-Decoders | Mar 17, 2026 | | —Unverified | 0 |
| OpenHospital: A Thing-in-itself Arena for Evolving and Benchmarking LLM-based Collective Intelligence | Mar 17, 2026 | | —Unverified | 0 |
| Advancing Visual Reliability: Color-Accurate Underwater Image Enhancement for Real-Time Underwater Missions | Mar 17, 2026 | | —Unverified | 0 |
| InViC: Intent-aware Visual Cues for Medical Visual Question Answering | Mar 17, 2026 | | —Unverified | 0 |
| Deep Reinforcement Learning-Assisted Automated Operator Portfolio for Constrained Multi-objective Optimization | Mar 17, 2026 | | —Unverified | 0 |
| Near-light Photometric Stereo with Symmetric Lights | Mar 17, 2026 | | —Unverified | 0 |
| An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU | Mar 17, 2026 | | —Unverified | 0 |
| CD-FKD: Cross-Domain Feature Knowledge Distillation for Robust Single-Domain Generalization in Object Detection | Mar 17, 2026 | | —Unverified | 0 |
| Follow the Clues, Frame the Truth: Hybrid-evidential Deductive Reasoning in Open-Vocabulary Multimodal Emotion Recognition | Mar 17, 2026 | | —Unverified | 0 |
| Multi-Agent Reinforcement Learning Counteracts Delayed CSI in Multi-Satellite Systems | Mar 17, 2026 | | —Unverified | 0 |
| On the Emotion Understanding of Synthesized Speech | Mar 17, 2026 | | —Unverified | 0 |
| Unlearning for One-Step Generative Models via Unbalanced Optimal Transport | Mar 17, 2026 | | —Unverified | 0 |