| Towards Scalable Pre-training of Visual Tokenizers for Generation | Mar 6, 2026 | | Code Available | 0 |
| Understanding and Improving Hyperbolic Deep Reinforcement Learning | Mar 6, 2026 | | Code Available | 0 |
| (MGS)^2-Net: Unifying Micro-Geometric Scale and Macro-Geometric Structure for Cross-View Geo-Localization | Mar 6, 2026 | | Code Available | 0 |
| Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation | Mar 6, 2026 | | Code Available | 0 |
| CASA: Cross-Attention over Self-Attention for Efficient Vision-Language Fusion | Mar 6, 2026 | | —Unverified | 1 |
| U6G XL-MIMO Radiomap Prediction: Multi-Config Dataset and Beam Map Approach | Mar 6, 2026 | | —Unverified | 1 |
| CaTok: Taming Mean Flows for One-Dimensional Causal Image Tokenization | Mar 6, 2026 | | —Unverified | 1 |
| LikePhys: Evaluating Intuitive Physics Understanding in Video Diffusion Models via Likelihood Preference | Mar 6, 2026 | | —Unverified | 1 |
| Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion | Mar 6, 2026 | | —Unverified | 2 |
| RAMoEA-QA: Hierarchical Specialization for Robust Respiratory Audio Question Answering | Mar 6, 2026 | | —Unverified | 0 |
| Symmetry-Constrained Language-Guided Program Synthesis for Discovering Governing Equations from Noisy and Partial Observations | Mar 6, 2026 | | —Unverified | 0 |
| Can LLMs Capture Expert Uncertainty? A Comparative Analysis of Value Alignment in Ethnographic Qualitative Research | Mar 6, 2026 | | —Unverified | 0 |
| Tiny but Mighty: A Software-Hardware Co-Design Approach for Efficient Multimodal Inference on Battery-Powered Small Devices | Mar 6, 2026 | | —Unverified | 0 |
| Exploiting Spatiotemporal Properties for Efficient Event-Driven Human Pose Estimation | Mar 6, 2026 | | —Unverified | 0 |
| Robust Sparse Signal Recovery with Outliers: A Hard Thresholding Pursuit Approach Based on LAD | Mar 6, 2026 | | —Unverified | 0 |
| Systematic Evaluation of Novel View Synthesis for Video Place Recognition | Mar 6, 2026 | | —Unverified | 0 |
| ThaiSafetyBench: Assessing Language Model Safety in Thai Cultural Contexts | Mar 6, 2026 | | Code Available | 0 |
| Proof-of-Guardrail in AI Agents and What (Not) to Trust from It | Mar 6, 2026 | | Code Available | 0 |
| Quantum parameter estimation with uncertainty quantification from continuous measurement data using neural network ensembles | Mar 6, 2026 | | —Unverified | 0 |
| Real-Time Learning of Predictive Dynamic Obstacle Models for Robotic Motion Planning | Mar 6, 2026 | | —Unverified | 0 |
| VisioMath: Benchmarking Figure-based Mathematical Reasoning in LMMs | Mar 6, 2026 | | —Unverified | 0 |
| Discerning What Matters: A Multi-Dimensional Assessment of Moral Competence in LLMs | Mar 6, 2026 | | —Unverified | 0 |
| Reasoned Safety Alignment: Ensuring Jailbreak Defense via Answer-Then-Check | Mar 6, 2026 | | —Unverified | 0 |
| Do We Really Need Permutations? Impact of Model Width on Linear Mode Connectivity | Mar 6, 2026 | | —Unverified | 0 |
| Phys2Real: Fusing VLM Priors with Interactive Online Adaptation for Uncertainty-Aware Sim-to-Real Manipulation | Mar 6, 2026 | | —Unverified | 0 |
| AURASeg: Attention-guided Upsampling with Residual-Assistive Boundary Refinement for Onboard Robot Drivable-Area Segmentation | Mar 6, 2026 | | —Unverified | 0 |
| Critical Confabulation: Can LLMs Hallucinate for Social Good? | Mar 6, 2026 | | —Unverified | 0 |
| SPARK: Jailbreaking T2V Models by Synergistically Prompting Auditory and Recontextualized Knowledge | Mar 6, 2026 | | —Unverified | 0 |
| SyncMV4D: Synchronized Multi-view Joint Diffusion of Appearance and Motion for Hand-Object Interaction Synthesis | Mar 6, 2026 | | —Unverified | 0 |
| UniTS: Unified Spatio-Temporal Generative Model for Remote Sensing | Mar 6, 2026 | | —Unverified | 0 |
| XR-DT: Extended Reality-Enhanced Digital Twin for Safe Motion Planning via Human-Aware Model Predictive Path Integral Control | Mar 6, 2026 | | —Unverified | 0 |
| Agent Tools Orchestration Leaks More: Dataset, Benchmark, and Mitigation | Mar 6, 2026 | | —Unverified | 0 |
| Purification Before Fusion: Toward Mask-Free Speech Enhancement for Robust Audio-Visual Speech Recognition | Mar 6, 2026 | | —Unverified | 0 |
| Online unsupervised Hebbian learning in deep photonic neuromorphic networks | Mar 6, 2026 | | —Unverified | 0 |
| COMI: Coarse-to-fine Context Compression via Marginal Information Gain | Mar 6, 2026 | | —Unverified | 0 |
| Uncertainty Quantification in LLM Agents: Foundations, Emerging Challenges, and Opportunities | Mar 6, 2026 | | —Unverified | 0 |
| Why Human Guidance Matters in Collaborative Vibe Coding | Mar 6, 2026 | | —Unverified | 0 |
| Coverage-Aware Web Crawling for Domain-Specific Supplier Discovery via a Web--Knowledge--Web Pipeline | Mar 6, 2026 | | —Unverified | 0 |
| MatRIS: Toward Reliable and Efficient Pretrained Machine Learning Interatomic Potentials | Mar 6, 2026 | | —Unverified | 0 |
| Phys4D: Fine-Grained Physics-Consistent 4D Modeling from Video Diffusion | Mar 6, 2026 | | —Unverified | 0 |
| Large-Language-Model-Guided State Estimation for Partially Observable Task and Motion Planning | Mar 6, 2026 | | —Unverified | 0 |
| MOOSEnger -- a Domain-Specific AI Agent for the MOOSE Ecosystem | Mar 6, 2026 | | —Unverified | 0 |
| Visual Words Meet BM25: Sparse Auto-Encoder Visual Word Scoring for Image Retrieval | Mar 6, 2026 | | —Unverified | 0 |
| Layer-wise Instance Binding for Regional and Occlusion Control in Text-to-Image Diffusion Transformers | Mar 6, 2026 | | —Unverified | 0 |
| First-Order Softmax Weighted Switching Gradient Method for Distributed Stochastic Minimax Optimization with Stochastic Constraints | Mar 6, 2026 | | —Unverified | 0 |
| Tutor Move Taxonomy: A Theory-Aligned Framework for Analyzing Instructional Moves in Tutoring | Mar 6, 2026 | | —Unverified | 0 |
| Balancing Domestic and Global Perspectives: Evaluating Dual-Calibration and LLM-Generated Nudges for Diverse News Recommendation | Mar 6, 2026 | | —Unverified | 0 |
| Spectral Probing of Feature Upsamplers in 2D-to-3D Scene Reconstruction | Mar 6, 2026 | | —Unverified | 0 |
| StreamWise: Serving Multi-Modal Generation in Real-Time at Scale | Mar 6, 2026 | | —Unverified | 0 |
| Ambiguity Collapse by LLMs: A Taxonomy of Epistemic Risks | Mar 6, 2026 | | —Unverified | 0 |