| Segment-to-Act: Label-Noise-Robust Action-Prompted Video Segmentation Towards Embodied Intelligence | Mar 4, 2026 | | Code Available | 0 |
| AMiD: Knowledge Distillation for LLMs with α-mixture Assistant Distribution | Mar 4, 2026 | | Code Available | 0 |
| When and Where to Reset Matters for Long-Term Test-Time Adaptation | Mar 4, 2026 | | Code Available | 0 |
| Toward Early Quality Assessment of Text-to-Image Diffusion Models | Mar 4, 2026 | | Code Available | 0 |
| Glass Segmentation with Fusion of Learned and General Visual Features | Mar 4, 2026 | | Code Available | 0 |
| From Narrow to Panoramic Vision: Attention-Guided Cold-Start Reshapes Multimodal Reasoning | Mar 4, 2026 | | Code Available | 0 |
| Bridging Human Evaluation to Infrared and Visible Image Fusion | Mar 4, 2026 | | Code Available | 0 |
| Joint Hardware-Workload Co-Optimization for In-Memory Computing Accelerators | Mar 4, 2026 | | Code Available | 0 |
| CzechTopic: A Benchmark for Zero-Shot Topic Localization in Historical Czech Documents | Mar 4, 2026 | | Code Available | 0 |
| RIVER: A Real-Time Interaction Benchmark for Video LLMs | Mar 4, 2026 | | Code Available | 0 |
| VietNormalizer: An Open-Source, Dependency-Free Python Library for Vietnamese Text Normalization in TTS and NLP Applications | Mar 4, 2026 | | Code Available | 0 |
| Error as Signal: Stiffness-Aware Diffusion Sampling via Embedded Runge-Kutta Guidance | Mar 4, 2026 | | Code Available | 0 |
| Topological Alignment of Shared Vision-Language Embedding Space | Mar 4, 2026 | | Code Available | 0 |
| Towards Realistic Personalization: Evaluating Long-Horizon Preference Following in Personalized User-LLM Interactions | Mar 4, 2026 | | Code Available | 0 |
| VidEoMT: Your ViT is Secretly Also a Video Segmentation Model | Mar 4, 2026 | | Unverified | 2 |
| Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents | Mar 4, 2026 | | Unverified | 2 |
| VideoChat-M1: Collaborative Policy Planning for Video Understanding via Multi-Agent Reinforcement Learning | Mar 4, 2026 | | Unverified | 0 |
| Crab^+: A Scalable and Unified Audio-Visual Scene Understanding Model with Explicit Cooperation | Mar 4, 2026 | | Unverified | 0 |
| MERIT: Memory-Enhanced Retrieval for Interpretable Knowledge Tracing | Mar 3, 2026 | | Unverified | 0 |
| Evaluating Prompting Strategies for Chart Question Answering with Large Language Models | Mar 3, 2026 | | Unverified | 0 |
| Multi-Agent Debate with Memory Masking | Mar 3, 2026 | | Unverified | 0 |
| Locally Coherent Parallel Decoding in Diffusion Language Models | Mar 3, 2026 | | Unverified | 0 |
| Expected Reward Prediction, with Applications to Model Routing | Mar 3, 2026 | | Unverified | 0 |
| An experimental study of KV cache reuse strategies in chunk-level caching systems | Mar 3, 2026 | | Unverified | 0 |
| Thinking into the Future: Latent Lookahead Training for Transformers | Mar 3, 2026 | | Unverified | 0 |
| GSI Agent: Domain Knowledge Enhancement for Large Language Models in Green Stormwater Infrastructure | Mar 3, 2026 | | Unverified | 0 |
| EEG-SeeGraph: Interpreting functional connectivity disruptions in dementias via sparse-explanatory dynamic EEG-graph learning | Mar 3, 2026 | | Unverified | 0 |
| EEG-Based Brain-LLM Interface for Human Preference Aligned Generation | Mar 3, 2026 | | Unverified | 0 |
| Tokenization Tradeoffs in Structured EHR Foundation Models | Mar 3, 2026 | | Unverified | 0 |
| Form Follows Function: Recursive Stem Model | Mar 3, 2026 | | Unverified | 0 |
| CraniMem: Cranial Inspired Gated and Bounded Memory for Agentic Systems | Mar 3, 2026 | | Code Available | 0 |
| Evidence-based Distributional Alignment for Large Language Models | Mar 3, 2026 | | Unverified | 0 |
| Benchmarking Compact VLMs for Clip-Level Surveillance Anomaly Detection Under Weak Supervision | Mar 3, 2026 | | Unverified | 0 |
| Task Expansion and Cross Refinement for Open-World Conditional Modeling | Mar 3, 2026 | | Unverified | 0 |
| Preventing Curriculum Collapse in Self-Evolving Reasoning Systems | Mar 3, 2026 | | Unverified | 0 |
| Suppressing Domain-Specific Hallucination in Construction LLMs: A Knowledge Graph Foundation for GraphRAG and QLoRA on River and Sediment Control Technical Standards | Mar 3, 2026 | | Unverified | 0 |
| A Browser-based Open Source Assistant for Multimodal Content Verification | Mar 3, 2026 | | Unverified | 0 |
| Hybrid Orchestration of Edge AI and Microservices via Graph-based Self-Imitation Learning | Mar 3, 2026 | | Unverified | 0 |
| calibfusion: Transformer-Based Differentiable Calibration for Radar-Camera Fusion Detection in Water-Surface Environments | Mar 3, 2026 | | Unverified | 0 |
| Unmixing microinfrared spectroscopic images of cross-sections of historical oil paintings | Mar 3, 2026 | | Unverified | 0 |
| XAI and Few-shot-based Hybrid Classification Model for Plant Leaf Disease Prognosis | Mar 3, 2026 | | Unverified | 0 |
| Chart Deep Research in LVLMs via Parallel Relative Policy Optimization | Mar 3, 2026 | | Unverified | 0 |
| VB: Visibility Benchmark for Visibility and Perspective Reasoning in Images | Mar 3, 2026 | | Unverified | 0 |
| MultiGen: Level-Design for Editable Multiplayer Worlds in Diffusion Game Engines | Mar 3, 2026 | | Unverified | 0 |
| ERP-RiskBench: Leakage-Safe Ensemble Learning for Financial Risk | Mar 3, 2026 | | Unverified | 0 |
| Does Semantic Noise Initialization Transfer from Images to Videos? A Paired Diagnostic Study | Mar 3, 2026 | | Unverified | 0 |
| AutoFigure-Edit: Generating Editable Scientific Illustration | Mar 3, 2026 | | Code Available | 0 |
| GNN For Muon Particle Momentum estimation | Mar 3, 2026 | | Unverified | 0 |
| A theoretical model of dynamical grammatical gender shifting based on set-valued set function | Mar 3, 2026 | | Unverified | 0 |
| Baseline Performance of AI Tools in Classifying Cognitive Demand of Mathematical Tasks | Mar 3, 2026 | | Unverified | 0 |