| Cropping outperforms dropout as an augmentation strategy for self-supervised training of text embeddings | Mar 16, 2026 | | —Unverified | 0 |
| STEMTOX: From Social Tags to Fine-Grained Toxic Meme Detection via Entropy-Guided Multi-Task Learning | Mar 16, 2026 | | —Unverified | 0 |
| Towards Privacy-Preserving Machine Translation at the Inference Stage: A New Task and Benchmark | Mar 16, 2026 | | —Unverified | 0 |
| Benchmarking LLM-based agents for single-cell omics analysis | Mar 16, 2026 | | —Unverified | 0 |
| Surgical Video Understanding with Label Interpolation | Mar 16, 2026 | | —Unverified | 0 |
| EMMA: Generalizing Real-World Robot Manipulation via Generative Visual Transfer | Mar 16, 2026 | | —Unverified | 0 |
| Grounded Misunderstandings in Asymmetric Dialogue: A Perspectivist Annotation Scheme for MapTask | Mar 16, 2026 | | —Unverified | 0 |
| Collaborating Vision, Depth, and Thermal Signals for Multi-Modal Tracking: Dataset and Algorithm | Mar 16, 2026 | | —Unverified | 0 |
| YOLO26: Key Architectural Enhancements and Performance Benchmarking for Real-Time Object Detection | Mar 16, 2026 | | —Unverified | 0 |
| Convergence of Distributionally Robust Q-Learning with Linear Function Approximation | Mar 16, 2026 | | —Unverified | 0 |
| Near-Equilibrium Propagation training in nonlinear wave systems | Mar 16, 2026 | | —Unverified | 0 |
| Diffusion-Classifier Synergy: Reward-Aligned Learning via Mutual Boosting Loop for FSCIL | Mar 16, 2026 | | —Unverified | 0 |
| Diverse Text-to-Image Generation via Contrastive Noise Optimization | Mar 16, 2026 | | —Unverified | 0 |
| Watch and Learn: Learning to Use Computers from Online Videos | Mar 16, 2026 | | —Unverified | 0 |
| Dynamic Stress Detection: A Study of Temporal Progression Modelling of Stress in Speech | Mar 16, 2026 | | —Unverified | 0 |
| Data-intrinsic approximation in metric spaces | Mar 16, 2026 | | —Unverified | 0 |
| Qubit-centric Transformer for Surface Code Decoding | Mar 16, 2026 | | —Unverified | 0 |
| A Functional Perspective on Knowledge Distillation in Neural Networks | Mar 16, 2026 | | —Unverified | 0 |
| Feature-driven reinforcement learning for photovoltaic in continuous intraday trading | Mar 16, 2026 | | —Unverified | 0 |
| SemBench: A Benchmark for Semantic Query Processing Engines | Mar 16, 2026 | | —Unverified | 0 |
| First Proof | Mar 16, 2026 | | —Unverified | 0 |
| VLAD-Grasp: Zero-shot Grasp Detection via Vision-Language Models | Mar 16, 2026 | | —Unverified | 0 |
| MedPT: A Massive Medical Question Answering Dataset for Brazilian-Portuguese Speakers | Mar 16, 2026 | | —Unverified | 0 |
| Tractable Probabilistic Models for Investment Planning | Mar 16, 2026 | | —Unverified | 0 |
| Cheating Stereo Matching in Full-scale: Physical Adversarial Attack against Binocular Depth Estimation in Autonomous Driving | Mar 16, 2026 | | —Unverified | 0 |
| SVG360: Multi-View SVG Generation with Geometric and Color Consistency from a Single SVG | Mar 16, 2026 | | —Unverified | 0 |
| ConsistCompose: Unified Multimodal Layout Control for Image Composition | Mar 16, 2026 | | —Unverified | 0 |
| Scale Where It Matters: Training-Free Localized Scaling for Diffusion Models | Mar 16, 2026 | | —Unverified | 0 |
| GENA3D: Generative Amodal 3D Modeling by Bridging 2D Priors and 3D Coherence | Mar 16, 2026 | | —Unverified | 0 |
| STAGE: Storyboard-Anchored Generation for Cinematic Multi-shot Narrative | Mar 16, 2026 | | —Unverified | 0 |
| MatAnyone 2: Scaling Video Matting via a Learned Quality Evaluator | Mar 16, 2026 | | —Unverified | 4 |
| Setting the Stage: Text-Driven Scene-Consistent Image Generation | Mar 16, 2026 | | —Unverified | 0 |
| Training-Free Global Geometric Association for 4D LiDAR Panoptic Segmentation | Mar 16, 2026 | | —Unverified | 0 |
| Assessing generative modeling approaches for free energy estimates in condensed matter | Mar 16, 2026 | | —Unverified | 0 |
| Agentic Retoucher for Text-To-Image Generation | Mar 16, 2026 | | —Unverified | 0 |
| WebCoderBench: Benchmarking Web Application Generation with Comprehensive and Interpretable Evaluation Metrics | Mar 16, 2026 | | —Unverified | 0 |
| MorphGS: Morphology-Adaptive Articulated 3D Motion Transfer from Videos | Mar 16, 2026 | | —Unverified | 0 |
| From Entropy to Epiplexity: Rethinking Information for Computationally Bounded Intelligence | Mar 16, 2026 | | —Unverified | 0 |
| LAMB: LLM-based Audio Captioning with Modality Gap Bridging via Cauchy-Schwarz Divergence | Mar 16, 2026 | | —Unverified | 0 |
| Boosting Latent Diffusion Models via Disentangled Representation Alignment | Mar 16, 2026 | | —Unverified | 0 |
| RAG-3DSG: Enhancing 3D Scene Graphs with Re-Shot Guided Retrieval-Augmented Generation | Mar 16, 2026 | | —Unverified | 0 |
| Sparks of Cooperative Reasoning: LLMs as Strategic Hanabi Agents | Mar 16, 2026 | | —Unverified | 0 |
| NaVIDA: Vision-Language Navigation with Inverse Dynamics Augmentation | Mar 16, 2026 | | —Unverified | 0 |
| BabyReasoningBench: Generating Developmentally-Inspired Reasoning Tasks for Evaluating Baby Language Models | Mar 16, 2026 | | —Unverified | 0 |
| The Geometric Mechanics of Contrastive Learning: Alignment Potentials, Entropic Dispersion, and Modality Gap | Mar 16, 2026 | | —Unverified | 0 |
| CRAFT: Calibrated Reasoning with Answer-Faithful Traces via Reinforcement Learning for Multi-Hop Question Answering | Mar 16, 2026 | | —Unverified | 0 |
| The Wisdom of Many Queries: Complexity-Diversity Principle for Dense Retriever Training | Mar 16, 2026 | | —Unverified | 0 |
| Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diffusion Language Models | Mar 16, 2026 | | Code Available | 0 |
| SoulX-Singer: Towards High-Quality Zero-Shot Singing Voice Synthesis | Mar 16, 2026 | | —Unverified | 3 |
| HyperTokens: Controlling Token Dynamics for Continual Video-Language Understanding | Mar 16, 2026 | | —Unverified | 0 |