| InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning | Feb 9, 2026 | | Unverified | 1 |
| Large Multimodal Models as General In-Context Classifiers | Feb 26, 2026 | | Unverified | 1 |
| CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding | Feb 2, 2026 | | Unverified | 1 |
| Optimal Scaling Needs Optimal Norm | Jan 27, 2026 | | Unverified | 1 |
| Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening | Feb 6, 2026 | | Unverified | 1 |
| Embed-RL: Reinforcement Learning for Reasoning-Driven Multimodal Embeddings | Mar 12, 2026 | | Unverified | 1 |
| SeeThrough3D: Occlusion Aware 3D Control in Text-to-Image Generation | Feb 26, 2026 | | Unverified | 1 |
| TADA! Tuning Audio Diffusion Models through Activation Steering | Feb 12, 2026 | | Unverified | 1 |
| Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning | Jan 27, 2026 | | Unverified | 1 |
| HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions | Mar 16, 2026 | | Unverified | 1 |
| DSGym: A Holistic Framework for Evaluating and Training Data Science Agents | Jan 22, 2026 | | Unverified | 1 |
| SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise | Feb 13, 2026 | | Unverified | 1 |
| MedCLIPSeg: Probabilistic Vision-Language Adaptation for Data-Efficient and Generalizable Medical Image Segmentation | Feb 23, 2026 | | Unverified | 1 |
| DREAM: Where Visual Understanding Meets Text-to-Image Generation | Mar 3, 2026 | | Unverified | 1 |
| CFG-Ctrl: Control-Based Classifier-Free Diffusion Guidance | Mar 11, 2026 | | Unverified | 1 |
| See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning | Feb 5, 2026 | | Unverified | 1 |
| PluRel: Synthetic Data unlocks Scaling Laws for Relational Foundation Models | Feb 3, 2026 | | Unverified | 1 |
| Learning to Configure Agentic AI Systems | Feb 12, 2026 | | Unverified | 1 |
| Benchmarking Vision-Language Models for French PDF-to-Markdown Conversion | Feb 12, 2026 | | Unverified | 1 |
| Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model | Mar 5, 2026 | | Unverified | 1 |
| ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents | Mar 19, 2026 | | Unverified | 1 |
| CASA: Cross-Attention over Self-Attention for Efficient Vision-Language Fusion | Mar 6, 2026 | | Unverified | 1 |
| BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning | Mar 5, 2026 | | Unverified | 1 |
| HEARTS: Benchmarking LLM Reasoning on Health Time Series | Mar 14, 2026 | | Unverified | 1 |
| CodeV: Code with Images for Faithful Visual Reasoning via Tool-Aware Policy Optimization | Mar 2, 2026 | | Unverified | 1 |
| AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition | Feb 28, 2026 | | Unverified | 1 |
| U6G XL-MIMO Radiomap Prediction: Multi-Config Dataset and Beam Map Approach | Mar 6, 2026 | | Unverified | 1 |
| Epistemic Diversity and Knowledge Collapse in Large Language Models | Jan 28, 2026 | | Unverified | 1 |
| OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models | Feb 4, 2026 | | Unverified | 1 |
| NOVA: Sparse Control, Dense Synthesis for Pair-Free Video Editing | Mar 3, 2026 | | Unverified | 1 |
| How2Everything: Mining the Web for How-To Procedures to Evaluate and Improve LLMs | Feb 9, 2026 | | Unverified | 1 |
| MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding | Feb 23, 2026 | | Unverified | 1 |
| -Reasoner: LLM Reasoning via Test-Time Gradient Descent in Latent Space | Mar 5, 2026 | | Unverified | 1 |
| Sparking Scientific Creativity via LLM-Driven Interdisciplinary Inspiration | Mar 12, 2026 | | Unverified | 1 |
| Pix2Shape: Towards Unsupervised Learning of 3D Scenes from Images using a View-based Representation | Mar 23, 2020 | Decoder, Spatial Reasoning | Code Available | 1 |
| PeeledHuman: Robust Shape Representation for Textured 3D Human Body Reconstruction | Feb 16, 2020 | 3D Reconstruction | Code Available | 1 |
| Learning Graph Regularisation for Guided Super-Resolution | Mar 27, 2022 | Super-Resolution | Code Available | 1 |
| SpeechNet: A Universal Modularized Model for Speech Processing Tasks | May 7, 2021 | Multi-Task Learning | Code Available | 1 |
| CNN-Based Image Reconstruction Method for Ultrafast Ultrasound Imaging | Aug 28, 2020 | Image Reconstruction | Code Available | 1 |
| Peeking inside the Black Box: Interpreting Deep Learning Models for Exoplanet Atmospheric Retrievals | Nov 23, 2020 | Retrieval | Code Available | 1 |
| Uncrowded Hypervolume-based Multi-objective Optimization with Gene-pool Optimal Mixing | Apr 10, 2020 | Evolutionary Algorithms | Code Available | 1 |
| A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents | Apr 16, 2018 | Abstractive Text Summarization, Decoder | Code Available | 1 |
| Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver? | Dec 1, 2020 | Feature Engineering, Q-Learning | Code Available | 1 |
| Fairwashing Explanations with Off-Manifold Detergent | Jul 20, 2020 | Decision Making | Code Available | 1 |
| Emo-DNA: Emotion Decoupling and Alignment Learning for Cross-Corpus Speech Emotion Recognition | Aug 4, 2023 | Cross-corpus, Domain Adaptation | Code Available | 1 |
| Anisotropic 3D Multi-Stream CNN for Accurate Prostate Segmentation from Multi-Planar MRI | Sep 23, 2020 | Hyperparameter Optimization, Segmentation | Code Available | 1 |
| Imposing Relation Structure in Language-Model Embeddings Using Contrastive Learning | Sep 2, 2021 | Contrastive Learning, Language Modeling | Code Available | 1 |
| Multi-Task Learning for Dense Prediction Tasks: A Survey | Apr 28, 2020 | Multi-Task Learning, Prediction | Code Available | 1 |
| Smooth activations and reproducibility in deep networks | Oct 20, 2020 | | Code Available | 1 |
| Homomorphism Autoencoder -- Learning Group Structured Representations from Observed Transitions | Jul 25, 2022 | Open-Ended Question Answering, Representation Learning | Code Available | 1 |