| We Need to Measure Data Diversity in NLP -- Better and Broader | May 26, 2025 | Diversityvalid | —Unverified | 0 |
| Optimal Conformal Prediction under Epistemic Uncertainty | May 25, 2025 | Conformal PredictionPrediction | CodeCode Available | 0 |
| NTIRE 2025 Challenge on Video Quality Enhancement for Video Conferencing: Datasets, Methods and Results | May 25, 2025 | validVideo Quality Assessment | CodeCode Available | 0 |
| Efficient Long CoT Reasoning in Small Language Models | May 24, 2025 | Mathematical Reasoningvalid | —Unverified | 0 |
| MedScore: Factuality Evaluation of Free-Form Medical Answers | May 24, 2025 | FormHallucination | CodeCode Available | 0 |
| Graph Style Transfer for Counterfactual Explainability | May 23, 2025 | counterfactualCounterfactual Explanation | CodeCode Available | 0 |
| Flexible MOF Generation with Torsion-Aware Flow Matching | May 23, 2025 | valid | —Unverified | 0 |
| Efficient Adaptive Experimentation with Non-Compliance | May 23, 2025 | valid | CodeCode Available | 0 |
| Effects of auditory distance cues and reverberation on spatial perception and listening strategies | May 23, 2025 | valid | CodeCode Available | 0 |
| Applications of Modular Co-Design for De Novo 3D Molecule Generation | May 23, 2025 | 3D Molecule GenerationDenoising | —Unverified | 0 |
| Anytime-valid, Bayes-assisted,Prediction-Powered Inference | May 23, 2025 | Predictionvalid | —Unverified | 0 |
| A collaborative constrained graph diffusion model for the generation of realistic synthetic molecules | May 22, 2025 | valid | CodeCode Available | 0 |
| MuseRAG: Idea Originality Scoring At Scale | May 22, 2025 | RAGRetrieval-augmented Generation | CodeCode Available | 0 |
| Statistical Inference for Online Algorithms | May 22, 2025 | valid | CodeCode Available | 0 |
| Statistical Test for Saliency Maps of Graph Neural Networks via Selective Inference | May 22, 2025 | valid | —Unverified | 0 |
| Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Prefilling Attack | May 21, 2025 | Multiple-choiceMultiple Choice Question Answering (MCQA) | —Unverified | 0 |
| Projection-Based Correction for Enhancing Deep Inverse Networks | May 21, 2025 | valid | —Unverified | 0 |
| Are Vision-Language Models Safe in the Wild? A Meme-Based Benchmark Study | May 21, 2025 | valid | —Unverified | 0 |
| Loss-Guided Auxiliary Agents for Overcoming Mode Collapse in GFlowNets | May 21, 2025 | Diversityvalid | —Unverified | 0 |
| Valid Post-Contextual Bandit Inference | May 20, 2025 | Translationvalid | —Unverified | 0 |
| Learning to Insert for Constructive Neural Vehicle Routing Solver | May 20, 2025 | Model OptimizationPosition | —Unverified | 0 |
| Temporal Alignment of Time Sensitive Facts with Activation Engineering | May 20, 2025 | valid | —Unverified | 0 |
| A Comprehensive Benchmarking Platform for Deep Generative Models in Molecular Design | May 19, 2025 | BenchmarkingDrug Discovery | —Unverified | 0 |
| NTIRE 2025 Challenge on Efficient Burst HDR and Restoration: Datasets, Methods, and Results | May 17, 2025 | valid | —Unverified | 0 |
| Coherent Language Reconstruction from Brain Recordings with Flexible Multi-Modal Input Stimuli | May 15, 2025 | valid | —Unverified | 0 |
| Better Understanding Triple Differences Estimators | May 15, 2025 | valid | —Unverified | 0 |
| Feature Fitted Online Conformal Prediction for Deep Time Series Forecasting Model | May 13, 2025 | Conformal PredictionPrediction | CodeCode Available | 0 |
| A spherical amplitude-phase formulation for 3-D adaptive line-of-sight (ALOS) guidance with USGES stability guarantees | May 13, 2025 | valid | —Unverified | 0 |
| Sharp Gaussian approximations for Decentralized Federated Learning | May 12, 2025 | Federated Learningvalid | —Unverified | 0 |
| Transfer Learning Across Fixed-Income Product Classes | May 12, 2025 | Transfer Learningvalid | —Unverified | 0 |
| Generalization Bounds and Stopping Rules for Learning with Self-Selected Data | May 12, 2025 | Active LearningGeneralization Bounds | —Unverified | 0 |
| LLM-Augmented Chemical Synthesis and Design Decision Programs | May 11, 2025 | Decision MakingMulti-step retrosynthesis | —Unverified | 0 |
| Tell Me Who Your Students Are: GPT Can Generate Valid Multiple-Choice Questions When Students' (Mis)Understanding Is Hinted | May 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Evolutionary thoughts: integration of large language models and evolutionary algorithms | May 9, 2025 | Evolutionary AlgorithmsHallucination | CodeCode Available | 0 |
| PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes | May 8, 2025 | valid | —Unverified | 0 |
| Reinforcement Learning for Game-Theoretic Resource Allocation on Graphs | May 8, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Fair Uncertainty Quantification for Depression Prediction | May 8, 2025 | Conformal PredictionFairness | —Unverified | 0 |
| Conformal Prediction with Corrupted Labels: Uncertain Imputation and Robust Re-weighting | May 7, 2025 | Conformal PredictionImputation | CodeCode Available | 0 |
| LLM Code Customization with Visual Results: A Benchmark on TikZ | May 7, 2025 | Code Generationvalid | —Unverified | 0 |
| Sufficient Decision Proxies for Decision-Focused Learning | May 6, 2025 | valid | —Unverified | 0 |
| An Active Inference Model of Covert and Overt Visual Attention | May 6, 2025 | valid | CodeCode Available | 0 |
| Logits-Constrained Framework with RoBERTa for Ancient Chinese NER | May 5, 2025 | Chinese Named Entity RecognitionModel Selection | —Unverified | 0 |
| Corr2Distrib: Making Ambiguous Correspondences an Ally to Predict Reliable 6D Pose Distributions | May 5, 2025 | Pose Estimationvalid | —Unverified | 0 |
| Improved Dimensionality Reduction for Inverse Problems in Nuclear Fusion and High-Energy Astrophysics | May 5, 2025 | Dimensionality Reductionvalid | —Unverified | 0 |
| Model Checks in a Kernel Ridge Regression Framework | May 2, 2025 | regressionvalid | —Unverified | 0 |
| Constrained Network Adversarial Attacks: Validity, Robustness, and Transferability | May 2, 2025 | Adversarial AttackIntrusion Detection | —Unverified | 0 |
| Policy Learning with α-Expected Welfare | May 1, 2025 | valid | —Unverified | 0 |
| Real-time Program Evaluation using Anytime-valid Rank Tests | Apr 30, 2025 | counterfactualvalid | —Unverified | 0 |
| Passive Measurement of Autonomic Arousal in Real-World Settings | Apr 30, 2025 | valid | —Unverified | 0 |
| Representation Learning Preserving Ignorability and Covariate Matching for Treatment Effects | Apr 29, 2025 | Representation LearningSelection bias | CodeCode Available | 0 |