| Any Large Language Model Can Be a Reliable Judge: Debiasing with a Reasoning-based Bias Detector | May 21, 2025 | Bias DetectionIn-Context Learning | —Unverified | 0 |
| Aligning Dialogue Agents with Global Feedback via Large Language Model Reward Decomposition | May 21, 2025 | Dialogue GenerationLanguage Modeling | —Unverified | 0 |
| Ensembling Sparse Autoencoders | May 21, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| ViQAgent: Zero-Shot Video Question Answering via Agent with Open-Vocabulary Grounding Validation | May 21, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 0 |
| Internal and External Impacts of Natural Language Processing Papers | May 21, 2025 | ArticlesEthics | —Unverified | 0 |
| Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question Answering | May 21, 2025 | BenchmarkingLanguage Modeling | CodeCode Available | 0 |
| Segmentation-Variant Codebooks for Preservation of Paralinguistic and Prosodic Information | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MIKU-PAL: An Automated and Standardized Multi-Modal Method for Speech Paralinguistic and Affect Labeling | May 21, 2025 | Emotion RecognitionFace Detection | —Unverified | 0 |
| Diagnosing our datasets: How does my language model learn clinical information? | May 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective | May 21, 2025 | Instruction FollowingLanguage Modeling | —Unverified | 0 |