| Evaluating MLLMs with Multimodal Multi-image Reasoning Benchmark | Jun 4, 2025 | Sentence, Visual Reasoning | Unverified | 0 |
| A Statistical Physics of Language Model Reasoning | Jun 4, 2025 | Language Modeling | Unverified | 0 |
| Mechanistic Decomposition of Sentence Representations | Jun 4, 2025 | Dictionary Learning, Sentence | Unverified | 0 |
| IMPARA-GED: Grammatical Error Detection is Boosting Reference-free Grammatical Error Quality Estimator | Jun 3, 2025 | Grammatical Error Correction, Grammatical Error Detection | Unverified | 0 |
| The Reader is the Metric: How Textual Features and Reader Profiles Explain Conflicting Evaluations of AI Creative Writing | Jun 3, 2025 | Feature Importance, Sentence | Code Available | 0 |
| EEG2TEXT-CN: An Exploratory Study of Open-Vocabulary Chinese Text-EEG Alignment via Large Language Model and Contrastive Learning on ChineseEEG | Jun 1, 2025 | Contrastive Learning, Decoder | Unverified | 0 |
| CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching | Jun 1, 2025 | Dialogue Generation, Disentanglement | Unverified | 0 |
| Efficient Text Encoders for Labor Market Analysis | May 30, 2025 | Contrastive Learning, Extreme Multi-Label Classification | Unverified | 0 |
| BeaverTalk: Oregon State University's IWSLT 2025 Simultaneous Speech Translation System | May 29, 2025 | Automatic Speech Recognition (ASR) | Code Available | 0 |
| MedPAIR: Measuring Physicians and AI Relevance Alignment in Medical Question Answering | May 29, 2025 | Medical Question Answering, Question Answering | Unverified | 0 |