| Comparing human and LLM proofreading in L2 writing: Impact on lexical and syntactic features | Jun 10, 2025 | Sentence | Unverified | 0 |
| Advancing STT for Low-Resource Real-World Speech | Jun 10, 2025 | Sentence, Speech-to-Text | Unverified | 0 |
| Factors affecting the in-context learning abilities of LLMs for dialogue state tracking | Jun 10, 2025 | Dialogue State Tracking, In-Context Learning | Unverified | 0 |
| Multimodal Representation Alignment for Cross-modal Information Retrieval | Jun 10, 2025 | Cross-Modal Information Retrieval, Information Retrieval | Unverified | 0 |
| Swiss Parliaments Corpus Re-Imagined (SPC_R): Enhanced Transcription with RAG-based Correction and Predicted BLEU | Jun 9, 2025 | RAG, Sentence | Unverified | 0 |
| Neural Responses to Affective Sentences Reveal Signatures of Depression | Jun 6, 2025 | Diagnostic, EEG | Unverified | 0 |
| ConECT Dataset: Overcoming Data Scarcity in Context-Aware E-Commerce MT | Jun 5, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Static Word Embeddings for Sentence Semantic Representation | Jun 5, 2025 | Contrastive Learning, Knowledge Distillation | Unverified | 0 |
| A MISMATCHED Benchmark for Scientific Natural Language Inference | Jun 5, 2025 | Articles, Natural Language Inference | Code Available | 0 |
| SSA-COMET: Do LLMs Outperform Learned Metrics in Evaluating MT for Under-Resourced African Languages? | Jun 5, 2025 | Machine Translation, Sentence | Unverified | 0 |
| Mechanistic Decomposition of Sentence Representations | Jun 4, 2025 | Dictionary Learning, Sentence | Unverified | 0 |
| Evaluating MLLMs with Multimodal Multi-image Reasoning Benchmark | Jun 4, 2025 | Sentence, Visual Reasoning | Unverified | 0 |
| A Statistical Physics of Language Model Reasoning | Jun 4, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| IMPARA-GED: Grammatical Error Detection is Boosting Reference-free Grammatical Error Quality Estimator | Jun 3, 2025 | Grammatical Error Correction, Grammatical Error Detection | Unverified | 0 |
| The Reader is the Metric: How Textual Features and Reader Profiles Explain Conflicting Evaluations of AI Creative Writing | Jun 3, 2025 | Feature Importance, Sentence | Code Available | 0 |
| CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching | Jun 1, 2025 | Dialogue Generation, Disentanglement | Unverified | 0 |
| EEG2TEXT-CN: An Exploratory Study of Open-Vocabulary Chinese Text-EEG Alignment via Large Language Model and Contrastive Learning on ChineseEEG | Jun 1, 2025 | Contrastive Learning, Decoder | Unverified | 0 |
| Efficient Text Encoders for Labor Market Analysis | May 30, 2025 | Contrastive Learning, Extreme Multi-Label Classification | Unverified | 0 |
| BeaverTalk: Oregon State University's IWSLT 2025 Simultaneous Speech Translation System | May 29, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| MedPAIR: Measuring Physicians and AI Relevance Alignment in Medical Question Answering | May 29, 2025 | Medical Question Answering, Question Answering | Unverified | 0 |
| Document-Level Text Generation with Minimum Bayes Risk Decoding using Optimal Transport | May 29, 2025 | Document Level Machine Translation, Image Captioning | Code Available | 0 |
| StrucSum: Graph-Structured Reasoning for Long Document Extractive Summarization with LLMs | May 29, 2025 | Extractive Summarization, Sentence | Unverified | 0 |
| Do Large Language Models Think Like the Brain? Sentence-Level Evidence from fMRI and Hierarchical Embeddings | May 28, 2025 | Sentence | Unverified | 0 |
| NegVQA: Can Vision Language Models Understand Negation? | May 28, 2025 | Negation, Question Answering | Unverified | 0 |
| StressTest: Can YOUR Speech LM Handle the Stress? | May 28, 2025 | Question Answering, Sentence | Unverified | 0 |
| Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations | May 27, 2025 | Sentence | Unverified | 0 |
| Causal Distillation: Transferring Structured Explanations from Large to Compact Language Models | May 26, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Token-level Accept or Reject: A Micro Alignment Approach for Large Language Models | May 26, 2025 | Binary Classification, Sentence | Code Available | 0 |
| Zero-Shot Streaming Text to Speech Synthesis with Transducer and Auto-Regressive Modeling | May 26, 2025 | Sentence, Speech Synthesis | Unverified | 0 |
| Inconsistent Tokenizations Cause Language Models to be Perplexed by Japanese Grammar | May 26, 2025 | Machine Translation, Sentence | Unverified | 0 |
| MVP: Multi-source Voice Pathology detection | May 26, 2025 | Sentence, Voice pathology detection | Code Available | 0 |
| Analyzing Political Bias in LLMs via Target-Oriented Sentiment Classification | May 26, 2025 | Sentence, Sentiment Analysis | Unverified | 0 |
| SCIRGC: Multi-Granularity Citation Recommendation and Citation Sentence Preference Alignment | May 26, 2025 | Articles, Citation Recommendation | Unverified | 0 |
| Research on feature fusion and multimodal patent text based on graph attention network | May 26, 2025 | Computational Efficiency, Graph Attention | Unverified | 0 |
| Dependency Parsing is More Parameter-Efficient with Normalization | May 26, 2025 | Dependency Parsing, Sentence | Unverified | 0 |
| Model Enumeration of Two-Variable Logic with Quadratic Delay Complexity | May 26, 2025 | Sentence | Code Available | 0 |
| AmpleHate: Amplifying the Attention for Versatile Implicit Hate Detection | May 26, 2025 | Contrastive Learning, Hate Speech Detection | Code Available | 0 |
| Rethinking the Understanding Ability across LLMs through Mutual Information | May 25, 2025 | Sentence, Sentence Embedding | Unverified | 0 |
| Learning to Explain: Prototype-Based Surrogate Models for LLM Classification | May 25, 2025 | Decision Making, Sentence | Unverified | 0 |
| Unveiling Dual Quality in Product Reviews: An NLP-Based Approach | May 25, 2025 | Sentence | Unverified | 0 |
| WHISTRESS: Enriching Transcriptions with Sentence Stress Detection | May 25, 2025 | Sentence, Zero-shot Generalization | Unverified | 0 |
| ASPO: Adaptive Sentence-Level Preference Optimization for Fine-Grained Multimodal Reasoning | May 25, 2025 | Computational Efficiency, Multimodal Reasoning | Unverified | 0 |
| Building a Functional Machine Translation Corpus for Kpelle | May 24, 2025 | Data Augmentation, Language Modelling | Unverified | 0 |
| MedScore: Factuality Evaluation of Free-Form Medical Answers | May 24, 2025 | Form, Hallucination | Code Available | 0 |
| PD^3: A Project Duplication Detection Framework via Adapted Multi-Agent Debate | May 23, 2025 | Sentence | Unverified | 0 |
| Multi-Scale Probabilistic Generation Theory: A Hierarchical Framework for Interpreting Large Language Models | May 23, 2025 | Sentence | Unverified | 0 |
| Memorization or Reasoning? Exploring the Idiom Understanding of LLMs | May 22, 2025 | Machine Translation, Memorization | Unverified | 0 |
| A Japanese Language Model and Three New Evaluation Benchmarks for Pharmaceutical NLP | May 22, 2025 | Continual Pretraining, Diagnostic | Code Available | 0 |
| SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning | May 22, 2025 | Sentence | Unverified | 0 |
| LLMs Are Not Scorers: Rethinking MT Evaluation with Generation-Based Methods | May 22, 2025 | Decoder, Machine Translation | Code Available | 0 |