| HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection | May 1, 2025 | Extractive Question-Answering, Hallucination | Unverified | 0 |
| The Distribution of Dependency Distance and Hierarchical Distance in Contemporary Written Japanese and Its Influencing Factors | Apr 30, 2025 | Sentence | Unverified | 0 |
| Improving Retrieval-Augmented Neural Machine Translation with Monolingual Data | Apr 30, 2025 | Machine Translation, Retrieval | Unverified | 0 |
| 20min-XD: A Comparable Corpus of Swiss News Articles | Apr 30, 2025 | Articles, Semantic Similarity | Code Available | 0 |
| Crisp: Cognitive Restructuring of Negative Thoughts through Multi-turn Supportive Dialogues | Apr 24, 2025 | Sentence | Unverified | 0 |
| Do Large Language Models know who did what to whom? | Apr 23, 2025 | Sentence | Unverified | 0 |
| Information Leakage of Sentence Embeddings via Generative Embedding Inversion Attacks | Apr 23, 2025 | Sentence, Sentence Embedding | Code Available | 0 |
| FairTranslate: An English-French Dataset for Gender Bias Evaluation in Machine Translation by Overcoming Gender Binarity | Apr 22, 2025 | Machine Translation, Sentence | Code Available | 0 |
| FinTextSim: Enhancing Financial Text Analysis with BERTopic | Apr 22, 2025 | Sentence, Stock Price Prediction | Unverified | 0 |
| Describe Anything: Detailed Localized Image and Video Captioning | Apr 22, 2025 | Sentence, Video Captioning | Unverified | 0 |
| Automatic Evaluation Metrics for Document-level Translation: Overview, Challenges and Trends | Apr 21, 2025 | Machine Translation, Sentence | Unverified | 0 |
| Automatic Text Summarization (ATS) for Research Documents in Sorani Kurdish | Apr 20, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| sEEG-based Encoding for Sentence Retrieval: A Contrastive Learning Approach to Brain-Language Alignment | Apr 20, 2025 | Contrastive Learning, Sentence | Unverified | 0 |
| Disentangling Linguistic Features with Dimension-Wise Analysis of Vector Embeddings | Apr 20, 2025 | Negation, Sentence | Unverified | 0 |
| A Baseline for Self-state Identification and Classification in Mental Health Data: CLPsych 2025 Task | Apr 18, 2025 | Attribute, Binary Classification | Unverified | 0 |
| ViClaim: A Multilingual Multilabel Dataset for Automatic Claim Detection in Videos | Apr 17, 2025 | Misinformation, Sentence | Unverified | 0 |
| Accuracy is Not Agreement: Expert-Aligned Evaluation of Crash Narrative Classification Models | Apr 17, 2025 | Sentence | Unverified | 0 |
| SALAD: Improving Robustness and Generalization through Contrastive Learning with Structure-Aware and LLM-Driven Augmented Data | Apr 16, 2025 | Contrastive Learning, counterfactual | Unverified | 0 |
| An LLM-as-a-judge Approach for Scalable Gender-Neutral Translation Evaluation | Apr 16, 2025 | Sentence | Unverified | 0 |
| Multilingual Contextualization of Large Language Models for Document-Level Machine Translation | Apr 16, 2025 | Document Level Machine Translation, Document Translation | Unverified | 0 |
| The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation | Apr 16, 2025 | Sentence, Text-to-Video Generation | Unverified | 0 |
| LLMs Can Achieve High-quality Simultaneous Machine Translation as Efficiently as Offline | Apr 13, 2025 | Machine Translation, Sentence | Code Available | 0 |
| Dynamik: Syntactically-Driven Dynamic Font Sizing for Emphasis of Key Information | Apr 13, 2025 | Sentence | Code Available | 0 |
| Do LLMs Understand Your Translations? Evaluating Paragraph-level MT with Question Answering | Apr 10, 2025 | Machine Translation, Question Answering | Code Available | 0 |
| RuOpinionNE-2024: Extraction of Opinion Tuples from Russian News Texts | Apr 9, 2025 | Dialogue Evaluation, Language Modeling | Code Available | 0 |