| HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection | May 1, 2025 | Extractive Question-AnsweringHallucination | —Unverified | 0 |
| Improving Retrieval-Augmented Neural Machine Translation with Monolingual Data | Apr 30, 2025 | Machine TranslationRetrieval | —Unverified | 0 |
| The Distribution of Dependency Distance and Hierarchical Distance in Contemporary Written Japanese and Its Influencing Factors | Apr 30, 2025 | Sentence | —Unverified | 0 |
| 20min-XD: A Comparable Corpus of Swiss News Articles | Apr 30, 2025 | ArticlesSemantic Similarity | CodeCode Available | 0 |
| Crisp: Cognitive Restructuring of Negative Thoughts through Multi-turn Supportive Dialogues | Apr 24, 2025 | Sentence | —Unverified | 0 |
| Do Large Language Models know who did what to whom? | Apr 23, 2025 | Sentence | —Unverified | 0 |
| Information Leakage of Sentence Embeddings via Generative Embedding Inversion Attacks | Apr 23, 2025 | SentenceSentence Embedding | CodeCode Available | 0 |
| FinTextSim: Enhancing Financial Text Analysis with BERTopic | Apr 22, 2025 | SentenceStock Price Prediction | —Unverified | 0 |
| Describe Anything: Detailed Localized Image and Video Captioning | Apr 22, 2025 | SentenceVideo Captioning | —Unverified | 0 |
| FairTranslate: An English-French Dataset for Gender Bias Evaluation in Machine Translation by Overcoming Gender Binarity | Apr 22, 2025 | Machine TranslationSentence | CodeCode Available | 0 |
| Automatic Evaluation Metrics for Document-level Translation: Overview, Challenges and Trends | Apr 21, 2025 | Machine TranslationSentence | —Unverified | 0 |
| sEEG-based Encoding for Sentence Retrieval: A Contrastive Learning Approach to Brain-Language Alignment | Apr 20, 2025 | Contrastive LearningSentence | —Unverified | 0 |
| Automatic Text Summarization (ATS) for Research Documents in Sorani Kurdish | Apr 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Disentangling Linguistic Features with Dimension-Wise Analysis of Vector Embeddings | Apr 20, 2025 | NegationSentence | —Unverified | 0 |
| A Baseline for Self-state Identification and Classification in Mental Health Data: CLPsych 2025 Task | Apr 18, 2025 | AttributeBinary Classification | —Unverified | 0 |
| ViClaim: A Multilingual Multilabel Dataset for Automatic Claim Detection in Videos | Apr 17, 2025 | MisinformationSentence | —Unverified | 0 |
| Accuracy is Not Agreement: Expert-Aligned Evaluation of Crash Narrative Classification Models | Apr 17, 2025 | Sentence | —Unverified | 0 |
| Multilingual Contextualization of Large Language Models for Document-Level Machine Translation | Apr 16, 2025 | Document Level Machine TranslationDocument Translation | —Unverified | 0 |
| The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation | Apr 16, 2025 | SentenceText-to-Video Generation | —Unverified | 0 |
| An LLM-as-a-judge Approach for Scalable Gender-Neutral Translation Evaluation | Apr 16, 2025 | Sentence | —Unverified | 0 |
| SALAD: Improving Robustness and Generalization through Contrastive Learning with Structure-Aware and LLM-Driven Augmented Data | Apr 16, 2025 | Contrastive Learningcounterfactual | —Unverified | 0 |
| Dynamik: Syntactically-Driven Dynamic Font Sizing for Emphasis of Key Information | Apr 13, 2025 | Sentence | CodeCode Available | 0 |
| LLMs Can Achieve High-quality Simultaneous Machine Translation as Efficiently as Offline | Apr 13, 2025 | Machine TranslationSentence | CodeCode Available | 0 |
| Do LLMs Understand Your Translations? Evaluating Paragraph-level MT with Question Answering | Apr 10, 2025 | Machine TranslationQuestion Answering | CodeCode Available | 0 |
| RuOpinionNE-2024: Extraction of Opinion Tuples from Russian News Texts | Apr 9, 2025 | Dialogue EvaluationLanguage Modeling | CodeCode Available | 0 |
| Topic mining based on fine-tuning Sentence-BERT and LDA | Apr 7, 2025 | Sentence | —Unverified | 0 |
| DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation | Apr 7, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Generative AI Enhanced Financial Risk Management Information Retrieval | Apr 4, 2025 | Information RetrievalManagement | CodeCode Available | 0 |
| Coarse-to-Fine Semantic Communication Systems for Text Transmission | Apr 2, 2025 | Semantic CommunicationSentence | —Unverified | 0 |
| TransforMerger: Transformer-based Voice-Gesture Fusion for Robust Human-Robot Communication | Apr 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Foundations and Evaluations in NLP | Apr 2, 2025 | Boundary DetectionDependency Parsing | —Unverified | 0 |
| SentenceKV: Efficient LLM Inference via Sentence-Level Semantic KV Caching | Apr 1, 2025 | Computational EfficiencyCPU | —Unverified | 0 |
| An extension of linear self-attention for in-context learning | Mar 31, 2025 | In-Context LearningSentence | —Unverified | 0 |
| Can DeepSeek Reason Like a Surgeon? An Empirical Evaluation for Vision-Language Understanding in Robotic-Assisted Surgery | Mar 29, 2025 | Action UnderstandingInstrument Recognition | —Unverified | 0 |
| A Framework for Lightweight Responsible Prompting Recommendation | Mar 29, 2025 | SentenceSentence Embeddings | —Unverified | 0 |
| MultiClaimNet: A Massively Multilingual Dataset of Fact-Checked Claim Clusters | Mar 28, 2025 | ClusteringFact Checking | —Unverified | 0 |
| Beyond Single-Sentence Prompts: Upgrading Value Alignment Benchmarks with Dialogues and Stories | Mar 28, 2025 | EthicsSentence | —Unverified | 0 |
| Negation: A Pink Elephant in the Large Language Models' Room? | Mar 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Generating Synthetic Oracle Datasets to Analyze Noise Impact: A Study on Building Function Classification Using Tweets | Mar 28, 2025 | Domain GeneralizationEarth Observation | —Unverified | 0 |
| A Retrieval-Based Approach to Medical Procedure Matching in Romanian | Mar 26, 2025 | Medical ProcedureRetrieval | —Unverified | 0 |
| BeLightRec: A lightweight recommender system enhanced with BERT | Mar 26, 2025 | Collaborative FilteringRecommendation Systems | —Unverified | 0 |
| BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation | Mar 26, 2025 | DescriptiveImage Generation | —Unverified | 0 |
| Beyond Relevance: An Adaptive Exploration-Based Framework for Personalized Recommendations | Mar 25, 2025 | Collaborative FilteringDiversity | —Unverified | 0 |
| SCI-IDEA: Context-Aware Scientific Ideation Using Token and Sentence Embeddings | Mar 25, 2025 | scientific discoverySentence | —Unverified | 0 |
| Bridging Writing Manner Gap in Visual Instruction Tuning by Creating LLM-aligned Instructions | Mar 24, 2025 | Sentence | —Unverified | 0 |
| Collaborative Temporal Consistency Learning for Point-supervised Natural Language Video Localization | Mar 22, 2025 | Saliency DetectionSentence | —Unverified | 0 |
| CASE -- Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement | Mar 21, 2025 | Dimensionality ReductionLanguage Modeling | —Unverified | 0 |
| Leveraging Human Production-Interpretation Asymmetries to Test LLM Cognitive Plausibility | Mar 21, 2025 | Sentence | —Unverified | 0 |
| FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs | Mar 21, 2025 | HallucinationKnowledge Graphs | —Unverified | 0 |
| TROVE: A Challenge for Fine-Grained Text Provenance via Source Sentence Tracing and Relationship Classification | Mar 19, 2025 | RetrievalSentence | —Unverified | 0 |