| HalluDial: A Large-Scale Benchmark for Automatic Dialogue-Level Hallucination Evaluation | Jun 11, 2024 | HallucinationHallucination Evaluation | CodeCode Available | 0 |
| Building Bridges: A Dataset for Evaluating Gender-Fair Machine Translation into German | Jun 10, 2024 | Machine TranslationSentence | CodeCode Available | 0 |
| MaskLID: Code-Switching Language Identification through Iterative Masking | Jun 10, 2024 | Language IdentificationSentence | CodeCode Available | 1 |
| Verifiable Generation with Subsentence-Level Fine-Grained Citations | Jun 10, 2024 | SentenceSpecificity | —Unverified | 0 |
| Comparing Data Augmentation Methods for End-to-End Task-Oriented Dialog Systems | Jun 10, 2024 | Data AugmentationSentence | —Unverified | 0 |
| Text-aware and Context-aware Expressive Audiobook Speech Synthesis | Jun 9, 2024 | Contrastive LearningLanguage Modeling | —Unverified | 0 |
| MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation | Jun 9, 2024 | DiversitySentence | CodeCode Available | 1 |
| Advancing Semantic Textual Similarity Modeling: A Regression Framework with Translated ReLU and Smooth K2 Loss | Jun 8, 2024 | Contrastive LearningSemantic Textual Similarity | CodeCode Available | 0 |
| BAMO at SemEval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common Sense | Jun 7, 2024 | Common Sense ReasoningSentence | CodeCode Available | 0 |
| Do Language Models Exhibit Human-like Structural Priming Effects? | Jun 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Creating an AI Observer: Generative Semantic Workspaces | Jun 7, 2024 | Sentence | —Unverified | 0 |
| Proofread: Fixes All Errors with One Tap | Jun 6, 2024 | AllQuantization | —Unverified | 0 |
| Characterizing Similarities and Divergences in Conversational Tones in Humans and LLMs by Sampling with People | Jun 6, 2024 | Sentence | CodeCode Available | 0 |
| Synthesizing Conversations from Unlabeled Documents using Automatic Response Segmentation | Jun 6, 2024 | Conversational Question AnsweringQuestion Answering | —Unverified | 0 |
| Recovering document annotations for sentence-level bitext | Jun 6, 2024 | Machine TranslationSentence | —Unverified | 0 |
| HeSum: a Novel Dataset for Abstractive Text Summarization in Hebrew | Jun 6, 2024 | Abstractive Text SummarizationSentence | CodeCode Available | 0 |
| Advancing Anomaly Detection: Non-Semantic Financial Data Encoding with LLMs | Jun 5, 2024 | Anomaly DetectionSentence | —Unverified | 0 |
| Readability-guided Idiom-aware Sentence Simplification (RISS) for Chinese | Jun 5, 2024 | Data AugmentationMulti-Task Learning | —Unverified | 0 |
| Space Decomposition for Sentence Embedding | Jun 5, 2024 | Semantic Textual SimilaritySentence | CodeCode Available | 0 |
| Document-level Claim Extraction and Decontextualisation for Fact-Checking | Jun 5, 2024 | Extractive SummarizationFact Checking | CodeCode Available | 1 |
| LLM-based Rewriting of Inappropriate Argumentation using Reinforcement Learning from Machine Feedback | Jun 5, 2024 | Few-Shot LearningLanguage Modeling | CodeCode Available | 0 |
| OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors Detection | Jun 4, 2024 | HallucinationMachine Translation | CodeCode Available | 0 |
| Robust Interaction-Based Relevance Modeling for Online e-Commerce Search | Jun 4, 2024 | RetrievalSentence | CodeCode Available | 0 |
| CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks | Jun 4, 2024 | Document SummarizationSentence | CodeCode Available | 1 |
| How Do Neural Spoofing Countermeasures Detect Partially Spoofed Audio? | Jun 4, 2024 | Decision MakingSentence | —Unverified | 0 |