| More Discriminative Sentence Embeddings via Semantic Graph Smoothing | Feb 20, 2024 | Clustering, Sentence | Code Available | 0 |
| TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization | Feb 20, 2024 | Hallucination, News Summarization | Code Available | 1 |
| Asynchronous and Segmented Bidirectional Encoding for NMT | Feb 19, 2024 | Machine Translation, NMT | Unverified | 0 |
| Do Pre-Trained Language Models Detect and Understand Semantic Underspecification? Ask the DUST! | Feb 19, 2024 | Sentence | Code Available | 0 |
| Machine-Generated Text Localization | Feb 19, 2024 | Binary Classification, Misinformation | Code Available | 1 |
| NEO-BENCH: Evaluating Robustness of Large Language Models with Neologisms | Feb 19, 2024 | Machine Translation, Natural Language Understanding | Code Available | 0 |
| Ontology Enhanced Claim Detection | Feb 19, 2024 | Sentence, Sentence Embeddings | Unverified | 0 |
| Shaping Human-AI Collaboration: Varied Scaffolding Levels in Co-writing with Language Models | Feb 18, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Can Deception Detection Go Deeper? Dataset, Evaluation, and Benchmark for Deception Reasoning | Feb 18, 2024 | Deception Detection, Sentence | Unverified | 0 |
| Syntactic Language Change in English and German: Metrics, Parsers, and Convergences | Feb 18, 2024 | Optical Character Recognition (OCR), Sentence | Code Available | 0 |