| A Surprising Failure? Multimodal LLMs and the NLVR Challenge | Feb 26, 2024 | SentenceSpatial Reasoning | —Unverified | 0 |
| Cross-domain Chinese Sentence Pattern Parsing | Feb 26, 2024 | Sentence | —Unverified | 0 |
| Likelihood-based Mitigation of Evaluation Bias in Large Language Models | Feb 25, 2024 | Grammatical Error CorrectionIn-Context Learning | CodeCode Available | 0 |
| Hitting "Probe"rty with Non-Linearity, and More | Feb 25, 2024 | Sentence | —Unverified | 0 |
| GPT-HateCheck: Can LLMs Write Better Functional Tests for Hate Speech Detection? | Feb 23, 2024 | DiagnosticHate Speech Detection | CodeCode Available | 0 |
| Self-Adaptive Reconstruction with Contrastive Learning for Unsupervised Sentence Embeddings | Feb 23, 2024 | Contrastive LearningSentence | —Unverified | 0 |
| DeMPT: Decoding-enhanced Multi-phase Prompt Tuning for Making LLMs Be Better Context-aware Translators | Feb 23, 2024 | DecoderMachine Translation | CodeCode Available | 0 |
| Bias and Volatility: A Statistical Framework for Evaluating Large Language Model's Stereotypes and the Associated Generation Inconsistency | Feb 23, 2024 | AttributeSentence | —Unverified | 0 |
| Improving Sentence Embeddings with Automatic Generation of Training Data Using Few-shot Examples | Feb 23, 2024 | Dataset GenerationDecoder | CodeCode Available | 0 |
| Noise-BERT: A Unified Perturbation-Robust Framework with Noise Alignment Pre-training for Noisy Slot Filling Task | Feb 22, 2024 | Adversarial AttackContrastive Learning | —Unverified | 0 |
| GATE X-E : A Challenge Set for Gender-Fair Translations from Weakly-Gendered Languages | Feb 22, 2024 | Machine TranslationNMT | —Unverified | 0 |
| Can You Learn Semantics Through Next-Word Prediction? The Case of Entailment | Feb 21, 2024 | Sentence | CodeCode Available | 0 |
| Unsupervised Text Style Transfer via LLMs and Attention Masking with Multi-way Interactions | Feb 21, 2024 | In-Context LearningKnowledge Distillation | —Unverified | 0 |
| Overview of the VLSP 2023 -- ComOM Shared Task: A Data Challenge for Comparative Opinion Mining from Vietnamese Product Reviews | Feb 21, 2024 | Opinion MiningSentence | —Unverified | 0 |
| Are ELECTRA's Sentence Embeddings Beyond Repair? The Case of Semantic Textual Similarity | Feb 20, 2024 | Semantic Textual SimilaritySentence | CodeCode Available | 0 |
| ChatEL: Entity Linking with Chatbots | Feb 20, 2024 | Entity LinkingSentence | CodeCode Available | 0 |
| Simpson's Paradox and the Accuracy-Fluency Tradeoff in Translation | Feb 20, 2024 | SentenceTranslation | —Unverified | 0 |
| UMBCLU at SemEval-2024 Task 1A and 1C: Semantic Textual Relatedness with and without machine translation | Feb 20, 2024 | Machine TranslationNatural Language Understanding | CodeCode Available | 0 |
| More Discriminative Sentence Embeddings via Semantic Graph Smoothing | Feb 20, 2024 | ClusteringSentence | CodeCode Available | 0 |
| Explaining Relationships Among Research Papers | Feb 20, 2024 | Sentence | —Unverified | 0 |
| Do Pre-Trained Language Models Detect and Understand Semantic Underspecification? Ask the DUST! | Feb 19, 2024 | Sentence | CodeCode Available | 0 |
| Ontology Enhanced Claim Detection | Feb 19, 2024 | SentenceSentence Embeddings | —Unverified | 0 |
| NEO-BENCH: Evaluating Robustness of Large Language Models with Neologisms | Feb 19, 2024 | Machine TranslationNatural Language Understanding | CodeCode Available | 0 |
| Asynchronous and Segmented Bidirectional Encoding for NMT | Feb 19, 2024 | Machine TranslationNMT | —Unverified | 0 |
| Syntactic Language Change in English and German: Metrics, Parsers, and Convergences | Feb 18, 2024 | Optical Character Recognition (OCR)Sentence | CodeCode Available | 0 |