| A Surprising Failure? Multimodal LLMs and the NLVR Challenge | Feb 26, 2024 | Sentence, Spatial Reasoning | Unverified | 0 |
| Cross-domain Chinese Sentence Pattern Parsing | Feb 26, 2024 | Sentence | Unverified | 0 |
| Hitting "Probe"rty with Non-Linearity, and More | Feb 25, 2024 | Sentence | Unverified | 0 |
| Likelihood-based Mitigation of Evaluation Bias in Large Language Models | Feb 25, 2024 | Grammatical Error Correction, In-Context Learning | Code Available | 0 |
| Improving Sentence Embeddings with Automatic Generation of Training Data Using Few-shot Examples | Feb 23, 2024 | Dataset Generation, Decoder | Code Available | 0 |
| GPT-HateCheck: Can LLMs Write Better Functional Tests for Hate Speech Detection? | Feb 23, 2024 | Diagnostic, Hate Speech Detection | Code Available | 0 |
| Self-Adaptive Reconstruction with Contrastive Learning for Unsupervised Sentence Embeddings | Feb 23, 2024 | Contrastive Learning, Sentence | Unverified | 0 |
| Bias and Volatility: A Statistical Framework for Evaluating Large Language Model's Stereotypes and the Associated Generation Inconsistency | Feb 23, 2024 | Attribute, Sentence | Unverified | 0 |
| DeMPT: Decoding-enhanced Multi-phase Prompt Tuning for Making LLMs Be Better Context-aware Translators | Feb 23, 2024 | Decoder, Machine Translation | Code Available | 0 |
| Noise-BERT: A Unified Perturbation-Robust Framework with Noise Alignment Pre-training for Noisy Slot Filling Task | Feb 22, 2024 | Adversarial Attack, Contrastive Learning | Unverified | 0 |