| PropNet: a White-Box and Human-Like Network for Sentence Representation | Feb 15, 2025 | Semantic Textual SimilaritySentence | —Unverified | 0 |
| Rethinking Evaluation Metrics for Grammatical Error Correction: Why Use a Different Evaluation Process than Human? | Feb 13, 2025 | Grammatical Error CorrectionSentence | CodeCode Available | 1 |
| When the LM misunderstood the human chuckled: Analyzing garden path effects in humans and language models | Feb 13, 2025 | Image GenerationSentence | —Unverified | 0 |
| SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models | Feb 13, 2025 | Long Form Question AnsweringQuestion Answering | CodeCode Available | 0 |
| ParetoRAG: Leveraging Sentence-Context Attention for Robust and Efficient Retrieval-Augmented Generation | Feb 12, 2025 | RAGRetrieval | —Unverified | 0 |
| Examining Multilingual Embedding Models Cross-Lingually Through LLM-Generated Adversarial Examples | Feb 12, 2025 | Distractor GenerationInformation Retrieval | —Unverified | 0 |
| Lexical Manifold Reconfiguration in Large Language Models: A Novel Architectural Approach for Contextual Modulation | Feb 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Making Language Models Robust Against Negation | Feb 11, 2025 | Natural Language UnderstandingNegation | —Unverified | 0 |
| A Large-Scale Benchmark for Vietnamese Sentence Paraphrases | Feb 11, 2025 | Paraphrase GenerationSentence | CodeCode Available | 0 |
| Perceived Confidence Scoring for Data Annotation with Zero-Shot LLMs | Feb 11, 2025 | Sentence | —Unverified | 0 |