| Language Models Fail to Introspect About Their Knowledge of Language | Mar 10, 2025 | Sentence | CodeCode Available | 0 |
| Topology of Syntax Networks across Languages | Mar 9, 2025 | Sentence | —Unverified | 0 |
| Leveraging Semantic Type Dependencies for Clinical Named Entity Recognition | Mar 7, 2025 | Clinical Knowledgenamed-entity-recognition | —Unverified | 0 |
| SINdex: Semantic INconsistency Index for Hallucination Detection in LLMs | Mar 7, 2025 | ClusteringHallucination | —Unverified | 0 |
| DiffPO: Diffusion-styled Preference Optimization for Efficient Inference-Time Alignment of Large Language Models | Mar 6, 2025 | Sentence | —Unverified | 0 |
| DongbaMIE: A Multimodal Information Extraction Dataset for Evaluating Semantic Understanding of Dongba Pictograms | Mar 5, 2025 | Sentence | CodeCode Available | 0 |
| The Box is in the Pen: Evaluating Commonsense Reasoning in Neural Machine Translation | Mar 5, 2025 | Common Sense ReasoningMachine Translation | CodeCode Available | 0 |
| Hierarchical Re-ranker Retriever (HRR) | Mar 4, 2025 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| Tailoring Table Retrieval from a Field-aware Hybrid Matching Perspective | Mar 4, 2025 | RetrievalSentence | —Unverified | 0 |
| Multilingual Relative Clause Attachment Ambiguity Resolution in Large Language Models | Mar 4, 2025 | Sentence | CodeCode Available | 0 |