| The Knowref Coreference Corpus: Removing Gender and Number Cues for Difficult Pronominal Anaphora Resolution | Nov 2, 2018 | Common Sense Reasoningcoreference-resolution | CodeCode Available | 0 |
| LoFTI: Localization and Factuality Transfer to Indian Locales | Jul 16, 2024 | World Knowledge | CodeCode Available | 0 |
| A Study of Implicit Ranking Unfairness in Large Language Models | Nov 13, 2023 | Data AugmentationFairness | CodeCode Available | 0 |
| Locating and Extracting Relational Concepts in Large Language Models | Jun 19, 2024 | World Knowledge | CodeCode Available | 0 |
| A surprisal oracle for when every layer counts | Dec 4, 2024 | Common Sense ReasoningLanguage Modeling | CodeCode Available | 0 |
| MIRAGE: A Benchmark for Multimodal Information-Seeking and Reasoning in Agricultural Expert-Guided Conversations | Jun 25, 2025 | World Knowledge | CodeCode Available | 0 |
| MiRANews: Dataset and Benchmarks for Multi-Resource-Assisted News Summarization | Sep 22, 2021 | ArticlesDocument Summarization | CodeCode Available | 0 |
| Localizing Active Objects from Egocentric Vision with Symbolic World Knowledge | Oct 23, 2023 | Phrase GroundingWorld Knowledge | CodeCode Available | 0 |
| Mitigating Hallucination in Fictional Character Role-Play | Jun 25, 2024 | HallucinationWorld Knowledge | CodeCode Available | 0 |
| LLM-based Agent Simulation for Maternal Health Interventions: Uncertainty Estimation and Decision-focused Evaluation | Mar 25, 2025 | counterfactualDecision Making | CodeCode Available | 0 |