| GrowOVER: How Can LLMs Adapt to Growing Real-World Knowledge? | Jun 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| DREAM: A Challenge Dataset and Models for Dialogue-Based Reading Comprehension | Feb 1, 2019 | Dialogue UnderstandingMultiple-choice | CodeCode Available | 0 |
| Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection | May 18, 2025 | MemorizationWorld Knowledge | CodeCode Available | 0 |
| DORA The Explorer: Directed Outreaching Reinforcement Action-Selection | Apr 11, 2018 | Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| RetinaQA: A Robust Knowledge Base Question Answering Model for both Answerable and Unanswerable Questions | Mar 16, 2024 | Knowledge Base Question AnsweringQuestion Answering | CodeCode Available | 0 |
| Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data | Jan 31, 2024 | BenchmarkingChange Detection | CodeCode Available | 0 |
| Retrieval-Augmented Language Model for Extreme Multi-Label Knowledge Graph Link Prediction | May 21, 2024 | HallucinationLanguage Modeling | CodeCode Available | 0 |
| The Effect of Masking Strategies on Knowledge Retention by Language Models | Jun 12, 2023 | Information RetrievalQuestion Answering | CodeCode Available | 0 |