| ShareGPT4V: Improving Large Multi-Modal Models with Better Captions | Nov 21, 2023 | DescriptiveMME | CodeCode Available | 0 |
| ExPUNations: Augmenting Puns with Keywords and Explanations | Oct 24, 2022 | Explanation GenerationNatural Language Understanding | CodeCode Available | 0 |
| Temporal Fact Reasoning over Hyper-Relational Knowledge Graphs | Jul 14, 2023 | Knowledge GraphsLink Prediction | CodeCode Available | 0 |
| Investigating associative, switchable and negatable Winograd items on renewed French data sets | Jun 1, 2022 | NegationWorld Knowledge | CodeCode Available | 0 |
| SocialVec: Social Entity Embeddings | Nov 5, 2021 | Entity EmbeddingsWord Embeddings | CodeCode Available | 0 |
| Pioneering Reliable Assessment in Text-to-Image Knowledge Editing: Leveraging a Fine-Grained Dataset and an Innovative Criterion | Sep 26, 2024 | Image GenerationIn-Context Learning | CodeCode Available | 0 |
| PK-Chat: Pointer Network Guided Knowledge Driven Generative Dialogue Model | Apr 2, 2023 | Knowledge GraphsLanguage Modeling | CodeCode Available | 0 |
| Intrinsic Knowledge Evaluation on Chinese Language Models | Nov 29, 2020 | World Knowledge | CodeCode Available | 0 |
| ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval | Oct 24, 2024 | Image RetrievalRetrieval | CodeCode Available | 0 |
| Interweaving Memories of a Siamese Large Language Model | Dec 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Improving Neural Story Generation by Targeted Common Sense Grounding | Aug 26, 2019 | Common Sense ReasoningMulti-Task Learning | CodeCode Available | 0 |
| Augmenting Neural Networks with First-order Logic | Jun 14, 2019 | ChunkingNatural Language Inference | CodeCode Available | 0 |
| Explain Yourself! Leveraging Language Models for Commonsense Reasoning | Jun 6, 2019 | Common Sense ReasoningForm | CodeCode Available | 0 |
| Prepositions Matter in Quantifier Scope Disambiguation | Oct 1, 2022 | World Knowledge | CodeCode Available | 0 |
| Tox-BART: Leveraging Toxicity Attributes for Explanation Generation of Implicit Hate Speech | Jun 6, 2024 | Explanation GenerationWorld Knowledge | CodeCode Available | 0 |
| ImpliRet: Benchmarking the Implicit Fact Retrieval Challenge | Jun 17, 2025 | BenchmarkingRetrieval | CodeCode Available | 0 |
| Causal interventions expose implicit situation models for commonsense language understanding | Jun 6, 2023 | World Knowledge | CodeCode Available | 0 |
| Implicit Affordance Acquisition via Causal Action-Effect Modeling in the Video Domain | Dec 18, 2023 | World Knowledge | CodeCode Available | 0 |
| EventNarrative: A large-scale Event-centric Dataset for Knowledge Graph-to-Text Generation | Oct 30, 2021 | KG-to-Text GenerationKnowledge Graphs | CodeCode Available | 0 |
| Image Captioning for Effective Use of Language Models in Knowledge-Based Visual Question Answering | Sep 15, 2021 | Image CaptioningKnowledge Graphs | CodeCode Available | 0 |
| Stance Reasoner: Zero-Shot Stance Detection on Social Media with Explicit Reasoning | Mar 22, 2024 | Few-Shot Stance DetectionIn-Context Learning | CodeCode Available | 0 |
| Align Beyond Prompts: Evaluating World Knowledge Alignment in Text-to-Image Generation | May 24, 2025 | Image GenerationText to Image Generation | CodeCode Available | 0 |
| Event knowledge in large language models: the gap between the impossible and the unlikely | Dec 2, 2022 | SentenceWorld Knowledge | CodeCode Available | 0 |
| Style Outweighs Substance: Failure Modes of LLM Judges in Alignment Benchmarking | Sep 23, 2024 | BenchmarkingDiversity | CodeCode Available | 0 |
| Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension | Apr 21, 2019 | Data AugmentationLanguage Modelling | CodeCode Available | 0 |
| Probing Simile Knowledge from Pre-trained Language Models | Apr 27, 2022 | DiversityLanguage Modelling | CodeCode Available | 0 |
| Probing the Geometry of Truth: Consistency and Generalization of Truth Directions in LLMs Across Logical Transformations and Question Answering Tasks | Jun 1, 2025 | In-Context LearningNegation | CodeCode Available | 0 |
| EventGround: Narrative Reasoning by Grounding to Eventuality-centric Knowledge Graphs | Mar 30, 2024 | Graph Neural NetworkKnowledge Graphs | CodeCode Available | 0 |
| Evaluating Methods for Extraction of Aspect Terms in Opinion Texts in Portuguese - the Challenges of Implicit Aspects | Jun 1, 2022 | Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA) | CodeCode Available | 0 |
| Evaluating Contrastive Feedback for Effective User Simulations | May 5, 2025 | Information RetrievalPrompt Engineering | CodeCode Available | 0 |
| Image2tweet: Datasets in Hindi and English for Generating Tweets from Images | Dec 1, 2021 | Image CaptioningWorld Knowledge | CodeCode Available | 0 |
| ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for Language Understanding and Generation | Jul 5, 2021 | Few-Shot LearningNatural Language Understanding | CodeCode Available | 0 |
| Anchoring Path for Inductive Relation Prediction in Knowledge Graphs | Dec 21, 2023 | Inductive Relation PredictionKnowledge Graphs | CodeCode Available | 0 |
| Word Order and World Knowledge | Mar 1, 2024 | World Knowledge | CodeCode Available | 0 |
| Tackling scalability issues in mining path patterns from knowledge graphs: a preliminary study | Jul 17, 2020 | Fact CheckingKnowledge Graphs | CodeCode Available | 0 |
| Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text | Dec 1, 2021 | World Knowledge | CodeCode Available | 0 |
| Enhancing Content-based Recommendation via Large Language Model | Mar 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| How Decoding Strategies Affect the Verifiability of Generated Text | Nov 9, 2019 | Language ModellingNatural Language Understanding | CodeCode Available | 0 |
| Eliciting and Understanding Cross-Task Skills with Task-Level Mixture-of-Experts | May 25, 2022 | Mixture-of-ExpertsMulti-Task Learning | CodeCode Available | 0 |
| TeamOtter at SemEval-2022 Task 5: Detecting Misogynistic Content in Multimodal Memes | Jul 1, 2022 | World Knowledge | CodeCode Available | 0 |
| QUENCH: Measuring the gap between Indic and Non-Indic Contextual General Reasoning in LLMs | Dec 16, 2024 | BenchmarkingCommon Sense Reasoning | CodeCode Available | 0 |
| Hierarchy-based Image Embeddings for Semantic Image Retrieval | Sep 26, 2018 | Few-Shot LearningImage Retrieval | CodeCode Available | 0 |
| TegTok: Augmenting Text Generation via Task-specific and Open-world Knowledge | Mar 16, 2022 | Dialogue GenerationKnowledge Graphs | CodeCode Available | 0 |
| A Systematic Analysis of Large Language Models as Soft Reasoners: The Case of Syllogistic Inferences | Jun 17, 2024 | In-Context Learningvalid | CodeCode Available | 0 |
| Hate is the New Infodemic: A Topic-aware Modeling of Hate Speech Diffusion on Twitter | Oct 9, 2020 | ArticlesWorld Knowledge | CodeCode Available | 0 |
| Bravo MaRDI: A Wikibase Powered Knowledge Graph on Mathematics | Sep 20, 2023 | World Knowledge | CodeCode Available | 0 |
| World Knowledge in Multiple Choice Reading Comprehension | Nov 13, 2022 | General KnowledgeMultiple-choice | CodeCode Available | 0 |
| AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge | Dec 18, 2024 | BenchmarkingWorld Knowledge | CodeCode Available | 0 |
| Test-time Augmentation for Factual Probing | Oct 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| An Empirical Study on Few-shot Knowledge Probing for Pretrained Language Models | Sep 6, 2021 | Knowledge ProbingPrompt Engineering | CodeCode Available | 0 |