SOTAVerified

World Knowledge

Papers

Showing 751800 of 818 papers

TitleStatusHype
ShareGPT4V: Improving Large Multi-Modal Models with Better CaptionsCode0
ExPUNations: Augmenting Puns with Keywords and ExplanationsCode0
Temporal Fact Reasoning over Hyper-Relational Knowledge GraphsCode0
Investigating associative, switchable and negatable Winograd items on renewed French data setsCode0
SocialVec: Social Entity EmbeddingsCode0
Pioneering Reliable Assessment in Text-to-Image Knowledge Editing: Leveraging a Fine-Grained Dataset and an Innovative CriterionCode0
PK-Chat: Pointer Network Guided Knowledge Driven Generative Dialogue ModelCode0
Intrinsic Knowledge Evaluation on Chinese Language ModelsCode0
ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image RetrievalCode0
Interweaving Memories of a Siamese Large Language ModelCode0
Improving Neural Story Generation by Targeted Common Sense GroundingCode0
Augmenting Neural Networks with First-order LogicCode0
Explain Yourself! Leveraging Language Models for Commonsense ReasoningCode0
Prepositions Matter in Quantifier Scope DisambiguationCode0
Tox-BART: Leveraging Toxicity Attributes for Explanation Generation of Implicit Hate SpeechCode0
ImpliRet: Benchmarking the Implicit Fact Retrieval ChallengeCode0
Causal interventions expose implicit situation models for commonsense language understandingCode0
Implicit Affordance Acquisition via Causal Action-Effect Modeling in the Video DomainCode0
EventNarrative: A large-scale Event-centric Dataset for Knowledge Graph-to-Text GenerationCode0
Image Captioning for Effective Use of Language Models in Knowledge-Based Visual Question AnsweringCode0
Stance Reasoner: Zero-Shot Stance Detection on Social Media with Explicit ReasoningCode0
Align Beyond Prompts: Evaluating World Knowledge Alignment in Text-to-Image GenerationCode0
Event knowledge in large language models: the gap between the impossible and the unlikelyCode0
Style Outweighs Substance: Failure Modes of LLM Judges in Alignment BenchmarkingCode0
Investigating Prior Knowledge for Challenging Chinese Machine Reading ComprehensionCode0
Probing Simile Knowledge from Pre-trained Language ModelsCode0
Probing the Geometry of Truth: Consistency and Generalization of Truth Directions in LLMs Across Logical Transformations and Question Answering TasksCode0
EventGround: Narrative Reasoning by Grounding to Eventuality-centric Knowledge GraphsCode0
Evaluating Methods for Extraction of Aspect Terms in Opinion Texts in Portuguese - the Challenges of Implicit AspectsCode0
Evaluating Contrastive Feedback for Effective User SimulationsCode0
Image2tweet: Datasets in Hindi and English for Generating Tweets from ImagesCode0
ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for Language Understanding and GenerationCode0
Anchoring Path for Inductive Relation Prediction in Knowledge GraphsCode0
Word Order and World KnowledgeCode0
Tackling scalability issues in mining path patterns from knowledge graphs: a preliminary studyCode0
Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and TextCode0
Enhancing Content-based Recommendation via Large Language ModelCode0
How Decoding Strategies Affect the Verifiability of Generated TextCode0
Eliciting and Understanding Cross-Task Skills with Task-Level Mixture-of-ExpertsCode0
TeamOtter at SemEval-2022 Task 5: Detecting Misogynistic Content in Multimodal MemesCode0
QUENCH: Measuring the gap between Indic and Non-Indic Contextual General Reasoning in LLMsCode0
Hierarchy-based Image Embeddings for Semantic Image RetrievalCode0
TegTok: Augmenting Text Generation via Task-specific and Open-world KnowledgeCode0
A Systematic Analysis of Large Language Models as Soft Reasoners: The Case of Syllogistic InferencesCode0
Hate is the New Infodemic: A Topic-aware Modeling of Hate Speech Diffusion on TwitterCode0
Bravo MaRDI: A Wikibase Powered Knowledge Graph on MathematicsCode0
World Knowledge in Multiple Choice Reading ComprehensionCode0
AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World KnowledgeCode0
Test-time Augmentation for Factual ProbingCode0
An Empirical Study on Few-shot Knowledge Probing for Pretrained Language ModelsCode0
Show:102550
← PrevPage 16 of 17Next →

No leaderboard results yet.