SOTAVerified

World Knowledge

Papers

Showing 76100 of 818 papers

TitleStatusHype
Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language ModelsCode1
Elements of World Knowledge (EWOK): A cognition-inspired framework for evaluating basic world knowledge in language modelsCode1
Aging with GRACE: Lifelong Model Editing with Discrete Key-Value AdaptorsCode1
Integrating Action Knowledge and LLMs for Task Planning and Situation Handling in Open WorldsCode1
Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval AugmentationCode1
LEARN: Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial ApplicationCode1
Imagine This! Scripts to Compositions to VideosCode1
AgentMove: Predicting Human Mobility Anywhere Using Large Language Model based Agentic FrameworkCode1
Infusing Disease Knowledge into BERT for Health Question Answering, Medical Inference and Disease Name RecognitionCode1
InGram: Inductive Knowledge Graph Embedding via Relation GraphsCode1
Breaking NLI Systems with Sentences that Require Simple Lexical InferencesCode1
A-OKVQA: A Benchmark for Visual Question Answering using World KnowledgeCode1
Head-to-Tail: How Knowledgeable are Large Language Models (LLMs)? A.K.A. Will LLMs Replace Knowledge Graphs?Code1
How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent AdvancesCode1
I Don't Know: Explicit Modeling of Uncertainty with an [IDK] TokenCode1
Hallucinated but Factual! Inspecting the Factuality of Hallucinations in Abstractive SummarizationCode1
Knowledge Editing through Chain-of-ThoughtCode1
Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model BiasCode1
Blow the Dog Whistle: A Chinese Dataset for Cant Understanding with Common Sense and World KnowledgeCode1
A Comprehensive Evaluation of GPT-4V on Knowledge-Intensive Visual Question AnsweringCode1
Diversify and Conquer: Diversity-Centric Data Selection with Iterative RefinementCode1
BLADE: Benchmarking Language Model Agents for Data-Driven ScienceCode1
Counterfactual reasoning: Testing language models' understanding of hypothetical scenariosCode1
Cryptonite: A Cryptic Crossword Benchmark for Extreme Ambiguity in LanguageCode1
CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language ModelsCode1
Show:102550
← PrevPage 4 of 33Next →

No leaderboard results yet.