SOTAVerified

World Knowledge

Papers

Showing 101150 of 818 papers

TitleStatusHype
Blow the Dog Whistle: A Chinese Dataset for Cant Understanding with Common Sense and World KnowledgeCode1
LLaRA: Large Language-Recommendation AssistantCode1
BLADE: Benchmarking Language Model Agents for Data-Driven ScienceCode1
Bring Your Own KG: Self-Supervised Program Synthesis for Zero-Shot KGQACode1
Language Guided Visual Question Answering: Elevate Your Multimodal Language Model Using Knowledge-Enriched PromptsCode1
Machine Translation Meta Evaluation through Translation Accuracy Challenge SetsCode1
Language Models as Knowledge Bases: On Entity Representations, Storage Capacity, and Paraphrased QueriesCode1
Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLUCode1
Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model CollaborationCode1
An Automatic Graph Construction Framework based on Large Language Models for RecommendationCode1
KoLA: Carefully Benchmarking World Knowledge of Large Language ModelsCode1
Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and LayersCode1
O^2-Searcher: A Searching-based Agent Model for Open-Domain Open-Ended Question AnsweringCode1
Can LLMs' Tuning Methods Work in Medical Multimodal Domain?Code1
Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training ModelCode1
ASER: A Large-scale Eventuality Knowledge GraphCode1
Large Scale Knowledge WashingCode1
Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language ModelsCode1
Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge GeneratorsCode1
OpenMix: Exploring Outlier Samples for Misclassification DetectionCode1
Chain-of-Skills: A Configurable Model for Open-domain Question AnsweringCode1
PAC-Bayesian Generalization Bounds for Knowledge Graph Representation LearningCode1
Beyond Embeddings: The Promise of Visual Table in Visual ReasoningCode1
LEARN: Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial ApplicationCode1
Better Together: Enhancing Generative Knowledge Graph Completion with Language Models and Neighborhood InformationCode1
Analyzing Knowledge Graph Embedding Methods from a Multi-Embedding Interaction PerspectiveCode1
Knowledge Editing through Chain-of-ThoughtCode1
Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval AugmentationCode1
Integrating Action Knowledge and LLMs for Task Planning and Situation Handling in Open WorldsCode1
Is ChatGPT a Good Recommender? A Preliminary StudyCode1
BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language ModelsCode1
Counterfactual reasoning: Do language models need world knowledge for causal understanding?Code1
KELM: Knowledge Enhanced Pre-Trained Language Representations with Message Passing on Hierarchical Relational GraphsCode1
Knowledge Graph Contrastive Learning for RecommendationCode1
Large-Scale Relation Learning for Question Answering over Knowledge Bases with Pre-trained Language ModelsCode1
I Don't Know: Explicit Modeling of Uncertainty with an [IDK] TokenCode1
Imagine This! Scripts to Compositions to VideosCode1
How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent AdvancesCode1
Infusing Disease Knowledge into BERT for Health Question Answering, Medical Inference and Disease Name RecognitionCode1
Head-to-Tail: How Knowledgeable are Large Language Models (LLMs)? A.K.A. Will LLMs Replace Knowledge Graphs?Code1
GRE Suite: Geo-localization Inference via Fine-Tuned Vision-Language Models and Enhanced Reasoning ChainsCode1
HeadlineCause: A Dataset of News Headlines for Detecting CausalitiesCode1
ACES: Translation Accuracy Challenge Sets for Evaluating Machine Translation MetricsCode1
CommonsenseQA: A Question Answering Challenge Targeting Commonsense KnowledgeCode1
Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained TransformersCode1
Common Sense Enhanced Knowledge-based Recommendation with Large Language ModelCode1
A User-Centric Multi-Intent Benchmark for Evaluating Large Language ModelsCode1
GRILLBot In Practice: Lessons and Tradeoffs Deploying Large Language Models for Adaptable Conversational Task AssistantsCode1
InGram: Inductive Knowledge Graph Embedding via Relation GraphsCode1
A Unified Encoder-Decoder Framework with Entity MemoryCode1
Show:102550
← PrevPage 3 of 17Next →

No leaderboard results yet.