World Knowledge

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 351–400 of 818 papers

Title	Date	Tasks	Status
All Entities are Not Created Equal: Examining the Long Tail for Fine-Grained Entity Typing	Oct 22, 2024	AllEntity Typing	—Unverified
Rulebreakers Challenge: Revealing a Blind Spot in Large Language Models' Reasoning with Formal Logic	Oct 21, 2024	Formal LogicWorld Knowledge	—Unverified
Roadmap towards Superhuman Speech Understanding using Large Language Models	Oct 17, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Understanding the Role of LLMs in Multimodal Evaluation Benchmarks	Oct 16, 2024	BenchmarkingLarge Language Model	CodeCode Available
Comprehending Knowledge Graphs with Large Language Models for Recommender Systems	Oct 16, 2024	Knowledge-Aware RecommendationKnowledge Graphs	—Unverified
KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities	Oct 15, 2024	Image GenerationRetrieval	—Unverified
DyVo: Dynamic Vocabularies for Learned Sparse Retrieval with Entities	Oct 10, 2024	Document RankingEntity Embeddings	CodeCode Available
TVBench: Redesigning Video-Language Evaluation	Oct 10, 2024	Multiple-choiceOpen-Ended Question Answering	—Unverified
Which Programming Language and What Features at Pre-training Stage Affect Downstream Logical Inference Performance?	Oct 9, 2024	In-Context LearningLogical Reasoning	CodeCode Available
SEAL: SEmantic-Augmented Imitation Learning via Language Model	Oct 3, 2024	Decision MakingImitation Learning	—Unverified
Intent Detection in the Age of LLMs	Oct 2, 2024	Data AugmentationIn-Context Learning	—Unverified
"Oh LLM, I'm Asking Thee, Please Give Me a Decision Tree": Zero-Shot Decision Tree Induction and Embedding with Large Language Models	Sep 27, 2024	Interpretable Machine LearningWorld Knowledge	—Unverified
"Why" Has the Least Side Effect on Model Editing	Sep 27, 2024	Experimental Designknowledge editing	—Unverified
Pioneering Reliable Assessment in Text-to-Image Knowledge Editing: Leveraging a Fine-Grained Dataset and an Innovative Criterion	Sep 26, 2024	Image GenerationIn-Context Learning	CodeCode Available
60 Data Points are Sufficient to Fine-Tune LLMs for Question-Answering	Sep 24, 2024	Question AnsweringWorld Knowledge	—Unverified
Style Outweighs Substance: Failure Modes of LLM Judges in Alignment Benchmarking	Sep 23, 2024	BenchmarkingDiversity	CodeCode Available
The X Types -- Mapping the Semantics of the Twitter Sphere	Sep 22, 2024	Type predictionWorld Knowledge	—Unverified
Can-Do! A Dataset and Neuro-Symbolic Grounded Framework for Embodied Planning with Large Multimodal Models	Sep 22, 2024	World Knowledge	—Unverified
Relevance-driven Decision Making for Safer and More Efficient Human Robot Collaboration	Sep 21, 2024	Collision AvoidanceDecision Making	—Unverified
Time Awareness in Large Language Models: Benchmarking Fact Recall Across Time	Sep 20, 2024	BenchmarkingWorld Knowledge	—Unverified
Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark	Sep 13, 2024	Sequential Decision MakingWorld Knowledge	—Unverified
Multimodal Large Language Model Driven Scenario Testing for Autonomous Vehicles	Sep 10, 2024	Autonomous VehiclesLanguage Modeling	—Unverified
How Does Code Pretraining Affect Language Model Task Performance?	Sep 6, 2024	Language ModelingLanguage Modelling	—Unverified
Physical Rule-Guided Convolutional Neural Network	Sep 3, 2024	World Knowledge	—Unverified
CV-Probes: Studying the interplay of lexical and world knowledge in visually grounded verb understanding	Sep 2, 2024	World Knowledge	—Unverified
Novel-WD: Exploring acquisition of Novel World Knowledge in LLMs Using Prefix-Tuning	Aug 30, 2024	Causal Language ModelingContinual Learning	—Unverified
Zero-Shot Visual Reasoning by Vision-Language Models: Benchmarking and Analysis	Aug 27, 2024	BenchmarkingLarge Language Model	—Unverified
Exploring the Potential of Large Language Models for Heterophilic Graphs	Aug 26, 2024	Node ClassificationWorld Knowledge	—Unverified
To Code, or Not To Code? Exploring Impact of Code in Pre-training	Aug 20, 2024	Code GenerationWorld Knowledge	—Unverified
Efficient and Deployable Knowledge Infusion for Open-World Recommendations via Large Language Models	Aug 20, 2024	Music RecommendationRecommendation Systems	—Unverified
CoRA: Collaborative Information Perception by Large Language Model's Weights for Recommendation	Aug 20, 2024	Collaborative FilteringGeneral Knowledge	—Unverified
CoDi: Conversational Distillation for Grounded Question Answering	Aug 20, 2024	Question AnsweringWorld Knowledge	—Unverified
On the Necessity of World Knowledge for Mitigating Missing Labels in Extreme Classification	Aug 18, 2024	ImputationMissing Labels	CodeCode Available
A Mechanistic Interpretation of Syllogistic Reasoning in Auto-Regressive Language Models	Aug 16, 2024	Logical Reasoningvalid	—Unverified
Prompt Tuning as User Inherent Profile Inference Machine	Aug 13, 2024	QuantizationRecommendation Systems	—Unverified
MAQA: Evaluating Uncertainty Quantification in LLMs Regarding Data Uncertainty	Aug 13, 2024	Mathematical ReasoningQuestion Answering	CodeCode Available
LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description	Aug 9, 2024	DiversityInstruction Following	CodeCode Available
Better Alignment with Instruction Back-and-Forth Translation	Aug 8, 2024	DiversityTranslation	—Unverified
Lifelong Personalized Low-Rank Adaptation of Large Language Models for Recommendation	Aug 7, 2024	Logical ReasoningRecommendation Systems	—Unverified
CLR-Fact: Evaluating the Complex Logical Reasoning Capability of Large Language Models over Factual Knowledge	Jul 30, 2024	In-Context LearningKnowledge Graphs	—Unverified
Visual Riddles: a Commonsense and World Knowledge Challenge for Large Vision and Language Models	Jul 28, 2024	World Knowledge	—Unverified
DYNAMICQA: Tracing Internal Knowledge Conflicts in Language Models	Jul 24, 2024	Retrieval-augmented GenerationWorld Knowledge	CodeCode Available
Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models	Jul 22, 2024	DisentanglementQuestion Answering	CodeCode Available
Generalization v.s. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data	Jul 20, 2024	Language ModellingMachine Translation	—Unverified
LoFTI: Localization and Factuality Transfer to Indian Locales	Jul 16, 2024	World Knowledge	CodeCode Available
VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving	Jul 9, 2024	Autonomous DrivingImage to 3D	—Unverified
BAPO: Base-Anchored Preference Optimization for Overcoming Forgetting in Large Language Models Personalization	Jun 30, 2024	Continual LearningGeneral Knowledge	—Unverified
Mental Modeling of Reinforcement Learning Agents by Language Models	Jun 26, 2024	Decision Makingreinforcement-learning	—Unverified
LABOR-LLM: Language-Based Occupational Representations with Large Language Models	Jun 25, 2024	In-Context LearningJob Prediction	—Unverified
Mitigating Hallucination in Fictional Character Role-Play	Jun 25, 2024	HallucinationWorld Knowledge	CodeCode Available

Show:10 25 50

← PrevPage 8 of 17Next →

No leaderboard results yet.