World Knowledge

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 351–375 of 818 papers

Title	Date	Tasks	Status
All Entities are Not Created Equal: Examining the Long Tail for Fine-Grained Entity Typing	Oct 22, 2024	AllEntity Typing	—Unverified
Rulebreakers Challenge: Revealing a Blind Spot in Large Language Models' Reasoning with Formal Logic	Oct 21, 2024	Formal LogicWorld Knowledge	—Unverified
Roadmap towards Superhuman Speech Understanding using Large Language Models	Oct 17, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Comprehending Knowledge Graphs with Large Language Models for Recommender Systems	Oct 16, 2024	Knowledge-Aware RecommendationKnowledge Graphs	—Unverified
Understanding the Role of LLMs in Multimodal Evaluation Benchmarks	Oct 16, 2024	BenchmarkingLarge Language Model	CodeCode Available
KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities	Oct 15, 2024	Image GenerationRetrieval	—Unverified
DyVo: Dynamic Vocabularies for Learned Sparse Retrieval with Entities	Oct 10, 2024	Document RankingEntity Embeddings	CodeCode Available
TVBench: Redesigning Video-Language Evaluation	Oct 10, 2024	Multiple-choiceOpen-Ended Question Answering	—Unverified
Which Programming Language and What Features at Pre-training Stage Affect Downstream Logical Inference Performance?	Oct 9, 2024	In-Context LearningLogical Reasoning	CodeCode Available
SEAL: SEmantic-Augmented Imitation Learning via Language Model	Oct 3, 2024	Decision MakingImitation Learning	—Unverified
Intent Detection in the Age of LLMs	Oct 2, 2024	Data AugmentationIn-Context Learning	—Unverified
"Oh LLM, I'm Asking Thee, Please Give Me a Decision Tree": Zero-Shot Decision Tree Induction and Embedding with Large Language Models	Sep 27, 2024	Interpretable Machine LearningWorld Knowledge	—Unverified
"Why" Has the Least Side Effect on Model Editing	Sep 27, 2024	Experimental Designknowledge editing	—Unverified
Pioneering Reliable Assessment in Text-to-Image Knowledge Editing: Leveraging a Fine-Grained Dataset and an Innovative Criterion	Sep 26, 2024	Image GenerationIn-Context Learning	CodeCode Available
60 Data Points are Sufficient to Fine-Tune LLMs for Question-Answering	Sep 24, 2024	Question AnsweringWorld Knowledge	—Unverified
Style Outweighs Substance: Failure Modes of LLM Judges in Alignment Benchmarking	Sep 23, 2024	BenchmarkingDiversity	CodeCode Available
Can-Do! A Dataset and Neuro-Symbolic Grounded Framework for Embodied Planning with Large Multimodal Models	Sep 22, 2024	World Knowledge	—Unverified
The X Types -- Mapping the Semantics of the Twitter Sphere	Sep 22, 2024	Type predictionWorld Knowledge	—Unverified
Relevance-driven Decision Making for Safer and More Efficient Human Robot Collaboration	Sep 21, 2024	Collision AvoidanceDecision Making	—Unverified
Time Awareness in Large Language Models: Benchmarking Fact Recall Across Time	Sep 20, 2024	BenchmarkingWorld Knowledge	—Unverified
Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark	Sep 13, 2024	Sequential Decision MakingWorld Knowledge	—Unverified
Multimodal Large Language Model Driven Scenario Testing for Autonomous Vehicles	Sep 10, 2024	Autonomous VehiclesLanguage Modeling	—Unverified
How Does Code Pretraining Affect Language Model Task Performance?	Sep 6, 2024	Language ModelingLanguage Modelling	—Unverified
Physical Rule-Guided Convolutional Neural Network	Sep 3, 2024	World Knowledge	—Unverified
CV-Probes: Studying the interplay of lexical and world knowledge in visually grounded verb understanding	Sep 2, 2024	World Knowledge	—Unverified

Show:10 25 50

← PrevPage 15 of 33Next →

No leaderboard results yet.