SOTAVerified

World Knowledge

Papers

Showing 301-350 of 818 papers

Title | Status | Hype
Large Language Models Can Solve Real-World Planning Rigorously with Formal Verification Tools | | 0
More Room for Language: Investigating the Effect of Retrieval on Language Models | Code | 0
Data Collection of Real-Life Knowledge Work in Context: The RLKWiC Dataset | | 0
Tokenization, Fusion, and Augmentation: Towards Fine-grained Multi-modal Entity Representation | Code | 3
Comparing Apples to Oranges: LLM-powered Multimodal Intention Prediction in an Object Categorization Task | | 0
CEM: A Data-Efficient Method for Large Language Models to Continue Evolving From Mistakes | | 0
LLMs' Reading Comprehension Is Affected by Parametric Knowledge and Struggles with Hypothetical Statements | | 0
Exploring the Potential of Large Foundation Models for Open-Vocabulary HOI Detection | Code | 1
Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models | Code | 1
Mixture of Low-rank Experts for Transferable AI-Generated Image Detection | Code | 1
BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language Models | Code | 1
Scope Ambiguities in Large Language Models | Code | 0
PRobELM: Plausibility Ranking Evaluation for Language Models | | 0
PARIS3D: Reasoning-based 3D Part Segmentation Using Large Multimodal Model | Code | 1
Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization | | 0
GS2Mesh: Surface Reconstruction from Gaussian Splatting via Novel Stereo Views | Code | 3
LLMTreeRec: Unleashing the Power of Large Language Models for Cold-Start Recommendations | Code | 0
EventGround: Narrative Reasoning by Grounding to Eventuality-centric Knowledge Graphs | Code | 0
Enhancing Content-based Recommendation via Large Language Model | Code | 0
Are We on the Right Way for Evaluating Large Vision-Language Models? | Code | 3
LLMSense: Harnessing LLMs for High-level Reasoning Over Spatiotemporal Sensor Traces | | 0
Knowledge Boundary and Persona Dynamic Shape A Better Social Media Agent | Code | 0
Beyond Embeddings: The Promise of Visual Table in Visual Reasoning | Code | 1
Common Sense Enhanced Knowledge-based Recommendation with Large Language Model | Code | 1
Sequential Recommendation with Latent Relations based on Large Language Model | Code | 1
Large Language Models Need Consultants for Reasoning: Becoming an Expert in a Complex Human System Through Behavior Simulation | Code | 0
Mechanistic Understanding and Mitigation of Language Model Non-Factual Hallucinations | Code | 0
Large Language Models Enhanced Collaborative Filtering | | 0
Understanding Long Videos with Multimodal Language Models | Code | 2
ChatDBG: Augmenting Debugging with Large Language Models | Code | 5
Stance Reasoner: Zero-Shot Stance Detection on Social Media with Explicit Reasoning | Code | 0
Log Probabilities Are a Reliable Estimate of Semantic Plausibility in Base and Instruction-Tuned Language Models | Code | 0
Embodied LLM Agents Learn to Cooperate in Organized Teams | Code | 2
Informed Spectral Normalized Gaussian Processes for Trajectory Prediction | | 0
EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents | | 0
RetinaQA: A Robust Knowledge Base Question Answering Model for both Answerable and Unanswerable Questions | Code | 0
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM | | 0
Unified Source-Free Domain Adaptation | Code | 3
Can LLMs' Tuning Methods Work in Medical Multimodal Domain? | Code | 1
MeaCap: Memory-Augmented Zero-shot Image Captioning | Code | 2
Towards Efficient and Effective Unlearning of Large Language Models for Recommendation | Code | 1
Language Guided Exploration for RL Agents in Text Environments | | 0
FKA-Owl: Advancing Multimodal Fake News Detection through Knowledge-Augmented LVLMs | | 0
Cognition is All You Need -- The Next Layer of AI Above Large Language Models | | 0
LLMs for Targeted Sentiment in News Headlines: Exploring the Descriptive-Prescriptive Dilemma | | 0
Word Order and World Knowledge | Code | 0
EyeGPT: Ophthalmic Assistant with Large Language Models | | 0
AKEW: Assessing Knowledge Editing in the Wild | Code | 0
Learning or Self-aligning? Rethinking Instruction Fine-tuning | Code | 1
ICE-SEARCH: A Language Model-Driven Feature Selection Approach | | 0

No leaderboard results yet.