SOTAVerified

World Knowledge

Papers

Showing 351400 of 818 papers

TitleStatusHype
A Systematic Analysis of Large Language Models as Soft Reasoners: The Case of Syllogistic InferencesCode0
StorySparkQA: Expert-Annotated QA Pairs with Real-World Knowledge for Children's Story-Based LearningCode0
Language Model Behavior: A Comprehensive SurveyCode0
Large Language Models Need Consultants for Reasoning: Becoming an Expert in a Complex Human System Through Behavior SimulationCode0
Look Before You Decide: Prompting Active Deduction of MLLMs for Assumptive Reasoning0
EyeGPT: Ophthalmic Assistant with Large Language Models0
Extracting Common Inference Patterns from Semi-Structured Explanations0
ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model0
Extracting Action Sequences from Texts Based on Deep Reinforcement Learning0
Exploring the Potential of Large Language Models for Heterophilic Graphs0
A Survey of Reinforcement Learning Informed by Natural Language0
A Knowledge-Augmented Neural Network Model for Implicit Discourse Relation Classification0
Exploring the Limits of Few-Shot Link Prediction in Knowledge Graphs0
Exploring Large Language Models for Multi-Modal Out-of-Distribution Detection0
Exploring Failure Cases in Multimodal Reasoning About Physical Dynamics0
Characterizing Large Language Models as Rationalizers of Knowledge-intensive Tasks0
Exploring Factual Entailment with NLI: A News Media Study0
Exploring Critical Testing Scenarios for Decision-Making Policies: An LLM Approach0
Categorization in the Wild: Generalizing Cognitive Models to Naturalistic Data across Languages0
A Study into Investigating Temporal Robustness of LLMs0
A Joint Training Framework for Open-World Knowledge Graph Embeddings0
EXnet: Efficient In-context Learning for Data-less Text classification0
EventVAD: Training-Free Event-Aware Video Anomaly Detection0
Evaluating the Ability of Large Language Models to Reason about Cardinal Directions0
The Next Chapter: A Study of Large Language Models in Storytelling0
A Google-Proof Collection of French Winograd Schemas0
EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents0
Can LLMs Maintain Fundamental Abilities under KV Cache Compression?0
Entity Type Recognition using an Ensemble of Distributional Semantic Models to Enhance Query Understanding0
Dynamic Retrieval-Augmented Generation0
Can Large Language Models Play Text Games Well? Current State-of-the-Art and Open Questions0
Enthymemetic Conditionals0
Enriching Basque Coreference Resolution System using Semantic Knowledge sources0
Can Language Models Act as Knowledge Bases at Scale?0
A Semi-supervised Scalable Unified Framework for E-commerce Query Classification0
ADAM: An Embodied Causal Agent in Open-World Environments0
A Bayesian Model for Joint Learning of Categories and their Features0
Enhancing Traffic Prediction with Textual Data Using Large Language Models0
Enhancing Question Answering by Injecting Ontological Knowledge through Regularization0
Can GPT tell us why these images are synthesized? Empowering Multimodal Large Language Models for Forensics0
Enhancing Multilingual Information Retrieval in Mixed Human Resources Environments: A RAG Model Implementation for Multicultural Enterprise0
Enhancing LLM-based Recommendation through Semantic-Aligned Collaborative Knowledge0
Can-Do! A Dataset and Neuro-Symbolic Grounded Framework for Embodied Planning with Large Multimodal Models0
Enhancing Cache-Augmented Generation (CAG) with Adaptive Contextual Compression for Scalable Knowledge Integration0
Encoding World Knowledge in the Evaluation of Local Coherence0
Bursting Filter Bubble: Enhancing Serendipity Recommendations with Aligned Large Language Models0
ARise: Towards Knowledge-Augmented Reasoning via Risk-Adaptive Search0
Empowering Language Models with Knowledge Graph Reasoning for Question Answering0
60 Data Points are Sufficient to Fine-Tune LLMs for Question-Answering0
Building a visual semantics aware object hierarchy0
Show:102550
← PrevPage 8 of 17Next →

No leaderboard results yet.