SOTAVerified

Large Language Model

Papers

Showing 2601–2625 of 6097 papers

Title | Status | Hype
PLDR-LLMs Learn A Generalizable Tensor Operator That Can Replace Its Own Deep Neural Net At Inference | Code | 0
Event Segmentation Applications in Large Language Model Enabled Automated Recall Assessments | | 0
Reflection of Episodes: Learning to Play Game from Expert and Self Experiences | | 0
AgentCF++: Memory-enhanced LLM-based Agents for Popularity-aware Cross-domain Recommendations | Code | 0
LLM should think and action as a human | | 0
Vision-Based Generic Potential Function for Policy Alignment in Multi-Agent Reinforcement Learning | | 0
Democratizing Large Language Model-Based Graph Data Augmentation via Latent Knowledge Graphs | Code | 0
REFIND: Retrieval-Augmented Factuality Hallucination Detection in Large Language Models | | 0
RGAR: Recurrence Generation-augmented Retrieval for Factual-aware Medical Question Answering | | 0
Autellix: An Efficient Serving Engine for LLM Agents as General Programs | | 0
What are Models Thinking about? Understanding Large Language Model Hallucinations "Psychology" through Model Inner State Analysis | | 0
Reproducing NevIR: Negation in Neural Information Retrieval | Code | 0
Complex Ontology Matching with Large Language Model Embeddings | | 0
Retrieving Versus Understanding Extractive Evidence in Few-Shot Learning | | 0
TALKPLAY: Multimodal Music Recommendation with Large Language Models | | 0
STEER-ME: Assessing the Microeconomic Reasoning of Large Language Models | | 0
Investigating and Extending Homans' Social Exchange Theory with Large Language Model based Agents | Code | 0
OCCULT: Evaluating Large Language Models for Offensive Cyber Operation Capabilities | | 0
User Intent to Use DeepSeek for Healthcare Purposes and their Trust in the Large Language Model: Multinational Survey Study | | 0
Towards more Contextual Agents: An extractor-Generator Optimization Framework | | 0
Reasoning and the Trusting Behavior of DeepSeek and GPT: An Experiment Revealing Hidden Fault Lines in Large Language Models | | 0
You need to MIMIC to get FAME: Solving Meeting Transcript Scarcity with a Multi-Agent Conversations | | 0
Benchmarking Automatic Speech Recognition coupled LLM Modules for Medical Diagnostics | | 0
Advanced simulation paradigm of human behaviour unveils complex financial systemic projection | | 0
BaKlaVa -- Budgeted Allocation of KV cache for Long-context Inference | | 0
Page 105 of 244

No leaderboard results yet.