SOTAVerified

Large Language Model

Papers

Showing 2601–2650 of 6097 papers

Title | Status | Hype
Retrieving Versus Understanding Extractive Evidence in Few-Shot Learning |  | 0
Event Segmentation Applications in Large Language Model Enabled Automated Recall Assessments |  | 0
AgentCF++: Memory-enhanced LLM-based Agents for Popularity-aware Cross-domain Recommendations | Code | 0
What are Models Thinking about? Understanding Large Language Model Hallucinations "Psychology" through Model Inner State Analysis |  | 0
RGAR: Recurrence Generation-augmented Retrieval for Factual-aware Medical Question Answering |  | 0
Democratizing Large Language Model-Based Graph Data Augmentation via Latent Knowledge Graphs | Code | 0
Reproducing NevIR: Negation in Neural Information Retrieval | Code | 0
REFIND: Retrieval-Augmented Factuality Hallucination Detection in Large Language Models |  | 0
PLDR-LLMs Learn A Generalizable Tensor Operator That Can Replace Its Own Deep Neural Net At Inference | Code | 0
Vision-Based Generic Potential Function for Policy Alignment in Multi-Agent Reinforcement Learning |  | 0
Autellix: An Efficient Serving Engine for LLM Agents as General Programs |  | 0
Complex Ontology Matching with Large Language Model Embeddings |  | 0
Reflection of Episodes: Learning to Play Game from Expert and Self Experiences |  | 0
TALKPLAY: Multimodal Music Recommendation with Large Language Models |  | 0
LLM should think and action as a human |  | 0
Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL |  | 0
OCCULT: Evaluating Large Language Models for Offensive Cyber Operation Capabilities |  | 0
User Intent to Use DeepSeek for Healthcare Purposes and their Trust in the Large Language Model: Multinational Survey Study |  | 0
Private Text Generation by Seeding Large Language Model Prompts |  | 0
Reasoning and the Trusting Behavior of DeepSeek and GPT: An Experiment Revealing Hidden Fault Lines in Large Language Models |  | 0
Towards a Design Guideline for RPA Evaluation: A Survey of Large Language Model-Based Role-Playing Agents |  | 0
Benchmarking Automatic Speech Recognition coupled LLM Modules for Medical Diagnostics |  | 0
STEER-ME: Assessing the Microeconomic Reasoning of Large Language Models |  | 0
Advanced simulation paradigm of human behaviour unveils complex financial systemic projection |  | 0
Towards more Contextual Agents: An extractor-Generator Optimization Framework |  | 0
BaKlaVa -- Budgeted Allocation of KV cache for Long-context Inference |  | 0
Gesture-Aware Zero-Shot Speech Recognition for Patients with Language Disorders |  | 0
MCTS-Judge: Test-Time Scaling in LLM-as-a-Judge for Code Correctness Evaluation |  | 0
SEFL: Harnessing Large Language Model Agents to Improve Educational Feedback Systems | Code | 0
Investigating and Extending Homans' Social Exchange Theory with Large Language Model based Agents | Code | 0
KL Penalty Control via Perturbation for Direct Preference Optimization | Code | 0
Towards an automated workflow in materials science for combining multi-modal simulative and experimental information using data mining and large language models |  | 0
You need to MIMIC to get FAME: Solving Meeting Transcript Scarcity with a Multi-Agent Conversations |  | 0
ReviewEval: An Evaluation Framework for AI-Generated Reviews |  | 0
NOTA: Multimodal Music Notation Understanding for Visual Large Language Model |  | 0
Learning to Reason at the Frontier of Learnability |  | 0
Accuracy Assessment of OpenAlex and Clarivate Scholar ID with an LLM-Assisted Benchmark |  | 0
Locally-Deployed Chain-of-Thought (CoT) Reasoning Model in Chemical Engineering: Starting from 30 Experimental Data |  | 0
SmartLLM: Smart Contract Auditing using Custom Generative AI |  | 0
RIDE: Enhancing Large Language Model Alignment through Restyled In-Context Learning Demonstration Exemplars | Code | 0
TimeCAP: Learning to Contextualize, Augment, and Predict Time Series Events with Large Language Model Agents |  | 0
DELMAN: Dynamic Defense Against Large Language Model Jailbreaking with Model Editing |  | 0
Addressing Moral Uncertainty using Large Language Models for Ethical Decision-Making |  | 0
SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities |  | 0
Aligning Sentence Simplification with ESL Learner's Proficiency for Language Acquisition | Code | 0
Connecting Large Language Model Agent to High Performance Computing Resource |  | 0
ConFit v2: Improving Resume-Job Matching using Hypothetical Resume Embedding and Runner-Up Hard-Negative Mining |  | 0
GRAPHGPT-O: Synergistic Multimodal Comprehension and Generation on Graphs |  | 0
Competing LLM Agents in a Non-Cooperative Game of Opinion Polarisation |  | 0
Intelligent Mobile AI-Generated Content Services via Interactive Prompt Engineering and Dynamic Service Provisioning |  | 0

No leaderboard results yet.