SOTAVerified

Large Language Model

Papers

Showing 3501–3525 of 6097 papers

| Title | Status | Hype |
| --- | --- | --- |
| Recipe for Zero-shot POS Tagging: Is It Useful in Realistic Scenarios? | | 0 |
| Large Language Model Evaluation via Matrix Nuclear-Norm | Code | 0 |
| ALLoRA: Adaptive Learning Rate Mitigates LoRA Fatal Flaws | | 0 |
| MisinfoEval: Generative AI in the Era of "Alternative Facts" | | 0 |
| Adaptive Reasoning and Acting in Medical Language Agents | | 0 |
| LoRE: Logit-Ranked Retriever Ensemble for Enhancing Open-Domain Question Answering | | 0 |
| Conversational Code Generation: a Case Study of Designing a Dialogue System for Generating Driving Scenarios for Testing Autonomous Vehicles | | 0 |
| Learning to Rank for Multiple Retrieval-Augmented Models through Iterative Utility Maximization | | 0 |
| MoIN: Mixture of Introvert Experts to Upcycle an LLM | | 0 |
| Impeding LLM-assisted Cheating in Introductory Programming Assignments via Adversarial Perturbation | | 0 |
| Extended Japanese Commonsense Morality Dataset with Masked Token and Label Enhancement | | 0 |
| DRCap: Decoding CLAP Latents with Retrieval-Augmented Generation for Zero-shot Audio Captioning | | 0 |
| LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense Reasoning | Code | 0 |
| Debiasing Vison-Language Models with Text-Only Training | | 0 |
| ViT3D Alignment of LLaMA3: 3D Medical Image Report Generation | | 0 |
| Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning | | 0 |
| P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains | | 0 |
| LLMD: A Large Language Model for Interpreting Longitudinal Medical Records | | 0 |
| Preferential Normalizing Flows | | 0 |
| Language-Model-Assisted Bi-Level Programming for Reward Learning from Internet Videos | | 0 |
| Can a large language model be a gaslighter? | Code | 0 |
| Emergent social conventions and collective bias in LLM populations | | 0 |
| Enterprise Benchmarks for Large Language Model Evaluation | Code | 0 |
| Hypothesis-only Biases in Large Language Model-Elicited Natural Language Inference | | 0 |
| ∀uto∃∨∧L: Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks | | 0 |
Page 141 of 244

No leaderboard results yet.