SOTAVerified

Large Language Model

Papers

Showing 35013550 of 6097 papers

TitleStatusHype
Recipe for Zero-shot POS Tagging: Is It Useful in Realistic Scenarios?0
Large Language Model Evaluation via Matrix Nuclear-NormCode0
ALLoRA: Adaptive Learning Rate Mitigates LoRA Fatal Flaws0
MisinfoEval: Generative AI in the Era of "Alternative Facts"0
Adaptive Reasoning and Acting in Medical Language Agents0
LoRE: Logit-Ranked Retriever Ensemble for Enhancing Open-Domain Question Answering0
Conversational Code Generation: a Case Study of Designing a Dialogue System for Generating Driving Scenarios for Testing Autonomous Vehicles0
Learning to Rank for Multiple Retrieval-Augmented Models through Iterative Utility Maximization0
MoIN: Mixture of Introvert Experts to Upcycle an LLM0
Impeding LLM-assisted Cheating in Introductory Programming Assignments via Adversarial Perturbation0
Extended Japanese Commonsense Morality Dataset with Masked Token and Label Enhancement0
DRCap: Decoding CLAP Latents with Retrieval-Augmented Generation for Zero-shot Audio Captioning0
LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense ReasoningCode0
Debiasing Vison-Language Models with Text-Only Training0
ViT3D Alignment of LLaMA3: 3D Medical Image Report Generation0
Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning0
P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains0
LLMD: A Large Language Model for Interpreting Longitudinal Medical Records0
Preferential Normalizing Flows0
Language-Model-Assisted Bi-Level Programming for Reward Learning from Internet Videos0
Can a large language model be a gaslighter?Code0
Emergent social conventions and collective bias in LLM populations0
Enterprise Benchmarks for Large Language Model EvaluationCode0
Hypothesis-only Biases in Large Language Model-Elicited Natural Language Inference0
uto\!L: Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks0
Uncovering Overfitting in Large Language Model Editing0
Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System0
Animating the Past: Reconstruct Trilobite via Video Generation0
Plug-and-Play Performance Estimation for LLM Services without Relying on Labeled DataCode0
CrossQuant: A Post-Training Quantization Method with Smaller Quantization Kernel for Precise Large Language Model Compression0
Efficient Reinforcement Learning with Large Language Model Priors0
Mitigating Gender Bias in Code Large Language Models via Model Editing0
A Framework for Collaborating a Large Language Model Tool in Brainstorming for Triggering Creative Thoughts0
LecPrompt: A Prompt-based Approach for Logical Error Correction with CodeBERT0
The Large Language Model GreekLegalRoBERTa0
Disease Entity Recognition and Normalization is Improved with Large Language Model Derived Synthetic Normalized Mentions0
Promptly Yours? A Human Subject Study on Prompt Inference in AI-Generated Art0
Enhancing Multimodal LLM for Detailed and Accurate Video Captioning using Multi-Round Preference Optimization0
Uncovering Factor Level Preferences to Improve Human-Model Alignment0
FltLM: An Intergrated Long-Context Large Language Model for Effective Context Filtering and Understanding0
I Want to Break Free! Persuasion and Anti-Social Behavior of LLMs in Multi-Agent Settings with Social HierarchyCode0
Recent advancements in LLM Red-Teaming: Techniques, Defenses, and Ethical Considerations0
Reproducing and Extending Experiments in Behavioral Strategy with Large Language Models0
LaMP: Language-Motion Pretraining for Motion Generation, Retrieval, and Captioning0
TinyEmo: Scaling down Emotional Reasoning via Metric ProjectionCode0
Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis0
Personal Intelligence System UniLM: Hybrid On-Device Small Language Model and Server-Based Large Language Model for Malay Nusantara0
Large Language Model Compression with Neural Architecture Search0
QuAILoRA: Quantization-Aware Initialization for LoRA0
Let's Ask GNN: Empowering Large Language Model for Graph In-Context Learning0
Show:102550
← PrevPage 71 of 122Next →

No leaderboard results yet.