SOTAVerified

Large Language Model

Papers

Showing 2476–2500 of 6097 papers

| Title | Status | Hype |
| --- | --- | --- |
| Character-aware audio-visual subtitling in context | | 0 |
| How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective | Code | 0 |
| Diagnosing Hate Speech Classification: Where Do Humans and Machines Disagree, and Why? | | 0 |
| Recipe for Zero-shot POS Tagging: Is It Useful in Realistic Scenarios? | | 0 |
| Large Language Model Evaluation via Matrix Nuclear-Norm | Code | 0 |
| A Multi-Task Text Classification Pipeline with Natural Language Explanations: A User-Centric Evaluation in Sentiment Analysis and Offensive Language Identification in Greek Tweets | | 0 |
| Large Language Model-Enhanced Reinforcement Learning for Generic Bus Holding Control Strategies | | 0 |
| Model-based Large Language Model Customization as Service | | 0 |
| ForgeryGPT: Multimodal Large Language Model For Explainable Image Forgery Detection and Localization | | 0 |
| MisinfoEval: Generative AI in the Era of "Alternative Facts" | | 0 |
| Conversational Code Generation: a Case Study of Designing a Dialogue System for Generating Driving Scenarios for Testing Autonomous Vehicles | | 0 |
| Learning to Rank for Multiple Retrieval-Augmented Models through Iterative Utility Maximization | | 0 |
| MoIN: Mixture of Introvert Experts to Upcycle an LLM | | 0 |
| Adaptive Reasoning and Acting in Medical Language Agents | | 0 |
| ALLoRA: Adaptive Learning Rate Mitigates LoRA Fatal Flaws | | 0 |
| HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics | Code | 1 |
| LoRE: Logit-Ranked Retriever Ensemble for Enhancing Open-Domain Question Answering | | 0 |
| Impeding LLM-assisted Cheating in Introductory Programming Assignments via Adversarial Perturbation | | 0 |
| DRCap: Decoding CLAP Latents with Retrieval-Augmented Generation for Zero-shot Audio Captioning | | 0 |
| Debiasing Vison-Language Models with Text-Only Training | | 0 |
| LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense Reasoning | Code | 0 |
| Extended Japanese Commonsense Morality Dataset with Masked Token and Label Enhancement | | 0 |
| Enterprise Benchmarks for Large Language Model Evaluation | Code | 0 |
| LLMD: A Large Language Model for Interpreting Longitudinal Medical Records | | 0 |
| P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains | | 0 |
Page 100 of 244