SOTAVerified

Large Language Model

Papers

Showing 24512500 of 6097 papers

TitleStatusHype
Revisited Large Language Model for Time Series Analysis through Modality Alignment0
Retrieval-Reasoning Large Language Model-based Synthetic Clinical Trial Generation0
Search Engines in an AI Era: The False Promise of Factual and Verifiable Source-Cited ResponsesCode1
Towards More Effective Table-to-Text Generation: Assessing In-Context Learning and Self-Evaluation with Open-Source Models0
De-jargonizing Science for Journalists with GPT-4: A Pilot StudyCode0
LargePiG: Your Large Language Model is Secretly a Pointer Generator0
WeatherDG: LLM-assisted Diffusion Model for Procedural Weather Generation in Domain-Generalized Semantic SegmentationCode2
MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router0
A Framework for Adapting Human-Robot Interaction to Diverse User GroupsCode0
MoChat: Joints-Grouped Spatio-Temporal Grounding LLM for Multi-Turn Motion Comprehension and Description0
ATTNChecker: Highly-Optimized Fault Tolerant Attention for Large Language Model Training0
Y-Mol: A Multiscale Biomedical Knowledge-Guided Large Language Model for Drug Development0
Retrieval Augmented Spelling Correction for E-Commerce Applications0
Synthetic Interlocutors. Experiments with Generative AI to Prolong Ethnographic Encounters0
Leveraging LLM Embeddings for Cross Dataset Label Alignment and Zero Shot Music Emotion PredictionCode0
Automatically Generating Visual Hallucination Test Cases for Multimodal Large Language ModelsCode0
Preserve or Modify? Context-Aware Evaluation for Balancing Preservation and Modification in Text-Guided Image Editing0
SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing0
G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks0
GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable RecommendationCode1
Sequential LLM Framework for Fashion Recommendation0
Skill Learning Using Process Mining for Large Language Model Plan Generation0
Not All Options Are Created Equal: Textual Option Weighting for Token-Efficient LLM-Based Knowledge Tracing0
PRACTIQ: A Practical Conversational Text-to-SQL dataset with Ambiguous and Unanswerable Queries0
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive MemoryCode3
Character-aware audio-visual subtitling in context0
How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning PerspectiveCode0
Diagnosing Hate Speech Classification: Where Do Humans and Machines Disagree, and Why?0
Recipe for Zero-shot POS Tagging: Is It Useful in Realistic Scenarios?0
Large Language Model Evaluation via Matrix Nuclear-NormCode0
A Multi-Task Text Classification Pipeline with Natural Language Explanations: A User-Centric Evaluation in Sentiment Analysis and Offensive Language Identification in Greek Tweets0
Large Language Model-Enhanced Reinforcement Learning for Generic Bus Holding Control Strategies0
Model-based Large Language Model Customization as Service0
ForgeryGPT: Multimodal Large Language Model For Explainable Image Forgery Detection and Localization0
MisinfoEval: Generative AI in the Era of "Alternative Facts"0
Conversational Code Generation: a Case Study of Designing a Dialogue System for Generating Driving Scenarios for Testing Autonomous Vehicles0
Learning to Rank for Multiple Retrieval-Augmented Models through Iterative Utility Maximization0
MoIN: Mixture of Introvert Experts to Upcycle an LLM0
Adaptive Reasoning and Acting in Medical Language Agents0
ALLoRA: Adaptive Learning Rate Mitigates LoRA Fatal Flaws0
HARDMath: A Benchmark Dataset for Challenging Problems in Applied MathematicsCode1
LoRE: Logit-Ranked Retriever Ensemble for Enhancing Open-Domain Question Answering0
Impeding LLM-assisted Cheating in Introductory Programming Assignments via Adversarial Perturbation0
DRCap: Decoding CLAP Latents with Retrieval-Augmented Generation for Zero-shot Audio Captioning0
Debiasing Vison-Language Models with Text-Only Training0
LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense ReasoningCode0
Extended Japanese Commonsense Morality Dataset with Masked Token and Label Enhancement0
Enterprise Benchmarks for Large Language Model EvaluationCode0
LLMD: A Large Language Model for Interpreting Longitudinal Medical Records0
P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains0
Show:102550
← PrevPage 50 of 122Next →

No leaderboard results yet.