SOTAVerified

Large Language Model

Papers

Showing 23512400 of 6097 papers

TitleStatusHype
Evaluating the Effectiveness of Retrieval-Augmented Large Language Models in Scientific Document Reasoning0
Evaluating Text Creativity across Diverse Domains: A Dataset and Large Language Model Evaluator0
Evaluating Steering Techniques using Human Similarity Judgments0
A Reproducibility Study of Graph-Based Legal Case Retrieval0
Evaluating Self-Generated Documents for Enhancing Retrieval-Augmented Generation with Large Language Models0
Chaining text-to-image and large language model: A novel approach for generating personalized e-commerce banners0
Evaluating Nuanced Bias in Large Language Model Free Response Answers0
Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions0
CFunModel: A "Funny" Language Model Capable of Chinese Humor Generation and Processing0
Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems0
Agents on the Bench: Large Language Model Based Multi Agent Framework for Trustworthy Digital Justice0
Who You Are Matters: Bridging Topics and Social Roles via LLM-Enhanced Logical Recommendation0
Evaluating LLM-based Agents for Multi-Turn Conversations: A Survey0
Evaluating LLaMA 3.2 for Software Vulnerability Detection0
Read and Think: An Efficient Step-wise Multimodal Language Model for Document Understanding and Reasoning0
Evaluating Large Language Model Creativity from a Literary Perspective0
Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation?0
Evaluating Large Language Model Capability in Vietnamese Fact-Checking Data Generation0
Evaluating Large Language Model Capabilities in Assessing Spatial Econometrics Research0
Evaluating Knowledge Graph Based Retrieval Augmented Generation Methods under Knowledge Incompleteness0
CFBenchmark-MM: Chinese Financial Assistant Benchmark for Multimodal Large Language Model0
Are Human Conversations Special? A Large Language Model Perspective0
Evaluating GPT-4 with Vision on Detection of Radiological Findings on Chest Radiographs0
Integrating Diverse Knowledge Sources for Online One-shot Learning of Novel Tasks0
CFaiRLLM: Consumer Fairness Evaluation in Large-Language Model Recommender System0
Evaluating Consistencies in LLM responses through a Semantic Clustering of Question Answering0
CephGPT-4: An Interactive Multimodal Cephalometric Measurement and Diagnostic System with Visual Large Language Model0
ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer0
Evaluating ChatGPT text-mining of clinical records for obesity monitoring0
Evaluating Chatbots to Promote Users' Trust -- Practices and Open Problems0
Evaluating Apple Intelligence's Writing Tools for Privacy Against Large Language Model-Based Inference Attacks: Insights from Early Datasets0
CDEMapper: Enhancing NIH Common Data Element Normalization using Large Language Models0
EvalLM: Interactive Evaluation of Large Language Model Prompts on User-Defined Criteria0
CCoE: A Compact LLM with Collaboration of Experts0
A Red Teaming Roadmap Towards System-Level Safety0
EuroLLM-9B: Technical Report0
ETimeline: An Extensive Timeline Generation Dataset based on Large Language Model0
EtC: Temporal Boundary Expand then Clarify for Weakly Supervised Video Grounding with Multimodal Large Language Model0
CBT-LLM: A Chinese Large Language Model for Cognitive Behavioral Therapy-based Mental Health Question Answering0
Estimating Contribution Quality in Online Deliberations Using a Large Language Model0
Arch-LLM: Taming LLMs for Neural Architecture Generation via Unsupervised Discrete Representation Learning0
Agents for self-driving laboratories applied to quantum computing0
Accuracy of a Large Language Model in Distinguishing Anti- And Pro-vaccination Messages on Social Media: The Case of Human Papillomavirus Vaccination0
Explain What You Mean: Intent Augmented Knowledge Graph Recommender Built With An LLM0
Infusing Environmental Captions for Long-Form Video Language Grounding0
E-Sparse: Boosting the Large Language Model Inference through Entropy-based N:M Sparsity0
ESLM: Risk-Averse Selective Language Modeling for Efficient Pretraining0
Escaping Collapse: The Strength of Weak Data for Large Language Model Training0
ERABAL: Enhancing Role-Playing Agents through Boundary-Aware Learning0
Data Augmentations for Improved (Large) Language Model Generalization0
Show:102550
← PrevPage 48 of 122Next →

No leaderboard results yet.