SOTAVerified

Large Language Model

Papers

Showing 51015150 of 6097 papers

TitleStatusHype
Enriching Tabular Data with Contextual LLM Embeddings: A Comprehensive Ablation Study for Ensemble Classifiers0
EnsemW2S: Enhancing Weak-to-Strong Generalization with Large Language Model Ensembles0
Ensuring Consistency for In-Image Translation0
Ensuring Fair LLM Serving Amid Diverse Applications0
LLMs Plagiarize: Ensuring Responsible Sourcing of Large Language Model Training Data Through Knowledge Graph Comparison0
Enterprise Large Language Model Evaluation Benchmark0
EntroLLM: Entropy Encoded Weight Compression for Efficient Large Language Model Inference on Edge Devices0
Entropy-based Exploration Conduction for Multi-step Reasoning0
Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning0
EnvInjection: Environmental Prompt Injection Attack to Multi-modal Web Agents0
EpilepsyLLM: Domain-Specific Large Language Model Fine-tuned with Epilepsy Medical Knowledge0
Episodic Memory Verbalization using Hierarchical Representations of Life-Long Robot Experience0
EPS-MoE: Expert Pipeline Scheduler for Cost-Efficient MoE Inference0
ERABAL: Enhancing Role-Playing Agents through Boundary-Aware Learning0
Escaping Collapse: The Strength of Weak Data for Large Language Model Training0
ESLM: Risk-Averse Selective Language Modeling for Efficient Pretraining0
E-Sparse: Boosting the Large Language Model Inference through Entropy-based N:M Sparsity0
Estimating Contribution Quality in Online Deliberations Using a Large Language Model0
EtC: Temporal Boundary Expand then Clarify for Weakly Supervised Video Grounding with Multimodal Large Language Model0
ETimeline: An Extensive Timeline Generation Dataset based on Large Language Model0
EuroLLM-9B: Technical Report0
EvalLM: Interactive Evaluation of Large Language Model Prompts on User-Defined Criteria0
Evaluating Apple Intelligence's Writing Tools for Privacy Against Large Language Model-Based Inference Attacks: Insights from Early Datasets0
Evaluating Chatbots to Promote Users' Trust -- Practices and Open Problems0
Evaluating ChatGPT text-mining of clinical records for obesity monitoring0
Evaluating Consistencies in LLM responses through a Semantic Clustering of Question Answering0
Integrating Diverse Knowledge Sources for Online One-shot Learning of Novel Tasks0
Evaluating GPT-4 with Vision on Detection of Radiological Findings on Chest Radiographs0
Evaluating Knowledge Graph Based Retrieval Augmented Generation Methods under Knowledge Incompleteness0
Evaluating Large Language Model Capabilities in Assessing Spatial Econometrics Research0
Evaluating Large Language Model Capability in Vietnamese Fact-Checking Data Generation0
Evaluating Large Language Model Creativity from a Literary Perspective0
Evaluating LLaMA 3.2 for Software Vulnerability Detection0
Evaluating LLM-based Agents for Multi-Turn Conversations: A Survey0
Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions0
Evaluating Nuanced Bias in Large Language Model Free Response Answers0
Evaluating Self-Generated Documents for Enhancing Retrieval-Augmented Generation with Large Language Models0
Evaluating Steering Techniques using Human Similarity Judgments0
Evaluating Text Creativity across Diverse Domains: A Dataset and Large Language Model Evaluator0
Evaluating the Effectiveness of Retrieval-Augmented Large Language Models in Scientific Document Reasoning0
Evaluating the Effect of Retrieval Augmentation on Social Biases0
Evaluating the Efficacy of LLM-Based Reasoning for Multiobjective HPC Job Scheduling0
Evaluating The Performance of Using Large Language Models to Automate Summarization of CT Simulation Orders in Radiation Oncology0
Measuring the Quality of Answers in Political Q&As with Large Language Models0
Evaluating Voice Command Pipelines for Drone Control: From STT and LLM to Direct Classification and Siamese Networks0
Evaluation of AI Chatbots for Patient-Specific EHR Questions0
Evaluation of ChatGPT on Biomedical Tasks: A Zero-Shot Comparison with Fine-Tuned Generative Transformers0
Evaluation of large language model performance on the Biomedical Language Understanding and Reasoning Benchmark0
Evaluation of OpenAI o1: Opportunities and Challenges of AGI0
Evaluation of the Automated Labeling Method for Taxonomic Nomenclature Through Prompt-Optimized Large Language Model0
Show:102550
← PrevPage 103 of 122Next →

No leaderboard results yet.