SOTAVerified

Large Language Model

Papers

Showing 23512375 of 6097 papers

TitleStatusHype
Evaluating the Effectiveness of Retrieval-Augmented Large Language Models in Scientific Document Reasoning0
Evaluating Text Creativity across Diverse Domains: A Dataset and Large Language Model Evaluator0
Evaluating Steering Techniques using Human Similarity Judgments0
A Reproducibility Study of Graph-Based Legal Case Retrieval0
Evaluating Self-Generated Documents for Enhancing Retrieval-Augmented Generation with Large Language Models0
Chaining text-to-image and large language model: A novel approach for generating personalized e-commerce banners0
Evaluating Nuanced Bias in Large Language Model Free Response Answers0
Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions0
CFunModel: A "Funny" Language Model Capable of Chinese Humor Generation and Processing0
Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems0
Agents on the Bench: Large Language Model Based Multi Agent Framework for Trustworthy Digital Justice0
Who You Are Matters: Bridging Topics and Social Roles via LLM-Enhanced Logical Recommendation0
Evaluating LLM-based Agents for Multi-Turn Conversations: A Survey0
Evaluating LLaMA 3.2 for Software Vulnerability Detection0
Read and Think: An Efficient Step-wise Multimodal Language Model for Document Understanding and Reasoning0
Evaluating Large Language Model Creativity from a Literary Perspective0
Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation?0
Evaluating Large Language Model Capability in Vietnamese Fact-Checking Data Generation0
Evaluating Large Language Model Capabilities in Assessing Spatial Econometrics Research0
Evaluating Knowledge Graph Based Retrieval Augmented Generation Methods under Knowledge Incompleteness0
CFBenchmark-MM: Chinese Financial Assistant Benchmark for Multimodal Large Language Model0
Are Human Conversations Special? A Large Language Model Perspective0
Evaluating GPT-4 with Vision on Detection of Radiological Findings on Chest Radiographs0
Integrating Diverse Knowledge Sources for Online One-shot Learning of Novel Tasks0
CFaiRLLM: Consumer Fairness Evaluation in Large-Language Model Recommender System0
Show:102550
← PrevPage 95 of 244Next →

No leaderboard results yet.