SOTAVerified

Large Language Model

Papers

Showing 21512175 of 6097 papers

TitleStatusHype
How Far Are LLMs from Believable AI? A Benchmark for Evaluating the Believability of Human Behavior SimulationCode0
How Benchmark Prediction from Fewer Data Misses the MarkCode0
HORAE: A Domain-Agnostic Language for Automated Service RegulationCode0
EdgeWisePersona: A Dataset for On-Device User Profiling from Natural Language InteractionsCode0
HLAT: High-quality Large Language Model Pre-trained on AWS TrainiumCode0
Fine-Grained Behavior Simulation with Role-Playing Large Language Model on Social MediaCode0
Assessing the Reliability of Large Language Model KnowledgeCode0
Historical Ink: 19th Century Latin American Spanish Newspaper Corpus with LLM OCR CorrectionCode0
How Personality Traits Influence Negotiation Outcomes? A Simulation based on Large Language ModelsCode0
Human-Centered LLM-Agent User Interface: A Position PaperCode0
Assessing the Promise and Pitfalls of ChatGPT for Automated Code GenerationCode0
Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts?Code0
HeavyWater and SimplexWater: Watermarking Low-Entropy Text DistributionsCode0
Haste Makes Waste: Evaluating Planning Abilities of LLMs for Efficient and Feasible Multitasking with Time Constraints Between ActionsCode0
FinTruthQA: A Benchmark Dataset for Evaluating the Quality of Financial Information DisclosureCode0
Harnessing Large Language Models Over Transformer Models for Detecting Bengali Depressive Social Media Text: A Comprehensive StudyCode0
Harnessing the Power of Large Language Model for Uncertainty Aware Graph ProcessingCode0
Health Text Simplification: An Annotated Corpus for Digestive Cancer Education and Novel Strategies for Reinforcement LearningCode0
HALoS: Hierarchical Asynchronous Local SGD over Slow Networks for Geo-Distributed Large Language Model TrainingCode0
A Comparison of Methods for Evaluating Generative IRCode0
Heaps' Law in GPT-Neo Large Language Model Emulated CorporaCode0
G-Safeguard: A Topology-Guided Security Lens and Treatment on LLM-based Multi-agent SystemsCode0
G-SciEdBERT: A Contextualized LLM for Science Assessment Tasks in GermanCode0
ChatVis: Automating Scientific Visualization with a Large Language ModelCode0
Guarded Query Routing for Large Language ModelsCode0
Show:102550
← PrevPage 87 of 244Next →

No leaderboard results yet.