SOTAVerified

Large Language Model

Papers

Showing 12311240 of 6097 papers

TitleStatusHype
EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial StatementsCode1
Expressing stigma and inappropriate responses prevents LLMs from safely replacing mental health providersCode1
AuditWen:An Open-Source Large Language Model for AuditCode1
Meaning Typed Prompting: A Technique for Efficient, Reliable Structured Output GenerationCode1
DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity EnvironmentsCode1
Exploring Empty Spaces: Human-in-the-Loop Data AugmentationCode1
Matching Patients to Clinical Trials with Large Language ModelsCode1
A Study of Generative Large Language Model for Medical Research and HealthcareCode1
CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global MemoryCode1
MathDial: A Dialogue Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning ProblemsCode1
Show:102550
← PrevPage 124 of 610Next →

No leaderboard results yet.