SOTAVerified

Large Language Model

Papers

Showing 781790 of 6097 papers

TitleStatusHype
Virology Capabilities Test (VCT): A Multimodal Virology Q&A BenchmarkCode0
Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling EvaluatorsCode0
Kuwain 1.5B: An Arabic SLM via Language Injection0
EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models0
Automated Duplicate Bug Report Detection in Large Open Bug Repositories0
Don't Retrieve, Generate: Prompting LLMs for Synthetic Training Data in Dense Retrieval0
ResNetVLLM -- Multi-modal Vision LLM for the Video Understanding Task0
Causal Disentanglement for Robust Long-tail Medical Image Generation0
PROMPTEVALS: A Dataset of Assertions and Guardrails for Custom Production Large Language Model Pipelines0
Bottom-Up Synthesis of Knowledge-Grounded Task-Oriented Dialogues with Iteratively Self-Refined Prompts0
Show:102550
← PrevPage 79 of 610Next →

No leaderboard results yet.