SOTAVerified

Benchmarking

Papers

Showing 32813290 of 5548 papers

TitleStatusHype
LLaVA-Docent: Instruction Tuning with Multimodal Large Language Model to Support Art Appreciation Education0
LLAVIDAL: A Large LAnguage VIsion Model for Daily Activities of Living0
LLM4DV: Using Large Language Models for Hardware Test Stimuli Generation0
LLM-based Evaluation Policy Extraction for Ecological Modeling0
LLM Evaluators Recognize and Favor Their Own Generations0
LLM-initialized Differentiable Causal Discovery0
LLMPopcorn: An Empirical Study of LLMs as Assistants for Popular Micro-video Generation0
LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study0
LLMs and Finetuning: Benchmarking cross-domain performance for hate speech detection0
LMFormer: Lane based Motion Prediction Transformer0
Show:102550
← PrevPage 329 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified