SOTAVerified

Benchmarking

Papers

Showing 33813390 of 5548 papers

TitleStatusHype
LLaVA-Docent: Instruction Tuning with Multimodal Large Language Model to Support Art Appreciation Education0
LLAVIDAL: A Large LAnguage VIsion Model for Daily Activities of Living0
LLM4DV: Using Large Language Models for Hardware Test Stimuli Generation0
Benchmarking Multimodal Regex Synthesis with Complex Structures0
LLM-based Evaluation Policy Extraction for Ecological Modeling0
A War Beyond Deepfake: Benchmarking Facial Counterfeits and Countermeasures0
Benchmarking Multimodal Models for Ukrainian Language Understanding Across Academic and Cultural Domains0
A Distance Oriented Kalman Filter Particle Swarm Optimizer Applied to Multi-Modality Image Registration0
Benchmarking Multimodal Models for Fine-Grained Image Analysis: A Comparative Study Across Diverse Visual Features0
LLM Evaluators Recognize and Favor Their Own Generations0
Show:102550
← PrevPage 339 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified