SOTAVerified

Benchmarking

Papers

Showing 33913400 of 5548 papers

TitleStatusHype
Benchmarking Multimodal LLMs on Recognition and Understanding over Chemical Tables0
Benchmarking multimedia technologies with the CAMOMILE platform: the case of Multimodal Person Discovery at MediaEval 20150
LLM-initialized Differentiable Causal Discovery0
Totally Corrective Boosting with Cardinality Penalization0
Benchmarking Multi-Domain Active Learning on Image Classification0
LLMPopcorn: An Empirical Study of LLMs as Assistants for Popular Micro-video Generation0
LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study0
Incorporating Human Flexibility through Reward Preferences in Human-AI Teaming0
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms0
LLMs and Finetuning: Benchmarking cross-domain performance for hate speech detection0
Show:102550
← PrevPage 340 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified