SOTAVerified

Benchmarking

Papers

Showing 28512860 of 5548 papers

TitleStatusHype
Alexpaca: Learning Factual Clarification Question Generation Without Examples0
EvalCrafter: Benchmarking and Evaluating Large Video Generation ModelsCode1
DialogueLLM: Context and Emotion Knowledge-Tuned Large Language Models for Emotion Recognition in ConversationsCode1
BanglaNLP at BLP-2023 Task 1: Benchmarking different Transformer Models for Violence Inciting Text Detection in Bengali0
An Empirical Study of Super-resolution on Low-resolution Micro-expression Recognition0
Assessing Encoder-Decoder Architectures for Robust Coronary Artery Segmentation0
3DYoga90: A Hierarchical Video Dataset for Yoga Pose UnderstandingCode1
TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative Language ModelsCode0
A Novel Benchmarking Paradigm and a Scale- and Motion-Aware Model for Egocentric Pedestrian Trajectory Prediction0
Prompting Scientific Names for Zero-Shot Species Recognition0
Show:102550
← PrevPage 286 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified