SOTAVerified

Benchmarking

Papers

Showing 43264350 of 5548 papers

TitleStatusHype
Vision Transformer for Efficient Chest X-ray and Gastrointestinal Image Classification0
Visual Attention on the Sun: What Do Existing Models Actually Predict?0
Visual Fidelity Index for Generative Semantic Communications with Critical Information Embedding0
Visual Object Tracking on Multi-modal RGB-D Videos: A Review0
Visual Place Recognition for Large-Scale UAV Applications0
VITAL: A New Dataset for Benchmarking Pluralistic Alignment in Healthcare0
VoiceWukong: Benchmarking Deepfake Voice Detection0
V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning0
v-SVR Polynomial Kernel for Predicting the Defect Density in New Software Projects0
Vulnerability of Face Morphing Attacks: A Case Study on Lookalike and Identical Twins0
From Attack to Protection: Leveraging Watermarking Attack Network for Advanced Add-on Watermarking0
Ward: Provable RAG Dataset Inference via LLM Watermarks0
Watchog: A Light-weight Contrastive Learning based Framework for Column Annotation0
WebVision Challenge: Visual Learning and Understanding With Web Data0
WelQrate: Defining the Gold Standard in Small Molecule Drug Discovery Benchmarking0
WER We Stand: Benchmarking Urdu ASR Models0
What can 5.17 billion regression fits tell us about artificial models of the human visual system?0
What cleaves? Is proteasomal cleavage prediction reaching a ceiling?0
What Does Neuro Mean to Cardio? Investigating the Role of Clinical Specialty Data in Medical LLMs0
What Emotions Make One or Five Stars? Understanding Ratings of Online Product Reviews by Sentiment Analysis and XAI0
What if we had no Wikipedia? Domain-independent Term Extraction from a Large News Corpus0
Alexpaca: Learning Factual Clarification Question Generation Without Examples0
What Motivates You? Benchmarking Automatic Detection of Basic Needs from Short Posts0
Towards Self-adaptive Mutation in Evolutionary Multi-Objective Algorithms0
What Will it Take to Fix Benchmarking in Natural Language Understanding?0
Show:102550
← PrevPage 174 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified