SOTAVerified

Benchmarking

Papers

Showing 1911-1920 of 5548 papers

Title | Status | Hype
Improved Target-specific Stance Detection on Social Media Platforms by Delving into Conversation Threads | Code | 0
Bias Analysis and Mitigation in the Evaluation of Authorship Verification | Code | 0
BED: Bi-Encoder-Based Detectors for Out-of-Distribution Detection | Code | 0
ImmersePro: End-to-End Stereo Video Synthesis Via Implicit Disparity Learning | Code | 0
Immunofluorescence Capillary Imaging Segmentation: Cases Study | Code | 0
BEARD: Benchmarking the Adversarial Robustness for Dataset Distillation | Code | 0
AMQA: An Adversarial Dataset for Benchmarking Bias of LLMs in Medicine and Healthcare | Code | 0
Beyond Supervised vs. Unsupervised: Representative Benchmarking and Analysis of Image Representation Learning | Code | 0
Impact of ImageNet Model Selection on Domain Adaptation | Code | 0
Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual Illusions | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified