SOTAVerified

Benchmarking

Papers

Showing 39013910 of 5548 papers

TitleStatusHype
Improving Items and Contexts Understanding with Descriptive Graph for Conversational Recommendation0
Benchmarking the Physical-world Adversarial Robustness of Vehicle Detection0
Certifiable Black-Box Attacks with Randomized Adversarial Examples: Breaking Defenses with Provable ConfidenceCode0
ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit0
On Evaluation of Bangla Word Analogies0
ForamViT-GAN: Exploring New Paradigms in Deep Learning for Micropaleontological Image Analysis0
SimbaML: Connecting Mechanistic Models and Machine Learning with Augmented DataCode0
Benchmarking the Robustness of Quantized Models0
Probing Conceptual Understanding of Large Visual-Language ModelsCode0
Benchmarking Robustness to Text-Guided CorruptionsCode0
Show:102550
← PrevPage 391 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified