SOTAVerified

Benchmarking

Papers

Showing 26312640 of 5548 papers

TitleStatusHype
FuzzWiz -- Fuzzing Framework for Efficient Hardware Coverage0
Benchmarking Smoothness and Reducing High-Frequency Oscillations in Continuous Control Policies0
Safe Load Balancing in Software-Defined-Networking0
ISImed: A Framework for Self-Supervised Learning using Intrinsic Spatial Information in Medical ImagesCode0
Polyp-E: Benchmarking the Robustness of Deep Segmentation Models via Polyp Editing0
Benchmarking Large Language Models for Image Classification of Marine MammalsCode0
Building Conformal Prediction Intervals with Approximate Message PassingCode0
Hiding in Plain Sight: Reframing Hardware Trojan Benchmarking as a Hide&Seek Modification0
Sketch2Code: Evaluating Vision-Language Models for Interactive Web Design Prototyping0
Benchmarking Pathology Foundation Models: Adaptation Strategies and ScenariosCode0
Show:102550
← PrevPage 264 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified