SOTAVerified

Benchmarking

Papers

Showing 10511060 of 5548 papers

TitleStatusHype
The CropAndWeed Dataset: A Multi-Modal Learning Approach for Efficient Crop and Weed ManipulationCode1
Trace Encoding in Process Mining: a survey and benchmarkingCode1
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance SegmentationCode1
SQAD: Automatic Smartphone Camera Quality Assessment and BenchmarkingCode1
MIGPerf: A Comprehensive Benchmark for Deep Learning Training and Inference Workloads on Multi-Instance GPUsCode1
Benchmarking Robustness of 3D Object Detection to Common CorruptionsCode1
A Comprehensive Study of the Robustness for LiDAR-based 3D Object Detectors against Adversarial AttacksCode1
Benchmarking Spatial Relationships in Text-to-Image GenerationCode1
Benchmarking Robustness of Multimodal Image-Text Models under Distribution ShiftCode1
Benchmarking Large Language Models for Automated Verilog RTL Code GenerationCode1
Show:102550
← PrevPage 106 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified