SOTAVerified

Benchmarking

Papers

Showing 38713880 of 5548 papers

TitleStatusHype
Rule-based Data Selection for Large Language Models0
RxRx3-core: Benchmarking drug-target interactions in High-Content Microscopy0
Sadeed: Advancing Arabic Diacritization Through Small Language Model0
Safe Load Balancing in Software-Defined-Networking0
SAIBench: A Structural Interpretation of AI for Science Through Benchmarks0
SAIBench: Benchmarking AI for Science0
Saliency Benchmarking Made Easy: Separating Models, Maps and Metrics0
Salient Object Detection: A Benchmark0
SAMA: Towards Multi-Turn Referential Grounded Video Chat with Large Language Models0
SAM-based instance segmentation models for the automation of structural damage detection0
Show:102550
← PrevPage 388 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified