SOTAVerified

Benchmarking

Papers

Showing 11111120 of 5548 papers

TitleStatusHype
Benchmarking Quantized Neural Networks on FPGAs with FINNCode1
Benchmarking saliency methods for chest X-ray interpretationCode1
Benchmarking Large Language Models on Controllable Generation under Diversified InstructionsCode1
Benchmarking Skeleton-based Motion Encoder Models for Clinical Applications: Estimating Parkinson's Disease Severity in Walking SequencesCode1
Benchmarking Reinforcement Learning Techniques for Autonomous NavigationCode1
Benchmarking Self-Supervised Learning on Diverse Pathology DatasetsCode1
Benchmarking Simulation-Based InferenceCode1
German Text Embedding Clustering BenchmarkCode1
OptiBench Meets ReSocratic: Measure and Improve LLMs for Optimization ModelingCode1
Benchmarking the Generation of Fact Checking ExplanationsCode1
Show:102550
← PrevPage 112 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified