SOTAVerified

Benchmarking

Papers

Showing 42014225 of 5548 papers

TitleStatusHype
NAS-HPO-Bench-II: A Benchmark Dataset on Joint Optimization of Convolutional Neural Network Architecture and Training HyperparametersCode1
GAN-based disentanglement learning for chest X-ray rib suppression0
MTG: A Benchmarking Suite for Multilingual Text Generation0
Benchmarking Biomedical Nested NER and Relation Extraction Models0
Multitask Prompted Training Enables Zero-Shot Task GeneralizationCode2
HUMAN4D: A Human-Centric Multimodal Dataset for Motions and Immersive MediaCode1
OG-SPACE: Optimized Stochastic Simulation of Spatial Models of Cancer EvolutionCode0
Benchmarking the Robustness of Spatial-Temporal Models Against CorruptionsCode1
What can 5.17 billion regression fits tell us about artificial models of the human visual system?0
Benchmarking human visual search computational models in natural scenes: models comparison and reference datasets0
Codabench: Flexible, Easy-to-Use and Reproducible Benchmarking PlatformCode1
NAS-Bench-360: Benchmarking Neural Architecture Search on Diverse TasksCode1
S3PRL-VC: Open-source Voice Conversion Framework with Self-supervised Speech RepresentationsCode1
EDFace-Celeb-1M: Benchmarking Face Hallucination with a Million-scale DatasetCode1
Beyond Accuracy: A Consolidated Tool for Visual Question Answering BenchmarkingCode0
The CaLiGraph Ontology as a Challenge for OWL ReasonersCode0
SCEHR: Supervised Contrastive Learning for Clinical Risk Prediction using Electronic Health RecordsCode0
Performance Evaluation of Deep Transfer Learning on Multiclass Identification of Common Weed Species in Cotton Production SystemsCode1
Chaos as an interpretable benchmark for forecasting and data-driven modellingCode1
Evolving Evolutionary Algorithms with PatternsCode0
Hybrid Random FeaturesCode0
Process Extraction from Text: Benchmarking the State of the Art and Paving the Way for Future ChallengesCode0
Explicitly Multi-Modal Benchmarks for Multi-Objective Optimization0
SERAB: A multi-lingual benchmark for speech emotion recognitionCode1
EntQA: Entity Linking as Question AnsweringCode1
Show:102550
← PrevPage 169 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified