SOTAVerified

Benchmarking

Papers

Showing 54515475 of 5548 papers

TitleStatusHype
There's No Comparison: Reference-less Evaluation Metrics in Grammatical Error CorrectionCode0
Technical Report on the CleverHans v2.1.0 Adversarial Examples LibraryCode0
Estimating transmission from genetic and epidemiological data: a metric to compare transmission trees0
Geometry-Based Next Frame Prediction from Monocular Video0
Quantum-Assisted Learning of Hardware-Embedded Probabilistic Graphical Models0
Joint Online Spoken Language Understanding and Language Modeling with Recurrent Neural Networks0
Benchmarking State-of-the-Art Deep Learning Software Tools0
Benchmarking confound regression strategies for the control of motion artifact in studies of functional connectivity0
Haze Visibility Enhancement: A Survey and Quantitative Benchmarking0
Multi-Camera Action Dataset for Cross-Camera Action Recognition Benchmarking0
Sparse Representation-Based Classification: Orthogonal Least Squares or Orthogonal Matching Pursuit?0
Hierarchical Data Generator based on Tree-Structured Stick Breaking Process for Benchmarking Clustering Methods0
Spatially Binned ROC: A Comprehensive Saliency Metric0
Extraction of clinical information from the non-invasive fetal electrocardiogram0
Yum-me: A Personalized Nutrient-based Meal Recommender SystemCode0
Coupling volume-excluding compartment-based models of diffusion at different scales: Voronoi and pseudo-compartment approaches0
BMOBench: Black-Box Multi-Objective Optimization Benchmarking Platform0
Fine-Grained Classification of Pedestrians in Video: Benchmark and State of the Art0
Movie Description0
COCO: Performance AssessmentCode0
Building a Large Scale Dataset for Image Emotion Recognition: The Fine Print and The BenchmarkCode0
Anytime Bi-Objective Optimization with a Hybrid Multi-Objective CMA-ES (HMO-CMA-ES)0
Active Learning for Community Detection in Stochastic Block Models0
Benchmarking Lexical Simplification Systems0
JATE 2.0: Java Automatic Term Extraction with Apache SolrCode0
Show:102550
← PrevPage 219 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified