SOTAVerified

Benchmarking

Papers

Showing 47014725 of 5548 papers

TitleStatusHype
Olympus: a benchmarking framework for noisy optimization and experiment planningCode1
The FaceChannelS: Strike of the Sequences for the AffWild 2 Challenge0
An Analysis of Control Parameters of MOEA/D Under Two Different Optimization Scenarios0
Reviewing and Benchmarking Parameter Control Methods in Differential Evolution0
OpenTraj: Assessing Prediction Complexity in Human Trajectories DatasetsCode1
A new dataset of dog breed images and a benchmark for fine-grained classification0
Bag of Tricks for Adversarial TrainingCode1
Metrics for Benchmarking and Uncertainty Quantification: Quality, Applicability, and a Path to Best Practices for Machine Learning in Chemistry0
HINT3: Raising the bar for Intent Detection in the WildCode1
Graph Joint Attention Networks0
An Analysis of Quality Indicators Using Approximated Optimal Distributions in a Three-dimensional Objective Space0
Benchmarking deep inverse models over time, and the neural-adjoint methodCode1
A BFS-Tree of Ranking References for Unsupervised Manifold LearningCode1
Using Neural Architecture Search for Improving Software Flaw Detection in Multimodal Deep Learning Models0
Measuring the Complexity of Domains Used to Evaluate AI Systems0
What if we had no Wikipedia? Domain-independent Term Extraction from a Large News Corpus0
Job2Vec: Job Title Benchmarking with Collective Multi-View Representation Learning0
NABU - Multilingual Graph-based Neural RDF Verbalizer0
TadGAN: Time Series Anomaly Detection Using Generative Adversarial NetworksCode2
CoDEx: A Comprehensive Knowledge Graph Completion BenchmarkCode1
CVPR 2020 Continual Learning in Computer Vision Competition: Approaches, Results, Current Challenges and Future DirectionsCode0
A Multisensory Learning Architecture for Rotation-invariant Object Recognition0
Utility-Optimized Synthesis of Differentially Private Location Traces0
BARS-CTR: Open Benchmarking for Click-Through Rate PredictionCode1
IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language UnderstandingCode1
Show:102550
← PrevPage 189 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified