SOTAVerified

Benchmarking

Papers

Showing 4701–4750 of 5548 papers

Title | Status | Hype
Olympus: a benchmarking framework for noisy optimization and experiment planning | Code | 1
The FaceChannelS: Strike of the Sequences for the AffWild 2 Challenge | - | 0
An Analysis of Control Parameters of MOEA/D Under Two Different Optimization Scenarios | - | 0
Reviewing and Benchmarking Parameter Control Methods in Differential Evolution | - | 0
OpenTraj: Assessing Prediction Complexity in Human Trajectories Datasets | Code | 1
A new dataset of dog breed images and a benchmark for fine-grained classification | - | 0
Bag of Tricks for Adversarial Training | Code | 1
Metrics for Benchmarking and Uncertainty Quantification: Quality, Applicability, and a Path to Best Practices for Machine Learning in Chemistry | - | 0
HINT3: Raising the bar for Intent Detection in the Wild | Code | 1
Graph Joint Attention Networks | - | 0
An Analysis of Quality Indicators Using Approximated Optimal Distributions in a Three-dimensional Objective Space | - | 0
Benchmarking deep inverse models over time, and the neural-adjoint method | Code | 1
A BFS-Tree of Ranking References for Unsupervised Manifold Learning | Code | 1
Using Neural Architecture Search for Improving Software Flaw Detection in Multimodal Deep Learning Models | - | 0
Measuring the Complexity of Domains Used to Evaluate AI Systems | - | 0
What if we had no Wikipedia? Domain-independent Term Extraction from a Large News Corpus | - | 0
Job2Vec: Job Title Benchmarking with Collective Multi-View Representation Learning | - | 0
NABU - Multilingual Graph-based Neural RDF Verbalizer | - | 0
TadGAN: Time Series Anomaly Detection Using Generative Adversarial Networks | Code | 2
CoDEx: A Comprehensive Knowledge Graph Completion Benchmark | Code | 1
CVPR 2020 Continual Learning in Computer Vision Competition: Approaches, Results, Current Challenges and Future Directions | Code | 0
A Multisensory Learning Architecture for Rotation-invariant Object Recognition | - | 0
Utility-Optimized Synthesis of Differentially Private Location Traces | - | 0
BARS-CTR: Open Benchmarking for Click-Through Rate Prediction | Code | 1
IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding | Code | 1
Optimal Eco-driving Control of Autonomous and Electric Trucks in Adaptation to Highway Topography: Energy Minimization and Battery Life Extension | - | 0
MedMeshCNN -- Enabling MeshCNN for Medical Surface Models | - | 0
Searching for a Search Method: Benchmarking Search Algorithms for Generating NLP Adversarial Examples | Code | 2
Integrated Benchmarking and Design for Reproducible and Accessible Evaluation of Robotic Agents | - | 0
Deep Metric Learning Meets Deep Clustering: An Novel Unsupervised Approach for Feature Embedding | Code | 0
Referenced Thermodynamic Integration for Bayesian Model Selection: Application to COVID-19 Model Selection | Code | 0
Benchmarking off-the-shelf statistical shape modeling tools in clinical applications | - | 0
Iris Liveness Detection Competition (LivDet-Iris) -- The 2020 Edition | - | 0
PT-Ranking: A Benchmarking Platform for Neural Learning-to-Rank | Code | 1
Benchmarking adversarial attacks and defenses for time-series data | - | 0
NATS-Bench: Benchmarking NAS Algorithms for Architecture Topology and Size | Code | 1
Adversarially Training for Audio Classifiers | - | 0
Image Colorization: A Survey and Dataset | Code | 1
Optimal Scheduling of Anticipated COVID-19 Vaccination: A Case Study of New York State | - | 0
HoloGen: An open source toolbox for high-speed hologram generation | - | 0
ScrewNet: Category-Independent Articulation Model Estimation From Depth Images Using Screw Theory | Code | 1
Robust Vision Challenge 2020 -- 1st Place Report for Panoptic Segmentation | - | 0
Holistic Multi-View Building Analysis in the Wild with Projection Pooling | - | 0
Quantitative Survey of the State of the Art in Sign Language Recognition | Code | 1
A Unified Taylor Framework for Revisiting Attribution Methods | - | 0
Automatic sleep stage classification with deep residual networks in a mixed-cohort setting | Code | 1
MTOP: A Comprehensive Multilingual Task-Oriented Semantic Parsing Benchmark | - | 0
ISSAFE: Improving Semantic Segmentation in Accidents by Fusing Event-based Data | Code | 1
Benchmarking network fabrics for data distributed training of deep neural networks | - | 0
mlr3proba: An R Package for Machine Learning in Survival Analysis | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | - | Unverified