Benchmarking

Papers

Showing 4451–4500 of 5548 papers

Title	Date	Tasks	Status
Bi-Discriminator Class-Conditional Tabular GAN	Nov 12, 2021	Benchmarking	Unverified
Benchmarking deep generative models for diverse antibody sequence design	Nov 12, 2021	Benchmarking, Diversity	Unverified
ADCB: An Alzheimer's disease benchmark for evaluating observational estimators of causal effects	Nov 12, 2021	Benchmarking, Causal Inference	Unverified
MLHarness: A Scalable Benchmarking System for MLCommons	Nov 9, 2021	Benchmarking	Unverified
Practical, Fast and Robust Point Cloud Registration for 3D Scene Stitching and Object Localization	Nov 8, 2021	3D Feature Matching, Benchmarking	Unverified
Characterizing the adversarial vulnerability of speech self-supervised learning	Nov 8, 2021	Adversarial Robustness, Benchmarking	Unverified
EvoLearner: Learning Description Logics with Evolutionary Algorithms	Nov 8, 2021	Benchmarking, Evolutionary Algorithms	Code Available
A new baseline for retinal vessel segmentation: Numerical identification and correction of methodological inconsistencies affecting 100+ papers	Nov 6, 2021	Benchmarking, Retinal Vessel Segmentation	Code Available
Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies	Nov 3, 2021	All, Benchmarking	Unverified
Virus-MNIST: Machine Learning Baseline Calculations for Image Classification	Nov 3, 2021	Benchmarking, BIG-bench Machine Learning	Unverified
Procedural Generalization by Planning with Self-Supervised World Models	Nov 2, 2021	Benchmarking, Meta-Learning	Unverified
Who’s on First?: Probing the Learning and Representation Capabilities of Language Models on Deterministic Closed Domains	Nov 1, 2021	Benchmarking, Language Modeling	Code Available
Automatic Resolution of Domain Name Disputes	Nov 1, 2021	Benchmarking	Code Available
Constructing a Psychometric Testbed for Fair Natural Language Processing	Nov 1, 2021	Benchmarking, Fairness	Code Available
Livestock Monitoring with Transformer	Nov 1, 2021	Action Recognition, Benchmarking	Unverified
Distributing Deep Learning Hyperparameter Tuning for 3D Medical Image Segmentation	Oct 29, 2021	Benchmarking, Brain Tumor Segmentation	Code Available
Towards a Taxonomy of Graph Learning Datasets	Oct 27, 2021	Benchmarking, Graph Learning	Unverified
Identifying and Benchmarking Natural Out-of-Context Prediction Problems	Oct 25, 2021	Benchmarking	Code Available
Which Model to Trust: Assessing the Influence of Models on the Performance of Reinforcement Learning Algorithms for Continuous Control Tasks	Oct 25, 2021	Benchmarking, continuous-control	Code Available
Quantum Boosting using Domain-Partitioning Hypotheses	Oct 25, 2021	Benchmarking, Ensemble Learning	Code Available
Scientific Machine Learning Benchmarks	Oct 25, 2021	Benchmarking, BIG-bench Machine Learning	Unverified
Benchmarking of Lightweight Deep Learning Architectures for Skin Cancer Classification using ISIC 2017 Dataset	Oct 23, 2021	Benchmarking, Cancer Classification	Unverified
MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems	Oct 21, 2021	Benchmarking, BIG-bench Machine Learning	Unverified
Improved Multilingual Language Model Pretraining for Social Media Text via Translation Pair Prediction	Oct 20, 2021	Benchmarking, Language Modeling	Code Available
An Open Natural Language Processing Development Framework for EHR-based Clinical Research: A case demonstration using the National COVID Cohort Collaborative (N3C)	Oct 20, 2021	Benchmarking	Unverified
GAN-based disentanglement learning for chest X-ray rib suppression	Oct 18, 2021	Benchmarking, Computed Tomography (CT)	Unverified
Benchmarking Biomedical Nested NER and Relation Extraction Models	Oct 16, 2021	Benchmarking, NER	Unverified
MTG: A Benchmarking Suite for Multilingual Text Generation	Oct 16, 2021	Benchmarking, Question Generation	Unverified
OG-SPACE: Optimized Stochastic Simulation of Spatial Models of Cancer Evolution	Oct 13, 2021	Benchmarking	Code Available
Benchmarking human visual search computational models in natural scenes: models comparison and reference datasets	Oct 12, 2021	Benchmarking	Unverified
What can 5.17 billion regression fits tell us about artificial models of the human visual system?	Oct 12, 2021	Benchmarking	Unverified
The CaLiGraph Ontology as a Challenge for OWL Reasoners	Oct 11, 2021	Benchmarking, Knowledge Graphs	Code Available
SCEHR: Supervised Contrastive Learning for Clinical Risk Prediction using Electronic Health Records	Oct 11, 2021	Benchmarking, Binary Classification	Code Available
Beyond Accuracy: A Consolidated Tool for Visual Question Answering Benchmarking	Oct 11, 2021	Benchmarking, Question Answering	Code Available
Evolving Evolutionary Algorithms with Patterns	Oct 10, 2021	Benchmarking, Evolutionary Algorithms	Code Available
Hybrid Random Features	Oct 8, 2021	Benchmarking	Code Available
Explicitly Multi-Modal Benchmarks for Multi-Objective Optimization	Oct 7, 2021	Benchmarking	Unverified
Process Extraction from Text: Benchmarking the State of the Art and Paving the Way for Future Challenges	Oct 7, 2021	Benchmarking, Model extraction	Code Available
Benchmarking Safety Monitors for Image Classifiers with Machine Learning	Oct 4, 2021	Autonomous Vehicles, Benchmarking	Code Available
A New Approach for Image Authentication Framework for Media Forensics Purpose	Oct 3, 2021	Astronomy, Benchmarking	Unverified
Less is more: Selecting the right benchmarking set of data for time series classification	Sep 29, 2021	Benchmarking, Time Series	Unverified
Decentralized Learning for Overparameterized Problems: A Multi-Agent Kernel Approximation Approach	Sep 29, 2021	Benchmarking	Unverified
NAS-Bench-Zero: A Large Scale Dataset for Understanding Zero-Shot Neural Architecture Search	Sep 29, 2021	Benchmarking, Neural Architecture Search	Unverified
Modelling neuronal behaviour with time series regression: Recurrent Neural Networks on synthetic C. elegans data	Sep 29, 2021	Benchmarking, regression	Unverified
Benchmarking Machine Learning Robustness in Covid-19 Spike Sequence Classification	Sep 29, 2021	Benchmarking, BIG-bench Machine Learning	Unverified
FastEnsemble: Benchmarking and Accelerating Ensemble-based Uncertainty Estimation for Image-to-Image Translation	Sep 29, 2021	Benchmarking, Image Generation	Unverified
Best Practices in Pool-based Active Learning for Image Classification	Sep 29, 2021	Active Learning, Benchmarking	Unverified
Benchmarking person re-identification approaches and training datasets for practical real-world implementations	Sep 29, 2021	Benchmarking, Pedestrian Detection	Unverified
A Two-Stage Neural-Filter Pareto Front Extractor and the need for Benchmarking	Sep 29, 2021	Benchmarking, Multi-Task Learning	Unverified
Deep Learning of Intrinsically Motivated Options in the Arcade Learning Environment	Sep 29, 2021	Atari Games, Benchmarking	Unverified

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified