Benchmarking

Papers

Showing 4651–4700 of 5548 papers

Title	Date	Tasks	Status	Hype
Cryo-RALib -- a modular library for accelerating alignment in cryo-EM	Nov 11, 2020	Benchmarking, GPU	Code Available	0
Perturbation-based exploration methods in deep reinforcement learning	Nov 10, 2020	Atari Games, Benchmarking	Unverified	0
Benchmarking LF-MMI, CTC and RNN-T Criteria for Streaming ASR	Nov 9, 2020	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
Characterizing Transactional Databases for Frequent Itemset Mining	Nov 9, 2020	Benchmarking	Unverified	0
Long Range Arena: A Benchmark for Efficient Transformers	Nov 8, 2020	16k, Benchmarking	Code Available	1
Beyond Marginal Uncertainty: How Accurately can Bayesian Regression Models Estimate Posterior Predictive Correlations?	Nov 6, 2020	Active Learning, Benchmarking	Code Available	0
A Comprehensive Comparison of Multi-Dimensional Image Denoising Methods	Nov 6, 2020	Benchmarking, Denoising	Code Available	0
Defense-friendly Images in Adversarial Attacks: Dataset and Metrics for Perturbation Difficulty	Nov 5, 2020	Adversarial Attack, Benchmarking	Code Available	0
InferBench: Understanding Deep Learning Inference Serving with an Automatic Benchmarking System	Nov 4, 2020	Benchmarking	Unverified	0
EEGS: A Transparent Model of Emotions	Nov 4, 2020	Benchmarking, model	Unverified	0
The Forchheim Image Database for Camera Identification in the Wild	Nov 4, 2020	Benchmarking, Fact Checking	Unverified	0
Rearrangement: A Challenge for Embodied AI	Nov 3, 2020	Benchmarking	Unverified	0
Face Morphing Attack Generation & Detection: A Comprehensive Survey	Nov 3, 2020	Benchmarking, Face Recognition	Unverified	0
IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP	Nov 2, 2020	Benchmarking, Language Modeling	Unverified	0
Collective Knowledge: organizing research projects as a database of reusable components and portable workflows with common APIs	Nov 2, 2020	Benchmarking	Code Available	1
Alibaba’s Submission for the WMT 2020 APE Shared Task: Improving Automatic Post-Editing with Pre-trained Conditional Cross-Lingual BERT	Nov 1, 2020	Automatic Post-Editing, Benchmarking	Unverified	0
Cross-lingual sentiment classification in low-resource Bengali language	Nov 1, 2020	Benchmarking, Classification	Code Available	0
On the Reliability and Validity of Detecting Approval of Political Actors in Tweets	Nov 1, 2020	Benchmarking, Sentiment Analysis	Unverified	0
Benchmarking Meaning Representations in Neural Semantic Parsing	Nov 1, 2020	Benchmarking, Semantic Parsing	Code Available	1
Neural Network Design: Learning from Neural Architecture Search	Nov 1, 2020	Benchmarking, image-classification	Code Available	0
Is Transfer Learning Necessary for Protein Landscape Prediction?	Oct 31, 2020	Benchmarking, Prediction	Unverified	0
A Critical Assessment of State-of-the-Art in Entity Alignment	Oct 30, 2020	Benchmarking, Entity Alignment	Code Available	1
Improving seasonal forecast using probabilistic deep learning	Oct 27, 2020	Benchmarking, Deep Learning	Unverified	0
SHARP 2020: The 1st Shape Recovery from Partial Textured 3D Scans Challenge Results	Oct 26, 2020	Benchmarking	Unverified	0
Benchmarking Deep Learning Interpretability in Time Series Predictions	Oct 26, 2020	Benchmarking, Deep Learning	Code Available	1
Probing Acoustic Representations for Phonetic Properties	Oct 25, 2020	Benchmarking, speech-recognition	Code Available	0
Kvasir-Instrument: Diagnostic and therapeutic tool segmentation dataset in gastrointestinal endoscopy	Oct 23, 2020	Benchmarking, Diagnostic	Code Available	1
KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi	Oct 23, 2020	Articles, Benchmarking	Code Available	1
CellCycleGAN: Spatiotemporal Microscopy Image Synthesis of Cell Populations using Statistical Shape Models and Conditional GANs	Oct 22, 2020	Benchmarking, Cell Segmentation	Unverified	0
Learnability and Complexity of Quantum Samples	Oct 22, 2020	Benchmarking	Code Available	0
Exploiting News Article Structure for Automatic Corpus Generation of Entailment Datasets	Oct 22, 2020	Articles, Benchmarking	Code Available	1
Self-Alignment Pretraining for Biomedical Entity Representations	Oct 22, 2020	Benchmarking, Entity Linking	Code Available	1
German's Next Language Model	Oct 21, 2020	Benchmarking, Document Classification	Code Available	1
On Benchmarking Iris Recognition within a Head-mounted Display for AR/VR Application	Oct 20, 2020	Benchmarking, Iris Recognition	Unverified	0
A Flatter Loss for Bias Mitigation in Cross-dataset Facial Age Estimation	Oct 20, 2020	Age Estimation, Benchmarking	Unverified	0
Promoting High Diversity Ensemble Learning with EnsembleBench	Oct 20, 2020	Benchmarking, Diversity	Code Available	1
How much progress have we made in neural network training? A New Evaluation Protocol for Benchmarking Optimizers	Oct 19, 2020	Benchmarking, Graph Mining	Unverified	0
Bayesian Neural Networks with Soft Evidence	Oct 19, 2020	Benchmarking	Code Available	0
RobustBench: a standardized adversarial robustness benchmark	Oct 19, 2020	Adversarial Robustness, Benchmarking	Code Available	1
RADIATE: A Radar Dataset for Automotive Perception in Bad Weather	Oct 18, 2020	Autonomous Driving, Benchmarking	Code Available	1
A Seq2Seq approach to Symbolic Regression	Oct 17, 2020	Benchmarking, regression	Code Available	0
ArCOV19-Rumors: Arabic COVID-19 Twitter Dataset for Misinformation Detection	Oct 17, 2020	Benchmarking, Fact Checking	Unverified	0
ALdataset: a benchmark for pool-based active learning	Oct 16, 2020	Active Learning, Benchmarking	Unverified	0
Applicability and Challenges of Deep Reinforcement Learning for Satellite Frequency Plan Design	Oct 15, 2020	Benchmarking, Decision Making	Unverified	0
Teaspoon: A comprehensive python package for topological signal processing	Oct 10, 2020	Benchmarking, Topological Data Analysis	Unverified	0
Downsampling and geometric feature methods for EEG classification tasks with CNNs	Oct 10, 2020	Benchmarking, EEG	Unverified	0
TOTOPO: Classifying univariate and multivariate time series with Topological Data Analysis	Oct 10, 2020	Benchmarking, Time Series	Unverified	0
Light Field Salient Object Detection: A Review and Benchmark	Oct 10, 2020	Benchmarking, Object	Code Available	1
Addressing the Real-world Class Imbalance Problem in Dermatology	Oct 9, 2020	Benchmarking, Few-Shot Learning	Unverified	0
Black-Box Optimization Revisited: Improving Algorithm Selection Wizards through Massive Benchmarking	Oct 8, 2020	Benchmarking	Unverified	0

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified