Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4751–4800 of 5548 papers

Title	Date	Tasks	Status
Classification of Single-View Object Point Clouds	Dec 18, 2020	3D Object Classification6D Pose Estimation using RGB	—Unverified
Calibrated Adaptive Probabilistic ODE Solvers	Dec 15, 2020	BenchmarkingDescriptive	CodeCode Available
Uncertainty Estimation with Deep Learning for Rainfall-Runoff Modelling	Dec 15, 2020	BenchmarkingDeep Learning	—Unverified
Data and its (dis)contents: A survey of dataset development and use in machine learning research	Dec 9, 2020	BenchmarkingBIG-bench Machine Learning	—Unverified
Hybrid Quantum Computing -- Tabu Search Algorithm for Partitioning Problems: preliminary study on the Traveling Salesman Problem	Dec 9, 2020	BenchmarkingTraveling Salesman Problem	—Unverified
JANUS: Benchmarking Commercial and Open-Source Cloud and Edge Platforms for Object and Anomaly Detection Workloads	Dec 9, 2020	Anomaly DetectionBenchmarking	—Unverified
MOLTR: Multiple Object Localisation, Tracking, and Reconstruction from Monocular RGB Videos	Dec 9, 2020	BenchmarkingObject	—Unverified
MultiON: Benchmarking Semantic Map Memory using Multi-Object Navigation	Dec 7, 2020	BenchmarkingObject	—Unverified
Benchmarking Commercial Intent Detection Services with Practice-Driven Evaluations	Dec 7, 2020	BenchmarkingGoal-Oriented Dialog	CodeCode Available
AuthNet: A Deep Learning based Authentication Mechanism using Temporal Facial Feature Movements	Dec 4, 2020	BenchmarkingLip password classification	CodeCode Available
Benchmarking Automated Clinical Language Simplification: Dataset, Algorithm, and Evaluation	Dec 4, 2020	BenchmarkingMachine Translation	CodeCode Available
SMPLy Benchmarking 3D Human Pose Estimation in the Wild	Dec 4, 2020	3D Human Pose EstimationBenchmarking	—Unverified
Benchmarking Energy-Conserving Neural Networks for Learning Dynamics from Data	Dec 3, 2020	BenchmarkingInductive Bias	—Unverified
mlOSP: Towards a Unified Implementation of Regression Monte Carlo Algorithms	Dec 1, 2020	BenchmarkingBIG-bench Machine Learning	CodeCode Available
ABSA-Bench: Towards the Unified Evaluation of Aspect-based Sentiment Analysis Research	Dec 1, 2020	Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA)	—Unverified
AraBench: Benchmarking Dialectal Arabic-English Machine Translation	Dec 1, 2020	BenchmarkingData Augmentation	—Unverified
Benchmarking of Transformer-Based Pre-Trained Models on Social Media Text Classification Datasets	Dec 1, 2020	BenchmarkingClassification	—Unverified
A General Benchmarking Framework for Text Generation	Dec 1, 2020	BenchmarkingKnowledge Graphs	CodeCode Available
Benchmarking Automated Review Response Generation for the Hospitality Domain	Dec 1, 2020	BenchmarkingDomain Adaptation	—Unverified
Bayesian Multi-type Mean Field Multi-agent Imitation Learning	Dec 1, 2020	BenchmarkingImitation Learning	—Unverified
Meta learning to classify intent and slot labels with noisy few shot examples	Nov 30, 2020	Benchmarkingintent-classification	—Unverified
RealCause: Realistic Causal Inference Benchmarking	Nov 30, 2020	BenchmarkingCausal Inference	—Unverified
Class-agnostic Object Detection	Nov 28, 2020	BenchmarkingClass-agnostic Object Detection	—Unverified
A survey of benchmarking frameworks for reinforcement learning	Nov 27, 2020	Benchmarkingreinforcement-learning	—Unverified
Improving Augmentation and Evaluation Schemes for Semantic Image Synthesis	Nov 25, 2020	BenchmarkingData Augmentation	—Unverified
Cable Tree Wiring -- Benchmarking Solvers on a Real-World Scheduling Problem with a Variety of Precedence Constraints	Nov 25, 2020	BenchmarkingScheduling	CodeCode Available
Benchmarking Inference Performance of Deep Learning Models on Analog Devices	Nov 24, 2020	BenchmarkingDeep Learning	—Unverified
Spatially Correlated Patterns in Adversarial Images	Nov 21, 2020	BenchmarkingBlocking	—Unverified
Variational Laplace for Bayesian neural networks	Nov 20, 2020	BenchmarkingVariational Inference	—Unverified
FedEval: A Holistic Evaluation Framework for Federated Learning	Nov 19, 2020	BenchmarkingFederated Learning	—Unverified
Automatic Microprocessor Performance Bug Detection	Nov 17, 2020	Benchmarking	—Unverified
Benchmarking Domain Randomisation for Visual Sim-to-Real Transfer	Nov 13, 2020	BenchmarkingPose Estimation	—Unverified
Cryo-RALib -- a modular library for accelerating alignment in cryo-EM	Nov 11, 2020	BenchmarkingGPU	CodeCode Available
Perturbation-based exploration methods in deep reinforcement learning	Nov 10, 2020	Atari GamesBenchmarking	—Unverified
Benchmarking LF-MMI, CTC and RNN-T Criteria for Streaming ASR	Nov 9, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Characterizing Transactional Databases for Frequent Itemset Mining	Nov 9, 2020	Benchmarking	—Unverified
A Comprehensive Comparison of Multi-Dimensional Image Denoising Methods	Nov 6, 2020	BenchmarkingDenoising	CodeCode Available
Beyond Marginal Uncertainty: How Accurately can Bayesian Regression Models Estimate Posterior Predictive Correlations?	Nov 6, 2020	Active LearningBenchmarking	CodeCode Available
Defense-friendly Images in Adversarial Attacks: Dataset and Metrics for Perturbation Difficulty	Nov 5, 2020	Adversarial AttackBenchmarking	CodeCode Available
InferBench: Understanding Deep Learning Inference Serving with an Automatic Benchmarking System	Nov 4, 2020	Benchmarking	—Unverified
The Forchheim Image Database for Camera Identification in the Wild	Nov 4, 2020	BenchmarkingFact Checking	—Unverified
EEGS: A Transparent Model of Emotions	Nov 4, 2020	Benchmarkingmodel	—Unverified
Face Morphing Attack Generation & Detection: A Comprehensive Survey	Nov 3, 2020	BenchmarkingFace Recognition	—Unverified
Rearrangement: A Challenge for Embodied AI	Nov 3, 2020	Benchmarking	—Unverified
IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP	Nov 2, 2020	BenchmarkingLanguage Modeling	—Unverified
Neural Network Design: Learning from Neural Architecture Search	Nov 1, 2020	Benchmarkingimage-classification	CodeCode Available
Alibaba’s Submission for the WMT 2020 APE Shared Task: Improving Automatic Post-Editing with Pre-trained Conditional Cross-Lingual BERT	Nov 1, 2020	Automatic Post-EditingBenchmarking	—Unverified
Cross-lingual sentiment classification in low-resource Bengali language	Nov 1, 2020	BenchmarkingClassification	CodeCode Available
On the Reliability and Validity of Detecting Approval of Political Actors in Tweets	Nov 1, 2020	BenchmarkingSentiment Analysis	—Unverified
Is Transfer Learning Necessary for Protein Landscape Prediction?	Oct 31, 2020	BenchmarkingPrediction	—Unverified

Show:10 25 50

← PrevPage 96 of 111Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified