Benchmarking

Papers



Title	Date	Tasks	Status	Hype
ABSA-Bench: Towards the Unified Evaluation of Aspect-based Sentiment Analysis Research	Dec 1, 2020	Aspect-Based Sentiment Analysis, Aspect-Based Sentiment Analysis (ABSA)	Unverified	0
Benchmarking Automated Review Response Generation for the Hospitality Domain	Dec 1, 2020	Benchmarking, Domain Adaptation	Unverified	0
AraBench: Benchmarking Dialectal Arabic-English Machine Translation	Dec 1, 2020	Benchmarking, Data Augmentation	Unverified	0
mlOSP: Towards a Unified Implementation of Regression Monte Carlo Algorithms	Dec 1, 2020	Benchmarking, BIG-bench Machine Learning	Code Available	0
Evaluating Attribution for Graph Neural Networks	Dec 1, 2020	Benchmarking	Code Available	1
Bayesian Multi-type Mean Field Multi-agent Imitation Learning	Dec 1, 2020	Benchmarking, Imitation Learning	Unverified	0
Meta learning to classify intent and slot labels with noisy few shot examples	Nov 30, 2020	Benchmarking, intent-classification	Unverified	0
PMLB v1.0: An open source dataset collection for benchmarking machine learning methods	Nov 30, 2020	Benchmarking, BIG-bench Machine Learning	Code Available	1
RealCause: Realistic Causal Inference Benchmarking	Nov 30, 2020	Benchmarking, Causal Inference	Unverified	0
Class-agnostic Object Detection	Nov 28, 2020	Benchmarking, Class-agnostic Object Detection	Unverified	0
A survey of benchmarking frameworks for reinforcement learning	Nov 27, 2020	Benchmarking, reinforcement-learning	Unverified	0
Improving Augmentation and Evaluation Schemes for Semantic Image Synthesis	Nov 25, 2020	Benchmarking, Data Augmentation	Unverified	0
Cable Tree Wiring -- Benchmarking Solvers on a Real-World Scheduling Problem with a Variety of Precedence Constraints	Nov 25, 2020	Benchmarking, Scheduling	Code Available	0
Benchmarking Image Retrieval for Visual Localization	Nov 24, 2020	Autonomous Driving, Benchmarking	Code Available	1
Benchmarking Inference Performance of Deep Learning Models on Analog Devices	Nov 24, 2020	Benchmarking, Deep Learning	Unverified	0
RobustPointSet: A Dataset for Benchmarking Robustness of Point Cloud Classifiers	Nov 23, 2020	3D Point Cloud Classification, Benchmarking	Code Available	1
Spatially Correlated Patterns in Adversarial Images	Nov 21, 2020	Benchmarking, Blocking	Unverified	0
Variational Laplace for Bayesian neural networks	Nov 20, 2020	Benchmarking, Variational Inference	Unverified	0
FedEval: A Holistic Evaluation Framework for Federated Learning	Nov 19, 2020	Benchmarking, Federated Learning	Unverified	0
Automatic Microprocessor Performance Bug Detection	Nov 17, 2020	Benchmarking	Unverified	0
Real-Time Polyp Detection, Localization and Segmentation in Colonoscopy Using Deep Learning	Nov 15, 2020	Benchmarking, Colorectal Polyps Characterization	Code Available	1
Tonic: A Deep Reinforcement Learning Library for Fast Prototyping and Benchmarking	Nov 15, 2020	Benchmarking, continuous-control	Code Available	1
SoftGym: Benchmarking Deep Reinforcement Learning for Deformable Object Manipulation	Nov 14, 2020	Benchmarking, Deep Reinforcement Learning	Code Available	1
Benchmarking Domain Randomisation for Visual Sim-to-Real Transfer	Nov 13, 2020	Benchmarking, Pose Estimation	Unverified	0
tvopt: A Python Framework for Time-Varying Optimization	Nov 12, 2020	Benchmarking	Code Available	1


Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified