Benchmarking

Papers

Showing 1301–1350 of 5548 papers

Title	Date	Tasks	Status	Hype
Benchmarking Differential Privacy and Federated Learning for BERT Models	Jun 26, 2021	Benchmarking, Federated Learning	Code Available	1
You are AllSet: A Multiset Function Framework for Hypergraph Neural Networks	Jun 24, 2021	Benchmarking, Node Classification	Code Available	1
Mutual-Information Based Few-Shot Classification	Jun 23, 2021	Benchmarking, Classification	Code Available	1
Synthetic Benchmarks for Scientific Research in Explainable Machine Learning	Jun 23, 2021	Benchmarking, BIG-bench Machine Learning	Code Available	1
Underwater Image Restoration via Contrastive Learning and a Real-world Dataset	Jun 20, 2021	Benchmarking, Contrastive Learning	Code Available	1
Intrinsic Image Harmonization	Jun 19, 2021	Benchmarking, Image Harmonization	Code Available	1
Perception Matters: Detecting Perception Failures of VQA Models Using Metamorphic Testing	Jun 19, 2021	Benchmarking, DNN Testing	Code Available	1
Understanding and Evaluating Racial Biases in Image Captioning	Jun 16, 2021	Benchmarking, Image Captioning	Code Available	1
Selection of Source Images Heavily Influences the Effectiveness of Adversarial Attacks	Jun 14, 2021	Benchmarking	Code Available	1
Online Learning with Optimism and Delay	Jun 13, 2021	Benchmarking, Weather Forecasting	Code Available	1
Shades of BLEU, Flavours of Success: The Case of MultiWOZ	Jun 10, 2021	Benchmarking, Task-Oriented Dialogue Systems	Code Available	1
Signals to Spikes for Neuromorphic Regulated Reservoir Computing and EMG Hand Gesture Recognition	Jun 9, 2021	Benchmarking, EMG Gesture Recognition	Code Available	1
RobustNav: Towards Benchmarking Robustness in Embodied Navigation	Jun 8, 2021	Benchmarking, Data Augmentation	Code Available	1
Benchmarking Bias Mitigation Algorithms in Representation Learning through Fairness Metrics	Jun 8, 2021	Age And Gender Classification, Benchmarking	Code Available	1
EXPObench: Benchmarking Surrogate-based Optimisation Algorithms on Expensive Black-box Functions	Jun 8, 2021	Bayesian Optimisation, Benchmarking	Code Available	1
The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation	Jun 8, 2021	Benchmarking, Decision Making	Code Available	1
DFGC 2021: A DeepFake Game Competition	Jun 2, 2021	Benchmarking, DeepFake Detection	Code Available	1
FedScale: Benchmarking Model and System Performance of Federated Learning at Scale	May 24, 2021	Benchmarking, Federated Learning	Code Available	1
Benchmarking the Performance of Bayesian Optimization across Multiple Experimental Materials Science Domains	May 23, 2021	Active Learning, Bayesian Optimisation	Code Available	1
Anabranch Network for Camouflaged Object Segmentation	May 20, 2021	Benchmarking, Camouflaged Object Segmentation	Code Available	1
DACBench: A Benchmark Library for Dynamic Algorithm Configuration	May 18, 2021	Benchmarking	Code Available	1
Multimodal Fusion via Teacher-Student Network for Indoor Action Recognition	May 18, 2021	Action Recognition, Action Recognition In Videos	Code Available	1
Best practices for constructing, preparing, and evaluating protein-ligand binding affinity benchmarks	May 13, 2021	Benchmarking, Drug Discovery	Code Available	1
A Reinforcement Learning Environment for Multi-Service UAV-enabled Wireless Systems	May 11, 2021	Benchmarking, Edge-computing	Code Available	1
AnomalyHop: An SSL-based Image Anomaly Localization Method	May 8, 2021	Anomaly Localization, Benchmarking	Code Available	1
D2S: Document-to-Slide Generation Via Query-Based Text Summarization	May 8, 2021	Benchmarking, Long Form Question Answering	Code Available	1
Open Radar Initiative: Large Scale Dataset for Benchmarking of micro-Doppler Recognition Algorithms	May 7, 2021	Benchmarking	Code Available	1
dEchorate: a Calibrated Room Impulse Response Database for Echo-aware Signal Processing	Apr 27, 2021	Benchmarking, Retrieval	Code Available	1
2.5D Visual Relationship Detection	Apr 26, 2021	Benchmarking, Depth Estimation	Code Available	1
Knodle: Modular Weakly Supervised Learning with PyTorch	Apr 23, 2021	Benchmarking, BIG-bench Machine Learning	Code Available	1
Data Generating Process to Evaluate Causal Discovery Techniques for Time Series Data	Apr 16, 2021	Benchmarking, Causal Discovery	Code Available	1
Towards Standardising Reinforcement Learning Approaches for Production Scheduling Problems	Apr 16, 2021	Benchmarking, reinforcement-learning	Code Available	1
Is Multi-Hop Reasoning Really Explainable? Towards Benchmarking Reasoning Interpretability	Apr 14, 2021	Benchmarking, Link Prediction	Code Available	1
Safety-enhanced UAV Path Planning with Spherical Vector-based Particle Swarm Optimization	Apr 13, 2021	Benchmarking, Metaheuristic Optimization	Code Available	1
StylePTB: A Compositional Benchmark for Fine-grained Controllable Text Style Transfer	Apr 12, 2021	Benchmarking, Sentence	Code Available	1
Robust Semantic Interpretability: Revisiting Concept Activation Vectors	Apr 6, 2021	Benchmarking, counterfactual	Code Available	1
CBench: Towards Better Evaluation of Question Answering Over Knowledge Graphs	Apr 5, 2021	Benchmarking, Knowledge Graphs	Code Available	1
Remote Sensing Image Classification with the SEN12MS Dataset	Apr 1, 2021	Benchmarking, Classification	Code Available	1
Simultaneous Navigation and Construction Benchmarking Environments	Mar 31, 2021	Benchmarking, Deep Reinforcement Learning	Code Available	1
Benchmarks for Deep Off-Policy Evaluation	Mar 30, 2021	Benchmarking, continuous-control	Code Available	1
3D AffordanceNet: A Benchmark for Visual Object Affordance Understanding	Mar 30, 2021	Affordance Detection, Benchmarking	Code Available	1
SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events	Mar 29, 2021	Autonomous Vehicles, Benchmarking	Code Available	1
Marine Snow Removal Benchmarking Dataset	Mar 26, 2021	Benchmarking, Sand	Code Available	1
Learning to Optimize: A Primer and A Benchmark	Mar 23, 2021	Benchmarking	Code Available	1
Neural Multi-Hop Reasoning With Logical Rules on Biomedical Knowledge Graphs	Mar 18, 2021	Benchmarking, Knowledge Graphs	Code Available	1
SHARP: Environment and Person Independent Activity Recognition with Commodity IEEE 802.11 Access Points	Mar 17, 2021	Activity Recognition, Benchmarking	Code Available	1
A Large-Scale Dataset for Benchmarking Elevator Button Segmentation and Character Recognition	Mar 16, 2021	Benchmarking, Position	Code Available	1
The Effect of Domain and Diacritics in Yorùbá-English Neural Machine Translation	Mar 15, 2021	Benchmarking, Domain Adaptation	Code Available	1
Recent Advances on Neural Network Pruning at Initialization	Mar 11, 2021	Benchmarking, Network Pruning	Code Available	1
A Computed Tomography Vertebral Segmentation Dataset with Anatomical Variations and Multi-Vendor Scanner Data	Mar 10, 2021	Anatomy, Benchmarking	Code Available	1

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified