Benchmarking

Papers


Showing 4001–4050 of 5548 papers

Title	Date	Tasks	Status	Hype
The importance of being constrained: dealing with infeasible solutions in Differential Evolution and beyond	Mar 7, 2022	Benchmarking	Code Available	1
Systematic Comparison of Path Planning Algorithms using PathBench	Mar 7, 2022	Benchmarking	Unverified	0
Multi-channel deep convolutional neural networks for multi-classifying thyroid disease	Mar 6, 2022	Benchmarking, Binary Classification	Unverified	0
Automated Machine Learning: A Case Study on Non-Intrusive Appliance Load Monitoring	Mar 6, 2022	AutoML, Bayesian Optimization	Unverified	0
A Large-scale Comprehensive Dataset and Copy-overlap Aware Evaluation Protocol for Segment-level Video Copy Detection	Mar 5, 2022	Benchmarking, Copy Detection	Code Available	1
Just Rank: Rethinking Evaluation with Word and Sentence Similarities	Mar 5, 2022	Benchmarking, Semantic Similarity	Code Available	1
Benchmarking real-time algorithms for in-phase auditory stimulation of low amplitude slow waves with wearable EEG devices during sleep	Mar 4, 2022	Benchmarking, Computational Efficiency	Unverified	0
Benchmarking Instance-Centric Counterfactual Algorithms for XAI: From White Box to Black Box	Mar 4, 2022	Benchmarking, counterfactual	Code Available	0
Graph clustering with Boltzmann machines	Mar 4, 2022	Benchmarking, Clustering	Unverified	0
Towards Benchmarking and Evaluating Deepfake Detection	Mar 4, 2022	Benchmarking, DeepFake Detection	Unverified	0
KamNet: An Integrated Spatiotemporal Deep Neural Network for Rare Event Search in KamLAND-Zen	Mar 3, 2022	Benchmarking	Code Available	0
HOI4D: A 4D Egocentric Dataset for Category-Level Human-Object Interaction	Mar 3, 2022	Action Segmentation, Benchmarking	Code Available	1
Mukayese: Turkish NLP Strikes Back	Mar 2, 2022	Benchmarking, Language Modeling	Code Available	1
3D Common Corruptions and Data Augmentation	Mar 2, 2022	Benchmarking, Data Augmentation	Code Available	1
Adaptive Gradient Methods with Local Guarantees	Mar 2, 2022	Benchmarking	Unverified	0
Benchmarking Robustness of Deep Learning Classifiers Using Two-Factor Perturbation	Mar 2, 2022	Benchmarking, Deep Learning	Unverified	0
Reliable validation of Reinforcement Learning Benchmarks	Mar 2, 2022	Benchmarking, Data Compression	Unverified	0
A predictive analytics approach for stroke prediction using machine learning and neural networks	Mar 1, 2022	Benchmarking, BIG-bench Machine Learning	Code Available	0
Towards IID representation learning and its application on biomedical data	Mar 1, 2022	Benchmarking, Representation Learning	Code Available	0
GraphWorld: Fake Graphs Bring Real Insights for GNNs	Feb 28, 2022	Benchmarking	Code Available	1
PMC-Patients: A Large-scale Dataset of Patient Summaries and Relations for Benchmarking Retrieval-based Clinical Decision Support Systems	Feb 28, 2022	Articles, Benchmarking	Code Available	1
Towards Class-agnostic Tracking Using Feature Decorrelation in Point Clouds	Feb 28, 2022	Benchmarking, Object Tracking	Unverified	0
Prepare for Trouble and Make it Double. Supervised and Unsupervised Stacking for AnomalyBased Intrusion Detection	Feb 28, 2022	Benchmarking, Intrusion Detection	Unverified	0
Generalised Gaussian Process Latent Variable Models (GPLVM) with Stochastic Variational Inference	Feb 25, 2022	Benchmarking, Dimensionality Reduction	Unverified	0
Spatio-Temporal Latent Graph Structure Learning for Traffic Forecasting	Feb 25, 2022	Benchmarking, Graph Neural Network	Unverified	0
SUTD-PRCM Dataset and Neural Architecture Search Approach for Complex Metasurface Design	Feb 24, 2022	Benchmarking, image-classification	Unverified	0
Measuring CLEVRness: Blackbox testing of Visual Reasoning Models	Feb 24, 2022	Benchmarking, Diagnostic	Unverified	0
Benchmarking Generative Latent Variable Models for Speech	Feb 22, 2022	Benchmarking, Image Generation	Code Available	0
Evaluating Feature Attribution Methods in the Image Domain	Feb 22, 2022	Benchmarking	Code Available	0
Benchmarking the Linear Algebra Awareness of TensorFlow and PyTorch	Feb 20, 2022	Benchmarking	Code Available	0
How to Manage Tiny Machine Learning at Scale: An Industrial Perspective	Feb 18, 2022	Benchmarking, BIG-bench Machine Learning	Code Available	0
Rethinking Pareto Frontier for Performance Evaluation of Deep Neural Networks	Feb 18, 2022	Benchmarking, Deep Learning	Unverified	0
MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery	Feb 18, 2022	Benchmarking, Representation Learning	Code Available	1
Benchmarking missing-values approaches for predictive models on health databases	Feb 17, 2022	Attribute, Benchmarking	Code Available	0
On loss functions and evaluation metrics for music source separation	Feb 16, 2022	Audio Source Separation, Benchmarking	Unverified	0
Benchmarking of DL Libraries and Models on Mobile Devices	Feb 14, 2022	Benchmarking, GPU	Code Available	1
Benchmarking Online Sequence-to-Sequence and Character-based Handwriting Recognition from IMU-Enhanced Pens	Feb 14, 2022	Benchmarking, Handwriting Recognition	Unverified	0
Benchmarking Robot Manipulation with the Rubik's Cube	Feb 14, 2022	Benchmarking, Robot Manipulation	Unverified	0
MetaShift: A Dataset of Datasets for Evaluating Contextual Distribution Shifts and Training Conflicts	Feb 14, 2022	Benchmarking	Code Available	1
Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training Benchmark	Feb 14, 2022	Benchmarking, Contrastive Learning	Code Available	0
Dual Task Framework for Improving Persona-grounded Dialogue Dataset	Feb 11, 2022	Benchmarking	Unverified	0
High Fidelity RF Clutter Modeling and Simulation	Feb 10, 2022	Benchmarking, Vocal Bursts Intensity Prediction	Unverified	0
Lightweight Jet Reconstruction and Identification as an Object Detection Task	Feb 9, 2022	Benchmarking, object-detection	Unverified	0
BIQ2021: A Large-Scale Blind Image Quality Assessment Database	Feb 8, 2022	Benchmarking, Blind Image Quality Assessment	Unverified	0
ECRECer: Enzyme Commission Number Recommendation and Benchmarking based on Multiagent Dual-core Learning	Feb 8, 2022	Benchmarking, Language Modelling	Code Available	1
Comparative Study Between Distance Measures On Supervised Optimum-Path Forest Classification	Feb 8, 2022	Anomaly Detection, Benchmarking	Code Available	0
What are the best systems? New perspectives on NLP Benchmarking	Feb 8, 2022	Benchmarking	Code Available	1
RECOVER: sequential model optimization platform for combination drug repurposing identifies novel synergistic compounds in vitro	Feb 7, 2022	Benchmarking, Model Optimization	Code Available	1
Theory-inspired Parameter Control Benchmarks for Dynamic Algorithm Configuration	Feb 7, 2022	Benchmarking, Evolutionary Algorithms	Code Available	0
Benchmarking and Analyzing Point Cloud Classification under Corruptions	Feb 7, 2022	Benchmarking, Classification	Code Available	1



Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified