Benchmarking

Papers

Showing 4101–4150 of 5548 papers

Title	Date	Tasks	Status
FIMP: Foundation Model-Informed Message Passing for Graph Neural Networks	Oct 17, 2022	Benchmarking, Graph Neural Network	Unverified
Conditional Neural Processes for Molecules	Oct 17, 2022	Bayesian Optimization, Benchmarking	Unverified
DyFEn: Agent-Based Fee Setting in Payment Channel Networks	Oct 15, 2022	Benchmarking, Deep Reinforcement Learning	Unverified
A Survey of Parameters Associated with the Quality of Benchmarks in NLP	Oct 14, 2022	Benchmarking	Unverified
TweetNERD -- End to End Entity Linking Benchmark for Tweets	Oct 14, 2022	Benchmarking, Entity Linking	Code Available
Benchmarking Long-tail Generalization with Likelihood Splits	Oct 13, 2022	Benchmarking, Language Modeling	Code Available
OpenOOD: Benchmarking Generalized Out-of-Distribution Detection	Oct 13, 2022	Anomaly Detection, Benchmarking	Code Available
Simulated Contextual Bandits for Personalization Tasks from Recommendation Datasets	Oct 12, 2022	Benchmarking, Multi-Armed Bandits	Code Available
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems	Oct 11, 2022	Benchmarking, Recommendation Systems	Unverified
Vote'n'Rank: Revision of Benchmarking with Social Choice Theory	Oct 11, 2022	Benchmarking, Result aggregation	Code Available
A Survey of Methods for Addressing Class Imbalance in Deep-Learning Based Natural Language Processing	Oct 10, 2022	Benchmarking, Data Augmentation	Unverified
Quantifying Social Biases Using Templates is Unreliable	Oct 9, 2022	Attribute, Benchmarking	Unverified
Are All Steps Equally Important? Benchmarking Essentiality Detection of Events	Oct 8, 2022	All, Benchmarking	Unverified
Is margin all you need? An extensive empirical study of active learning on tabular data	Oct 7, 2022	Active Learning, All	Unverified
A Theory of Dynamic Benchmarks	Oct 6, 2022	Benchmarking	Unverified
IJCB 2022 Mobile Behavioral Biometrics Competition (MobileB2C)	Oct 6, 2022	Benchmarking	Code Available
SynBench: Task-Agnostic Benchmarking of Pretrained Representations using Synthetic Data	Oct 6, 2022	Benchmarking, Representation Learning	Unverified
MEDFAIR: Benchmarking Fairness for Medical Imaging	Oct 4, 2022	Benchmarking, Fairness	Code Available
Detection and Evaluation of Clusters within Sequential Data	Oct 4, 2022	Benchmarking, Clustering	Unverified
A Framework for Large Scale Synthetic Graph Dataset Generation	Oct 4, 2022	Benchmarking, Dataset Generation	Unverified
Benchmarking Learnt Radio Localisation under Distribution Shift	Oct 4, 2022	Benchmarking	Unverified
The current state of single-cell proteomics data analysis	Oct 3, 2022	Benchmarking	Code Available
DELAD: Deep Landweber-guided deconvolution with Hessian and sparse prior	Sep 30, 2022	Benchmarking, Blind Image Deblurring	Unverified
Benchmarking Learning Efficiency in Deep Reservoir Computing	Sep 29, 2022	Benchmarking	Code Available
Towards Parameter-Efficient Integration of Pre-Trained Language Models In Temporal Video Grounding	Sep 26, 2022	Benchmarking, Natural Language Queries	Code Available
Deep Feature Selection Using a Novel Complementary Feature Mask	Sep 25, 2022	Benchmarking, feature selection	Unverified
Feature Encodings for Gradient Boosting with Automunge	Sep 25, 2022	Benchmarking, Binarization	Unverified
Removal of Ocular Artifacts in EEG Using Deep Learning	Sep 24, 2022	Benchmarking, Deep Learning	Unverified
How Good Is Neural Combinatorial Optimization? A Systematic Evaluation on the Traveling Salesman Problem	Sep 22, 2022	Benchmarking, Combinatorial Optimization	Unverified
Periodic Extrapolative Generalisation in Neural Networks	Sep 21, 2022	Benchmarking	Code Available
Progressive with Purpose: Guiding Progressive Inpainting DNNs through Context and Structure	Sep 21, 2022	Benchmarking, Image Inpainting	Unverified
Benchmarking Apache Spark and Hadoop MapReduce on Big Data Classification	Sep 21, 2022	Benchmarking, Management	Code Available
Benchmarking energy consumption and latency for neuromorphic computing in condensed matter and particle physics	Sep 21, 2022	Anomaly Detection, Benchmarking	Unverified
FACT: Learning Governing Abstractions Behind Integer Sequences	Sep 20, 2022	Benchmarking	Unverified
Feature embedding in click-through rate prediction	Sep 20, 2022	Benchmarking, Click-Through Rate Prediction	Code Available
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning	Sep 19, 2022	Atari Games, Benchmarking	Unverified
Skills and Liquidity Barriers to Youth Employment: Medium-term Evidence from a Cash Benchmarking Experiment in Rwanda	Sep 18, 2022	Benchmarking	Unverified
LAVIS: A Library for Language-Vision Intelligence	Sep 15, 2022	Benchmarking, Image Captioning	Unverified
Is Synthetic Dataset Reliable for Benchmarking Generalizable Person Re-Identification?	Sep 12, 2022	Benchmarking, Generalizable Person Re-identification	Unverified
OpenMixup: Open Mixup Toolbox and Benchmark for Visual Representation Learning	Sep 11, 2022	Benchmarking, Classification	Unverified
Application of Machine Learning for Online Reputation Systems	Sep 10, 2022	Benchmarking, Recommendation Systems	Unverified
FORLORN: A Framework for Comparing Offline Methods and Reinforcement Learning for Optimization of RAN Parameters	Sep 8, 2022	Benchmarking, continuous-control	Code Available
Improving plant disease classification by adaptive minimal ensembling	Sep 8, 2022	Benchmarking, Classification	Unverified
RF Fingerprinting Needs Attention: Multi-task Approach for Real-World WiFi and Bluetooth	Sep 7, 2022	Benchmarking	Unverified
Low Complexity Hybrid Beamforming for mmWave Full-Duplex Integrated Access and Backhaul	Sep 5, 2022	Benchmarking	Code Available
Complexity of Representations in Deep Learning	Sep 1, 2022	Benchmarking, Deep Learning	Unverified
An evaluation framework for comparing causal inference models	Aug 31, 2022	Benchmarking, Causal Inference	Unverified
AutoWS-Bench-101: Benchmarking Automated Weak Supervision with 100 Labels	Aug 30, 2022	Benchmarking	Unverified
Hardware-aware mobile building block evaluation for computer vision	Aug 26, 2022	Benchmarking, Efficient Neural Network	Unverified
Benchmarking Human Face Similarity Using Identical Twins	Aug 25, 2022	Benchmarking	Unverified

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified