Benchmarking

Papers

Showing 4101–4125 of 5548 papers

Title	Date	Tasks	Status
FIMP: Foundation Model-Informed Message Passing for Graph Neural Networks	Oct 17, 2022	Benchmarking, Graph Neural Network	Unverified
Conditional Neural Processes for Molecules	Oct 17, 2022	Bayesian Optimization, Benchmarking	Unverified
DyFEn: Agent-Based Fee Setting in Payment Channel Networks	Oct 15, 2022	Benchmarking, Deep Reinforcement Learning	Unverified
A Survey of Parameters Associated with the Quality of Benchmarks in NLP	Oct 14, 2022	Benchmarking	Unverified
TweetNERD -- End to End Entity Linking Benchmark for Tweets	Oct 14, 2022	Benchmarking, Entity Linking	Code Available
Benchmarking Long-tail Generalization with Likelihood Splits	Oct 13, 2022	Benchmarking, Language Modeling	Code Available
OpenOOD: Benchmarking Generalized Out-of-Distribution Detection	Oct 13, 2022	Anomaly Detection, Benchmarking	Code Available
Simulated Contextual Bandits for Personalization Tasks from Recommendation Datasets	Oct 12, 2022	Benchmarking, Multi-Armed Bandits	Code Available
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems	Oct 11, 2022	Benchmarking, Recommendation Systems	Unverified
Vote'n'Rank: Revision of Benchmarking with Social Choice Theory	Oct 11, 2022	Benchmarking, Result aggregation	Code Available
A Survey of Methods for Addressing Class Imbalance in Deep-Learning Based Natural Language Processing	Oct 10, 2022	Benchmarking, Data Augmentation	Unverified
Quantifying Social Biases Using Templates is Unreliable	Oct 9, 2022	Attribute, Benchmarking	Unverified
Are All Steps Equally Important? Benchmarking Essentiality Detection of Events	Oct 8, 2022	All, Benchmarking	Unverified
Is margin all you need? An extensive empirical study of active learning on tabular data	Oct 7, 2022	Active Learning, All	Unverified
A Theory of Dynamic Benchmarks	Oct 6, 2022	Benchmarking	Unverified
IJCB 2022 Mobile Behavioral Biometrics Competition (MobileB2C)	Oct 6, 2022	Benchmarking	Code Available
SynBench: Task-Agnostic Benchmarking of Pretrained Representations using Synthetic Data	Oct 6, 2022	Benchmarking, Representation Learning	Unverified
MEDFAIR: Benchmarking Fairness for Medical Imaging	Oct 4, 2022	Benchmarking, Fairness	Code Available
Detection and Evaluation of Clusters within Sequential Data	Oct 4, 2022	Benchmarking, Clustering	Unverified
A Framework for Large Scale Synthetic Graph Dataset Generation	Oct 4, 2022	Benchmarking, Dataset Generation	Unverified
Benchmarking Learnt Radio Localisation under Distribution Shift	Oct 4, 2022	Benchmarking	Unverified
The current state of single-cell proteomics data analysis	Oct 3, 2022	Benchmarking	Code Available
DELAD: Deep Landweber-guided deconvolution with Hessian and sparse prior	Sep 30, 2022	Benchmarking, Blind Image Deblurring	Unverified
Benchmarking Learning Efficiency in Deep Reservoir Computing	Sep 29, 2022	Benchmarking	Code Available
Towards Parameter-Efficient Integration of Pre-Trained Language Models In Temporal Video Grounding	Sep 26, 2022	Benchmarking, Natural Language Queries	Code Available

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified