Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3651–3675 of 5548 papers

Title	Date	Tasks	Status	Hype
MTEB: Massive Text Embedding Benchmark	Oct 13, 2022	BenchmarkingInformation Retrieval	CodeCode Available	4
OpenOOD: Benchmarking Generalized Out-of-Distribution Detection	Oct 13, 2022	Anomaly DetectionBenchmarking	CodeCode Available	0
Benchmarking Long-tail Generalization with Likelihood Splits	Oct 13, 2022	BenchmarkingLanguage Modeling	CodeCode Available	0
Simulated Contextual Bandits for Personalization Tasks from Recommendation Datasets	Oct 12, 2022	BenchmarkingMulti-Armed Bandits	CodeCode Available	0
Vote'n'Rank: Revision of Benchmarking with Social Choice Theory	Oct 11, 2022	BenchmarkingResult aggregation	CodeCode Available	0
DCL-Net: Deep Correspondence Learning Network for 6D Pose Estimation	Oct 11, 2022	6D Pose Estimation6D Pose Estimation using RGB	CodeCode Available	1
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems	Oct 11, 2022	BenchmarkingRecommendation Systems	—Unverified	0
Benchmarking saliency methods for chest X-ray interpretation	Oct 10, 2022	BenchmarkingDecision Making	CodeCode Available	1
A Survey of Methods for Addressing Class Imbalance in Deep-Learning Based Natural Language Processing	Oct 10, 2022	BenchmarkingData Augmentation	—Unverified	0
Benchmarking Reinforcement Learning Techniques for Autonomous Navigation	Oct 10, 2022	Autonomous NavigationBenchmarking	CodeCode Available	1
Quantifying Social Biases Using Templates is Unreliable	Oct 9, 2022	AttributeBenchmarking	—Unverified	0
ViewFool: Evaluating the Robustness of Visual Recognition to Adversarial Viewpoints	Oct 8, 2022	Autonomous DrivingBenchmarking	CodeCode Available	1
Are All Steps Equally Important? Benchmarking Essentiality Detection of Events	Oct 8, 2022	AllBenchmarking	—Unverified	0
Is margin all you need? An extensive empirical study of active learning on tabular data	Oct 7, 2022	Active LearningAll	—Unverified	0
A Theory of Dynamic Benchmarks	Oct 6, 2022	Benchmarking	—Unverified	0
SynBench: Task-Agnostic Benchmarking of Pretrained Representations using Synthetic Data	Oct 6, 2022	BenchmarkingRepresentation Learning	—Unverified	0
IJCB 2022 Mobile Behavioral Biometrics Competition (MobileB2C)	Oct 6, 2022	Benchmarking	CodeCode Available	0
A Framework for Large Scale Synthetic Graph Dataset Generation	Oct 4, 2022	BenchmarkingDataset Generation	—Unverified	0
Benchmarking Learnt Radio Localisation under Distribution Shift	Oct 4, 2022	Benchmarking	—Unverified	0
MEDFAIR: Benchmarking Fairness for Medical Imaging	Oct 4, 2022	BenchmarkingFairness	CodeCode Available	0
Detection and Evaluation of Clusters within Sequential Data	Oct 4, 2022	BenchmarkingClustering	—Unverified	0
rPPG-Toolbox: Deep Remote PPG Toolbox	Oct 3, 2022	BenchmarkingData Augmentation	CodeCode Available	2
The current state of single-cell proteomics data analysis	Oct 3, 2022	Benchmarking	CodeCode Available	0
DELAD: Deep Landweber-guided deconvolution with Hessian and sparse prior	Sep 30, 2022	BenchmarkingBlind Image Deblurring	—Unverified	0
State-specific protein-ligand complex structure prediction with a multi-scale deep generative model	Sep 30, 2022	BenchmarkingBlind Docking	CodeCode Available	2

Show:10 25 50

← PrevPage 147 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified