Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 5026–5050 of 5548 papers

Title	Date	Tasks	Status
Benchmarking multi-component signal processing methods in the time-frequency plane	Feb 13, 2024	BenchmarkingDenoising	CodeCode Available
Efficiently solving the thief orienteering problem with a max-min ant colony optimization approach	Sep 21, 2021	Benchmarking	CodeCode Available
A Comparative Analysis of Word-Level Metric Differential Privacy: Benchmarking The Privacy-Utility Trade-off	Apr 4, 2024	Benchmarking	CodeCode Available
Benchmarking MOEAs for solving continuous multi-objective RL problems	May 19, 2025	BenchmarkingEvolutionary Algorithms	CodeCode Available
NoiseBench: Benchmarking the Impact of Real Label Noise on Named Entity Recognition	May 13, 2024	Benchmarkingnamed-entity-recognition	CodeCode Available
Benchmarking Model-Based Reinforcement Learning	Jul 3, 2019	Benchmarkingmodel	CodeCode Available
Benchmarking Misuse Mitigation Against Covert Adversaries	Jun 6, 2025	BenchmarkingLanguage Modeling	CodeCode Available
To Find Waldo You Need Contextual Cues: Debiasing Who's Waldo	Mar 30, 2022	BenchmarkingPerson-centric Visual Grounding	CodeCode Available
Noisy Ostracods: A Fine-Grained, Imbalanced Real-World Dataset for Benchmarking Robust Machine Learning and Label Correction Methods	Dec 3, 2024	Benchmarking	CodeCode Available
No Metric to Rule Them All: Toward Principled Evaluations of Graph-Learning Datasets	Feb 4, 2025	AllBenchmarking	CodeCode Available
To Find Waldo You Need Contextual Cues: Debiasing Who’s Waldo	May 1, 2022	BenchmarkingPerson-centric Visual Grounding	CodeCode Available
AstroVision: Towards Autonomous Feature Detection and Description for Missions to Small Bodies Using Deep Learning	Aug 3, 2022	Benchmarking	CodeCode Available
AKFruitYield: Modular benchmarking and video analysis software for Azure Kinect cameras for fruit size and fruit yield estimation in apple orchards	Oct 6, 2023	Benchmarking	CodeCode Available
ShuffleMix: Improving Representations via Channel-Wise Shuffle of Interpolated Hidden States	May 30, 2023	BenchmarkingData Augmentation	CodeCode Available
NorEval: A Norwegian Language Understanding and Generation Evaluation Benchmark	Apr 10, 2025	Benchmarking	CodeCode Available
A Stepwise, Label-based Approach for Improving the Adversarial Training in Unsupervised Video Summarization	Oct 21, 2019	BenchmarkingUnsupervised Video Summarization	CodeCode Available
Assigning Species Information to Corresponding Genes by a Sequence Labeling Framework	May 8, 2022	BenchmarkingBinary Classification	CodeCode Available
ASR Benchmarking: Need for a More Representative Conversational Dataset	Sep 18, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
Benchmarking missing-values approaches for predictive models on health databases	Feb 17, 2022	AttributeBenchmarking	CodeCode Available
SignalGP-Lite: Event Driven Genetic Programming Library for Large-Scale Artificial Life Applications	Aug 1, 2021	Artificial LifeBenchmarking	CodeCode Available
Benchmarking Minimax Linkage	Jun 7, 2019	BenchmarkingClustering	CodeCode Available
Efficient and Effective Model Extraction	Sep 21, 2024	Benchmarkingmodel	CodeCode Available
Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language Recognition	Nov 1, 2022	BenchmarkingDisentanglement	CodeCode Available
signSGD with Majority Vote is Communication Efficient And Fault Tolerant	Oct 11, 2018	Benchmarking	CodeCode Available
To Model or to Intervene: A Comparison of Counterfactual and Online Learning to Rank from User Interactions	Jul 15, 2019	Benchmarkingcounterfactual	CodeCode Available

Show:10 25 50

← PrevPage 202 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified