Benchmarking

Papers

Showing 5001–5050 of 5548 papers

Title	Date	Tasks	Status
Enhancing 3D-Air Signature by Pen Tip Tail Trajectory Awareness: Dataset and Featuring by Novel Spatio-temporal CNN	Jan 5, 2024	Benchmarking	Code Available
Neurological Prognostication of Post-Cardiac-Arrest Coma Patients Using EEG Data: A Dynamic Survival Analysis Framework with Competing Risks	Aug 17, 2023	Benchmarking, EEG	Code Available
Asynchronous Batch Bayesian Optimization with Pipelining Evaluations for Experimental Resource–constrained Conditions	Dec 5, 2024	Bayesian Optimization, Benchmarking	Code Available
NeuroMorse: A Temporally Structured Dataset For Neuromorphic Computing	Feb 28, 2025	Benchmarking	Code Available
NeuroSim V1.5: Improved Software Backbone for Benchmarking Compute-in-Memory Accelerators with Device and Circuit-level Non-idealities	May 5, 2025	Benchmarking, Quantization	Code Available
EnergyStar++: Towards more accurate and explanatory building energy benchmarking	Oct 30, 2019	Benchmarking, energy management	Code Available
Accelerating Large-Scale Inference with Anisotropic Vector Quantization	Aug 27, 2019	Benchmarking, Quantization	Code Available
A survey of probabilistic generative frameworks for molecular simulations	Nov 14, 2024	Benchmarking, Denoising	Code Available
Benchmarking neural embeddings for link prediction in knowledge graphs under semantic and structural changes	May 15, 2020	Benchmarking, Knowledge Graph Completion	Code Available
EmProx: Neural Network Performance Estimation For Neural Architecture Search	Jun 13, 2022	Benchmarking, Decoder	Code Available
NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates	Oct 28, 2024	Benchmarking	Code Available
A comparison of translation performance between DeepL and Supertext	Feb 4, 2025	Benchmarking, Machine Translation	Code Available
Benchmarking Multimodal RAG through a Chart-based Document Question-Answering Generation Framework	Feb 20, 2025	Benchmarking, Question Answering	Code Available
Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program	Apr 9, 2025	Benchmarking	Code Available
Benchmarking Machine Translation with Cultural Awareness	May 23, 2023	Benchmarking, In-Context Learning	Code Available
Benchmarking Multilabel Topic Classification in the Kyrgyz Language	Aug 30, 2023	Benchmarking, Classification	Code Available
Unsupervised Tracklet Person Re-Identification	Mar 1, 2019	Benchmarking, Domain Adaptation	Code Available
Empirical Study of Off-Policy Policy Evaluation for Reinforcement Learning	Nov 15, 2019	Benchmarking, Diversity	Code Available
TMPNN: High-Order Polynomial Regression Based on Taylor Map Factorization	Jul 30, 2023	Benchmarking, Multi-target regression	Code Available
Nmbr9 as a Constraint Programming Challenge	Jan 13, 2020	Benchmarking, Board Games	Code Available
EFSA: Towards Event-Level Financial Sentiment Analysis	Apr 8, 2024	Articles, Benchmarking	Code Available
Efficient, Uncertainty-based Moderation of Neural Networks Text Classifiers	Apr 4, 2022	Benchmarking	Code Available
Efficient Realistic Data Generation Framework leveraging Deep Learning-based Human Digitization	Jun 28, 2021	Benchmarking, Deep Learning	Code Available
Efficient Performance Tracking: Leveraging Large Language Models for Automated Construction of Scientific Leaderboards	Sep 19, 2024	Benchmarking	Code Available
Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning	Jun 18, 2024	Benchmarking, World Knowledge	Code Available
Benchmarking multi-component signal processing methods in the time-frequency plane	Feb 13, 2024	Benchmarking, Denoising	Code Available
Efficiently solving the thief orienteering problem with a max-min ant colony optimization approach	Sep 21, 2021	Benchmarking	Code Available
A Comparative Analysis of Word-Level Metric Differential Privacy: Benchmarking The Privacy-Utility Trade-off	Apr 4, 2024	Benchmarking	Code Available
Benchmarking MOEAs for solving continuous multi-objective RL problems	May 19, 2025	Benchmarking, Evolutionary Algorithms	Code Available
NoiseBench: Benchmarking the Impact of Real Label Noise on Named Entity Recognition	May 13, 2024	Benchmarking, named-entity-recognition	Code Available
Benchmarking Model-Based Reinforcement Learning	Jul 3, 2019	Benchmarking, model	Code Available
Benchmarking Misuse Mitigation Against Covert Adversaries	Jun 6, 2025	Benchmarking, Language Modeling	Code Available
To Find Waldo You Need Contextual Cues: Debiasing Who's Waldo	Mar 30, 2022	Benchmarking, Person-centric Visual Grounding	Code Available
Noisy Ostracods: A Fine-Grained, Imbalanced Real-World Dataset for Benchmarking Robust Machine Learning and Label Correction Methods	Dec 3, 2024	Benchmarking	Code Available
No Metric to Rule Them All: Toward Principled Evaluations of Graph-Learning Datasets	Feb 4, 2025	All, Benchmarking	Code Available
To Find Waldo You Need Contextual Cues: Debiasing Who’s Waldo	May 1, 2022	Benchmarking, Person-centric Visual Grounding	Code Available
AstroVision: Towards Autonomous Feature Detection and Description for Missions to Small Bodies Using Deep Learning	Aug 3, 2022	Benchmarking	Code Available
AKFruitYield: Modular benchmarking and video analysis software for Azure Kinect cameras for fruit size and fruit yield estimation in apple orchards	Oct 6, 2023	Benchmarking	Code Available
ShuffleMix: Improving Representations via Channel-Wise Shuffle of Interpolated Hidden States	May 30, 2023	Benchmarking, Data Augmentation	Code Available
NorEval: A Norwegian Language Understanding and Generation Evaluation Benchmark	Apr 10, 2025	Benchmarking	Code Available
A Stepwise, Label-based Approach for Improving the Adversarial Training in Unsupervised Video Summarization	Oct 21, 2019	Benchmarking, Unsupervised Video Summarization	Code Available
Assigning Species Information to Corresponding Genes by a Sequence Labeling Framework	May 8, 2022	Benchmarking, Binary Classification	Code Available
ASR Benchmarking: Need for a More Representative Conversational Dataset	Sep 18, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available
Benchmarking missing-values approaches for predictive models on health databases	Feb 17, 2022	Attribute, Benchmarking	Code Available
SignalGP-Lite: Event Driven Genetic Programming Library for Large-Scale Artificial Life Applications	Aug 1, 2021	Artificial Life, Benchmarking	Code Available
Benchmarking Minimax Linkage	Jun 7, 2019	Benchmarking, Clustering	Code Available
Efficient and Effective Model Extraction	Sep 21, 2024	Benchmarking, model	Code Available
Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language Recognition	Nov 1, 2022	Benchmarking, Disentanglement	Code Available
signSGD with Majority Vote is Communication Efficient And Fault Tolerant	Oct 11, 2018	Benchmarking	Code Available
To Model or to Intervene: A Comparison of Counterfactual and Online Learning to Rank from User Interactions	Jul 15, 2019	Benchmarking, counterfactual	Code Available

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified