Benchmarking

Papers

Showing 5001–5025 of 5548 papers

Title	Date	Tasks	Status
Enhancing 3D-Air Signature by Pen Tip Tail Trajectory Awareness: Dataset and Featuring by Novel Spatio-temporal CNN	Jan 5, 2024	Benchmarking	Code Available
Neurological Prognostication of Post-Cardiac-Arrest Coma Patients Using EEG Data: A Dynamic Survival Analysis Framework with Competing Risks	Aug 17, 2023	Benchmarking, EEG	Code Available
Asynchronous Batch Bayesian Optimization with Pipelining Evaluations for Experimental Resource–constrained Conditions	Dec 5, 2024	Bayesian Optimization, Benchmarking	Code Available
NeuroMorse: A Temporally Structured Dataset For Neuromorphic Computing	Feb 28, 2025	Benchmarking	Code Available
NeuroSim V1.5: Improved Software Backbone for Benchmarking Compute-in-Memory Accelerators with Device and Circuit-level Non-idealities	May 5, 2025	Benchmarking, Quantization	Code Available
EnergyStar++: Towards more accurate and explanatory building energy benchmarking	Oct 30, 2019	Benchmarking, energy management	Code Available
Accelerating Large-Scale Inference with Anisotropic Vector Quantization	Aug 27, 2019	Benchmarking, Quantization	Code Available
A survey of probabilistic generative frameworks for molecular simulations	Nov 14, 2024	Benchmarking, Denoising	Code Available
Benchmarking neural embeddings for link prediction in knowledge graphs under semantic and structural changes	May 15, 2020	Benchmarking, Knowledge Graph Completion	Code Available
EmProx: Neural Network Performance Estimation For Neural Architecture Search	Jun 13, 2022	Benchmarking, Decoder	Code Available
NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates	Oct 28, 2024	Benchmarking	Code Available
A comparison of translation performance between DeepL and Supertext	Feb 4, 2025	Benchmarking, Machine Translation	Code Available
Benchmarking Multimodal RAG through a Chart-based Document Question-Answering Generation Framework	Feb 20, 2025	Benchmarking, Question Answering	Code Available
Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program	Apr 9, 2025	Benchmarking	Code Available
Benchmarking Machine Translation with Cultural Awareness	May 23, 2023	Benchmarking, In-Context Learning	Code Available
Benchmarking Multilabel Topic Classification in the Kyrgyz Language	Aug 30, 2023	Benchmarking, Classification	Code Available
Unsupervised Tracklet Person Re-Identification	Mar 1, 2019	Benchmarking, Domain Adaptation	Code Available
Empirical Study of Off-Policy Policy Evaluation for Reinforcement Learning	Nov 15, 2019	Benchmarking, Diversity	Code Available
TMPNN: High-Order Polynomial Regression Based on Taylor Map Factorization	Jul 30, 2023	Benchmarking, Multi-target regression	Code Available
Nmbr9 as a Constraint Programming Challenge	Jan 13, 2020	Benchmarking, Board Games	Code Available
EFSA: Towards Event-Level Financial Sentiment Analysis	Apr 8, 2024	Articles, Benchmarking	Code Available
Efficient, Uncertainty-based Moderation of Neural Networks Text Classifiers	Apr 4, 2022	Benchmarking	Code Available
Efficient Realistic Data Generation Framework leveraging Deep Learning-based Human Digitization	Jun 28, 2021	Benchmarking, Deep Learning	Code Available
Efficient Performance Tracking: Leveraging Large Language Models for Automated Construction of Scientific Leaderboards	Sep 19, 2024	Benchmarking	Code Available
Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning	Jun 18, 2024	Benchmarking, World Knowledge	Code Available

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified