Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 5401–5425 of 5548 papers

Title	Date	Tasks	Status
Probing Acoustic Representations for Phonetic Properties	Oct 25, 2020	Benchmarkingspeech-recognition	CodeCode Available
Probing Conceptual Understanding of Large Visual-Language Models	Apr 7, 2023	Benchmarking	CodeCode Available
Probing Critical Learning Dynamics of PLMs for Hate Speech Detection	Feb 3, 2024	BenchmarkingHate Speech Detection	CodeCode Available
Using Color To Identify Insider Threats	Nov 25, 2021	Benchmarking	CodeCode Available
An Exploration of Exploration: Measuring the ability of lexicase selection to find obscure pathways to optimality	Jul 20, 2021	BenchmarkingDiagnostic	CodeCode Available
Towards Computational Performance Engineering for Unsupervised Concept Drift Detection -- Complexities, Benchmarking, Performance Analysis	Apr 17, 2023	BenchmarkingDrift Detection	CodeCode Available
Transfer Learning between Motor Imagery Datasets using Deep Learning -- Validation of Framework and Comparison of Datasets	Sep 4, 2023	BenchmarkingMotor Imagery	CodeCode Available
Synth4bench: a framework for generating synthetic genomics data for the evaluation of tumor-only somatic variant calling algorithms	Mar 8, 2024	BenchmarkingSynthetic Data Generation	CodeCode Available
Process Extraction from Text: Benchmarking the State of the Art and Paving the Way for Future Challenges	Oct 7, 2021	BenchmarkingModel extraction	CodeCode Available
Transfer Learning for Prosthetics Using Imitation Learning	Jan 15, 2019	BenchmarkingImitation Learning	CodeCode Available
Benchmarking datasets for Anomaly-based Network Intrusion Detection: KDD CUP 99 alternatives	Nov 13, 2018	BenchmarkingIntrusion Detection	CodeCode Available
Synthetic Datasets for Machine Learning on Spatio-Temporal Graphs using PDEs	Feb 6, 2025	BenchmarkingEpidemiology	CodeCode Available
Synthetic location trajectory generation using categorical diffusion models	Feb 19, 2024	BenchmarkingDecision Making	CodeCode Available
Synthetic Porous Microstructures: Automatic Design, Simulation, and Permeability Analysis	Feb 20, 2025	Benchmarking	CodeCode Available
Synthetic Time Series Forecasting with Transformer Architectures: Extensive Simulation Benchmarks	May 26, 2025	BenchmarkingDecision Making Under Uncertainty	CodeCode Available
An Experimental Study of the Transferability of Spectral Graph Networks	Dec 18, 2020	BenchmarkingGeneral Classification	CodeCode Available
Comparison Performance of Spectrogram and Scalogram as Input of Acoustic Recognition Task	Mar 6, 2024	Benchmarking	CodeCode Available
Can LLMs replace Neil deGrasse Tyson? Evaluating the Reliability of LLMs as Science Communicators	Sep 21, 2024	Benchmarking	CodeCode Available
Benchmarking Data Heterogeneity Evaluation Approaches for Personalized Federated Learning	Oct 9, 2024	BenchmarkingFairness	CodeCode Available
Comparing Machine Learning Algorithms by Union-Free Generic Depth	Dec 20, 2023	Benchmarking	CodeCode Available
SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists	Aug 30, 2024	BenchmarkingSentiment Analysis	CodeCode Available
Transformation-Interaction-Rational Representation for Symbolic Regression	Apr 25, 2022	BenchmarkingForm	CodeCode Available
Towards Enhancing Fault Tolerance in Neural Networks	Jul 6, 2019	Benchmarking	CodeCode Available
Robust Model-Based Optimization for Challenging Fitness Landscapes	May 23, 2023	Benchmarkingmodel	CodeCode Available
Air Learning: A Deep Reinforcement Learning Gym for Autonomous Aerial Robot Visual Navigation	Jun 2, 2019	BenchmarkingDeep Reinforcement Learning	CodeCode Available

Show:10 25 50

← PrevPage 217 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified