Benchmarking

Papers


Showing 3551–3575 of 5548 papers

Title	Date	Tasks	Status
On Evaluation of Bangla Word Analogies	Apr 10, 2023	Benchmarking, Word Embeddings	Unverified
On Evaluation of Document Classification using RVL-CDIP	Jun 21, 2023	Benchmarking, Classification	Unverified
On General Language Understanding	Oct 27, 2023	Benchmarking, Ethics	Unverified
Online Model-based Anomaly Detection in Multivariate Time Series: Taxonomy, Survey, Research Challenges and Future Directions	Aug 7, 2024	Anomaly Detection, Benchmarking	Unverified
Online vs Offline: A Comparative Study of First-Party and Third-Party Evaluations of Social Chatbots	Sep 12, 2024	Benchmarking, Chatbot	Unverified
On loss functions and evaluation metrics for music source separation	Feb 16, 2022	Audio Source Separation, Benchmarking	Unverified
Only Time Can Tell: Discovering Temporal Data for Temporal Modeling	Jul 19, 2019	Benchmarking, Motion Estimation	Unverified
On Machine Learning Approaches for Protein-Ligand Binding Affinity Prediction	Jul 15, 2024	Active Learning, Benchmarking	Unverified
An Approach to Evaluate Modeling Adequacy for Small-Signal Stability Analysis of IBR-related SSOs in Multimachine Systems	Mar 12, 2024	Benchmarking	Unverified
On Neural Inertial Classification Networks for Pedestrian Activity Recognition	Feb 23, 2025	Activity Recognition, Benchmarking	Unverified
On quantifying and improving realism of images generated with diffusion	Sep 26, 2023	Attribute, Benchmarking	Unverified
On Symbiosis of Attribute Prediction and Semantic Segmentation	Nov 23, 2019	Attribute, Benchmarking	Unverified
On the Assessment of Benchmark Suites for Algorithm Comparison	Apr 15, 2021	Benchmarking	Unverified
On the Benchmarking of LLMs for Open-Domain Dialogue Evaluation	Jul 4, 2024	Benchmarking, Chatbot	Unverified
Decisions and Performance Under Bounded Rationality: A Computational Benchmarking Approach	May 26, 2020	Benchmarking, Decision Making	Unverified
On the Evaluation of Speech Foundation Models for Spoken Language Understanding	Jun 14, 2024	Benchmarking, Prediction	Unverified
On the Evaluation of User Privacy in Deep Neural Networks using Timing Side Channel	Aug 1, 2022	Benchmarking, image-classification	Unverified
On the Impact of Data Heterogeneity in Federated Learning Environments with Application to Healthcare Networks	Apr 29, 2024	Benchmarking, Federated Learning	Unverified
Broadening the Scope of Neural Network Potentials through Direct Inclusion of Additional Molecular Attributes	Mar 22, 2024	Benchmarking	Unverified
On the Interaction of Belief Bias and Explanations	Jun 29, 2021	Benchmarking	Unverified
On the Performance of Multimodal Language Models	Oct 4, 2023	Benchmarking, Binary Classification	Unverified
On the Potential of Large Language Models to Solve Semantics-Aware Process Mining Tasks	Apr 29, 2025	Anomaly Detection, Benchmarking	Unverified
On the project risk baseline: integrating aleatory uncertainty into project scheduling	May 31, 2024	Benchmarking, Scheduling	Unverified
On the Real-Time Semantic Segmentation of Aphid Clusters in the Wild	Jul 17, 2023	Benchmarking, Real-Time Semantic Segmentation	Unverified
On the reduction of Linear Parameter-Varying State-Space models	Apr 2, 2024	Benchmarking, Dimensionality Reduction	Unverified



Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified