Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 5051–5075 of 5548 papers

Title	Date	Tasks	Status
Efficient and Accurate Optimal Transport with Mirror Descent and Conjugate Gradients	Jul 17, 2023	BenchmarkingGPU	CodeCode Available
SimbaML: Connecting Mechanistic Models and Machine Learning with Augmented Data	Apr 8, 2023	BenchmarkingData Augmentation	CodeCode Available
NSINA: A News Corpus for Sinhala	Mar 25, 2024	ArticlesBenchmarking	CodeCode Available
Improving Sequential Recommendation Models with an Enhanced Loss Function	Jan 3, 2023	BenchmarkingRecommendation Systems	CodeCode Available
Aspect-based Sentiment Classification with Aspect-specific Graph Convolutional Networks	Sep 8, 2019	BenchmarkingClassification	CodeCode Available
Editing Factual Knowledge and Explanatory Ability of Medical Large Language Models	Feb 28, 2024	BenchmarkingHallucination	CodeCode Available
SimBench: A Rule-Based Multi-Turn Interaction Benchmark for Evaluating an LLM's Ability to Generate Digital Twins	Aug 21, 2024	Benchmarking	CodeCode Available
A Seq2Seq approach to Symbolic Regression	Oct 17, 2020	Benchmarkingregression	CodeCode Available
A Collection of Quality Diversity Optimization Problems Derived from Hyperparameter Optimization of Machine Learning Models	Apr 28, 2022	BenchmarkingDiversity	CodeCode Available
Simitate: A Hybrid Imitation Learning Benchmark	May 15, 2019	BenchmarkingImitation Learning	CodeCode Available
Echo State Networks with Self-Normalizing Activations on the Hyper-Sphere	Mar 27, 2019	Benchmarking	CodeCode Available
ECBD: Evidence-Centered Benchmark Design for NLP	Jun 13, 2024	Benchmarking	CodeCode Available
A Continuous Optimisation Benchmark Suite from Neural Network Regression	Sep 12, 2021	BenchmarkingEvolutionary Algorithms	CodeCode Available
An Evaluation of Machine Learning Approaches for Early Diagnosis of Autism Spectrum Disorder	Sep 20, 2023	BenchmarkingClustering	CodeCode Available
Dyport: Dynamic Importance-based Hypothesis Generation Benchmarking Technique	Dec 6, 2023	BenchmarkingKnowledge Graphs	CodeCode Available
DynCIM: Dynamic Curriculum for Imbalanced Multimodal Learning	Mar 9, 2025	BenchmarkingDecision Making	CodeCode Available
DynamoRep: Trajectory-Based Population Dynamics for Classification of Black-box Optimization Problems	Jun 8, 2023	BenchmarkingDescriptive	CodeCode Available
Simple GNNs with Low Rank Non-parametric Aggregators	Oct 8, 2023	BenchmarkingNode Classification	CodeCode Available
Effective Stabilized Self-Training on Few-Labeled Graph Data	Oct 7, 2019	BenchmarkingModel Selection	CodeCode Available
Simulated Contextual Bandits for Personalization Tasks from Recommendation Datasets	Oct 12, 2022	BenchmarkingMulti-Armed Bandits	CodeCode Available
A Deep Reinforcement Learning Framework for Dynamic Portfolio Optimization: Evidence from China's Stock Market	Dec 24, 2024	BenchmarkingDecision Making	CodeCode Available
DyKgChat: Benchmarking Dialogue Generation Grounding on Dynamic Knowledge Graphs	Oct 1, 2019	BenchmarkingDialogue Generation	CodeCode Available
DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition	Jul 16, 2025	BenchmarkingKnowledge Distillation	CodeCode Available
Referenced Thermodynamic Integration for Bayesian Model Selection: Application to COVID-19 Model Selection	Sep 8, 2020	BenchmarkingEpidemiology	CodeCode Available
Simulation-based Benchmarking for Causal Structure Learning in Gene Perturbation Experiments	Jul 8, 2024	BenchmarkingDecision Making	CodeCode Available

Show:10 25 50

← PrevPage 203 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified