Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 5276–5300 of 5548 papers

Title	Date	Tasks	Status
C-TLSAN: Content-Enhanced Time-Aware Long- and Short-Term Attention Network for Personalized Recommendation	Jun 16, 2025	BenchmarkingRecommendation Systems	CodeCode Available
Performance Evaluation of Real-Time Object Detection for Electric Scooters	May 5, 2024	Autonomous VehiclesBenchmarking	CodeCode Available
Benchmarking Framework for Performance-Evaluation of Causal Inference Analysis	Feb 14, 2018	BenchmarkingCausal Inference	CodeCode Available
A General Benchmarking Framework for Text Generation	Dec 1, 2020	BenchmarkingKnowledge Graphs	CodeCode Available
Performance Modeling of Data Storage Systems using Generative Models	Jul 5, 2023	Benchmarking	CodeCode Available
Zero-Shot Hyperspectral Pansharpening Using Hysteresis-Based Tuning for Spectral Quality Control	May 22, 2025	BenchmarkingPansharpening	CodeCode Available
Vector-Based Data Improves Left-Right Eye-Tracking Classifier Performance After a Covariate Distributional Shift	Jul 31, 2022	BenchmarkingEEG	CodeCode Available
AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge	Dec 18, 2024	BenchmarkingWorld Knowledge	CodeCode Available
Periodic Extrapolative Generalisation in Neural Networks	Sep 21, 2022	Benchmarking	CodeCode Available
Standardizing Structural Causal Models	Jun 17, 2024	BenchmarkingCausal Inference	CodeCode Available
Standard Vs Uniform Binary Search and Their Variants in Learned Static Indexing: The Case of the Searching on Sorted Data Benchmarking Software Platform	Jan 5, 2022	Benchmarking	CodeCode Available
StarBASE-GP: Biologically-Guided Automated Machine Learning for Genotype-to-Phenotype Association Analysis	May 28, 2025	Benchmarking	CodeCode Available
Benchmarking framework for machine learning classification from fNIRS data	Mar 3, 2023	BenchmarkingBrain Computer Interface	CodeCode Available
PersoBench: Benchmarking Personalized Response Generation in Large Language Models	Oct 4, 2024	BenchmarkingDialogue Generation	CodeCode Available
STA: Self-controlled Text Augmentation for Improving Text Classifications	Feb 24, 2023	BenchmarkingText Augmentation	CodeCode Available
Architecture Analysis and Benchmarking of 3D U-shaped Deep Learning Models for Thoracic Anatomical Segmentation	Feb 5, 2024	BenchmarkingImage Segmentation	CodeCode Available
XCompress: LLM assisted Python-based text compression toolkit	Aug 12, 2024	BenchmarkingLanguage Modeling	CodeCode Available
A Framework for Generating Informative Benchmark Instances	May 29, 2022	Benchmarking	CodeCode Available
What's Different between Visual Question Answering for Machine "Understanding" Versus for Accessibility?	Oct 26, 2022	BenchmarkingQuestion Answering	CodeCode Available
Towards Robust Metrics for Concept Representation Evaluation	Jan 25, 2023	BenchmarkingDisentanglement	CodeCode Available
Statistical Multicriteria Evaluation of LLM-Generated Text	Jun 22, 2025	BenchmarkingDiversity	CodeCode Available
ANTHROPOS-V: benchmarking the novel task of Crowd Volume Estimation	Jan 3, 2025	BenchmarkingCrowd Counting	CodeCode Available
Answer Consolidation: Formulation and Benchmarking	Apr 29, 2022	BenchmarkingQuestion Answering	CodeCode Available
A Benchmark on Extremely Weakly Supervised Text Classification: Reconcile Seed Matching and Prompting Approaches	May 22, 2023	BenchmarkingClassification	CodeCode Available
A novel evaluation methodology for supervised Feature Ranking algorithms	Jul 9, 2022	BenchmarkingFeature Importance	CodeCode Available

Show:10 25 50

← PrevPage 212 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified