Benchmarking

Papers

Title	Date	Tasks	Status
Towards Universal Learning-based Model for Cardiac Image Reconstruction: Summary of the CMRxRecon2024 Challenge	Mar 5, 2025	Benchmarking, Image Reconstruction	Unverified
Towards Visual Text Grounding of Multimodal Large Language Model	Apr 7, 2025	Benchmarking, Language Modeling	Unverified
Near-Term Quantum Computing Techniques: Variational Quantum Algorithms, Error Mitigation, Circuit Compilation, Benchmarking and Classical Simulation	Nov 16, 2022	Benchmarking	Unverified
Benchmarking deep generative models for diverse antibody sequence design	Nov 12, 2021	Benchmarking, Diversity	Unverified
Benchmarking Deep Facial Expression Recognition: An Extensive Protocol with Balanced Dataset in the Wild	Nov 6, 2023	Benchmarking, Facial Expression Recognition	Unverified
Towards Zero-Shot Differential Morphing Attack Detection with Multimodal Large Language Models	May 21, 2025	Benchmarking, Prompt Engineering	Unverified
NeIn: Telling What You Don't Want	Sep 9, 2024	Benchmarking, Negation	Unverified
Toward Transparent AI: A Survey on Interpreting the Inner Structures of Deep Neural Networks	Jul 27, 2022	Adversarial Robustness, Benchmarking	Unverified
TP-RAG: Benchmarking Retrieval-Augmented Large Language Model Agents for Spatiotemporal-Aware Travel Planning	Apr 11, 2025	Benchmarking, Language Modeling	Unverified
Benchmarking Deep AUROC Optimization: Loss Functions and Algorithmic Choices	Mar 27, 2022	Benchmarking, imbalanced classification	Unverified
Benchmarking Deepart Detection	Feb 28, 2023	Benchmarking, DeepFake Detection	Unverified
Benchmarking Decoupled Neural Interfaces with Synthetic Gradients	Dec 22, 2017	Benchmarking	Unverified
NerfBaselines: Consistent and Reproducible Evaluation of Novel View Synthesis Methods	Jun 25, 2024	3DGS, Benchmarking	Unverified
Adaptive Experimentation at Scale: A Computational Framework for Flexible Batches	Mar 21, 2023	Benchmarking, Thompson Sampling	Unverified
Benchmarking data encoding methods in Quantum Machine Learning	May 20, 2025	Benchmarking, Quantum Machine Learning	Unverified
Adaptive Epidemic Forecasting and Community Risk Evaluation of COVID-19	Jun 3, 2021	Benchmarking, Decision Making	Unverified
Hyperparameter optimization with REINFORCE and Transformers	Jun 1, 2020	Benchmarking, Hyperparameter Optimization	Unverified
Neural feels with neural fields: Visuo-tactile perception for in-hand manipulation	Dec 20, 2023	Benchmarking	Unverified
Benchmarking Data Efficiency and Computational Efficiency of Temporal Action Localization Models	Aug 24, 2023	Action Localization, Benchmarking	Unverified
Benchmarking Data-driven Automatic Text Simplification for German	May 1, 2020	Benchmarking, Machine Translation	Unverified
Neural Network Approach for Non-Markovian Dissipative Dynamics of Many-Body Open Quantum Systems	Apr 17, 2024	Benchmarking, Quantization	Unverified
Tracking Everything in Robotic-Assisted Surgery	Sep 29, 2024	Benchmarking	Unverified
GIM: Gaussian Isolation Machines	Feb 6, 2020	Benchmarking, General Classification	Unverified
Neural Networks for Fast Optimisation in Model Predictive Control: A Review	Sep 6, 2023	Benchmarking, Model Predictive Control	Unverified
Benchmarking Cross-Domain Audio-Visual Deception Detection	May 11, 2024	Benchmarking, Deception Detection	Unverified

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified