Benchmarking

Papers


Showing 3801–3850 of 5548 papers

Title	Date	Tasks	Status
Towards Universal Learning-based Model for Cardiac Image Reconstruction: Summary of the CMRxRecon2024 Challenge	Mar 5, 2025	Benchmarking, Image Reconstruction	Unverified
Towards Visual Text Grounding of Multimodal Large Language Model	Apr 7, 2025	Benchmarking, Language Modeling	Unverified
Near-Term Quantum Computing Techniques: Variational Quantum Algorithms, Error Mitigation, Circuit Compilation, Benchmarking and Classical Simulation	Nov 16, 2022	Benchmarking	Unverified
Benchmarking deep generative models for diverse antibody sequence design	Nov 12, 2021	Benchmarking, Diversity	Unverified
Benchmarking Deep Facial Expression Recognition: An Extensive Protocol with Balanced Dataset in the Wild	Nov 6, 2023	Benchmarking, Facial Expression Recognition	Unverified
Towards Zero-Shot Differential Morphing Attack Detection with Multimodal Large Language Models	May 21, 2025	Benchmarking, Prompt Engineering	Unverified
NeIn: Telling What You Don't Want	Sep 9, 2024	Benchmarking, Negation	Unverified
Toward Transparent AI: A Survey on Interpreting the Inner Structures of Deep Neural Networks	Jul 27, 2022	Adversarial Robustness, Benchmarking	Unverified
TP-RAG: Benchmarking Retrieval-Augmented Large Language Model Agents for Spatiotemporal-Aware Travel Planning	Apr 11, 2025	Benchmarking, Language Modeling	Unverified
Benchmarking Deep AUROC Optimization: Loss Functions and Algorithmic Choices	Mar 27, 2022	Benchmarking, imbalanced classification	Unverified
Benchmarking Deepart Detection	Feb 28, 2023	Benchmarking, DeepFake Detection	Unverified
Benchmarking Decoupled Neural Interfaces with Synthetic Gradients	Dec 22, 2017	Benchmarking	Unverified
NerfBaselines: Consistent and Reproducible Evaluation of Novel View Synthesis Methods	Jun 25, 2024	3DGS, Benchmarking	Unverified
Adaptive Experimentation at Scale: A Computational Framework for Flexible Batches	Mar 21, 2023	Benchmarking, Thompson Sampling	Unverified
Benchmarking data encoding methods in Quantum Machine Learning	May 20, 2025	Benchmarking, Quantum Machine Learning	Unverified
Adaptive Epidemic Forecasting and Community Risk Evaluation of COVID-19	Jun 3, 2021	Benchmarking, Decision Making	Unverified
Hyperparameter optimization with REINFORCE and Transformers	Jun 1, 2020	Benchmarking, Hyperparameter Optimization	Unverified
Neural feels with neural fields: Visuo-tactile perception for in-hand manipulation	Dec 20, 2023	Benchmarking	Unverified
Benchmarking Data Efficiency and Computational Efficiency of Temporal Action Localization Models	Aug 24, 2023	Action Localization, Benchmarking	Unverified
Benchmarking Data-driven Automatic Text Simplification for German	May 1, 2020	Benchmarking, Machine Translation	Unverified
Neural Network Approach for Non-Markovian Dissipative Dynamics of Many-Body Open Quantum Systems	Apr 17, 2024	Benchmarking, Quantization	Unverified
Tracking Everything in Robotic-Assisted Surgery	Sep 29, 2024	Benchmarking	Unverified
GIM: Gaussian Isolation Machines	Feb 6, 2020	Benchmarking, General Classification	Unverified
Neural Networks for Fast Optimisation in Model Predictive Control: A Review	Sep 6, 2023	Benchmarking, Model Predictive Control	Unverified
Benchmarking Cross-Domain Audio-Visual Deception Detection	May 11, 2024	Benchmarking, Deception Detection	Unverified
Benchmarking Counterfactual Interpretability in Deep Learning Models for Time Series Classification	Aug 22, 2024	Benchmarking, counterfactual	Unverified
Neural Text Generation: Past, Present and Beyond	Mar 15, 2018	Benchmarking, Diversity	Unverified
Benchmarking Convolutional Neural Network and Graph Neural Network based Surrogate Models on a Real-World Car External Aerodynamics Dataset	Apr 9, 2025	Benchmarking, Graph Neural Network	Unverified
Benchmarking Conventional Vision Models on Neuromorphic Fall Detection and Action Recognition Dataset	Jan 28, 2022	Action Recognition, Benchmarking	Unverified
Benchmarking Conventional and Learned Video Codecs with a Low-Delay Configuration	Aug 9, 2024	Benchmarking, Video Compression	Unverified
Adaptive Deep Kernel Learning	May 28, 2019	Benchmarking, Drug Discovery	Unverified
Neuromorphic Vision-based Motion Segmentation with Graph Transformer Neural Network	Apr 16, 2024	Benchmarking, Motion Segmentation	Unverified
Towards Self-adaptive Mutation in Evolutionary Multi-Objective Algorithms	Mar 8, 2023	Benchmarking, Evolutionary Algorithms	Unverified
Adaptive Control of an Inverted Pendulum by a Reinforcement Learning-based LQR Method	Sep 30, 2023	Benchmarking, Reinforcement Learning (RL)	Unverified
Benchmarking Continual Learning from Cognitive Perspectives	Dec 6, 2023	Benchmarking, Continual Learning	Unverified
Training Mixed-Domain Translation Models via Federated Learning	May 3, 2022	Benchmarking, Federated Learning	Unverified
New Loss Functions for Fast Maximum Inner Product Search	Jan 1, 2020	Benchmarking, Quantization	Unverified
NEWS 2018 Whitepaper	Jul 1, 2018	Benchmarking, Machine Translation	Unverified
Benchmarking Constraint-Based Bayesian Structure Learning Algorithms: Role of Network Topology	Jan 2, 2025	Benchmarking, Sensitivity	Unverified
Training neural mapping schemes for satellite altimetry with simulation data	Sep 19, 2023	Benchmarking	Unverified
NEWTS: A Corpus for News Topic-Focused Summarization	May 31, 2022	Benchmarking, Text Summarization	Unverified
NEXT-EVAL: Next Evaluation of Traditional and LLM Web Data Record Extraction	May 21, 2025	Benchmarking, Hallucination	Unverified
Next-generation MRD assays: do we have the tools to evaluate them properly?	Oct 31, 2023	Benchmarking, Sensitivity	Unverified
Benchmarking confound regression strategies for the control of motion artifact in studies of functional connectivity	Aug 11, 2016	Benchmarking, Functional Connectivity	Unverified
NL2KQL: From Natural Language to Kusto Query	Apr 3, 2024	Benchmarking, Natural Language Queries	Unverified
Benchmarking and Building Zero-Shot Hindi Retrieval Model with Hindi-BEIR and NLLB-E5	Sep 9, 2024	Benchmarking, Information Retrieval	Unverified
Benchmarking common uncertainty estimation methods with histopathological images under domain shift and label noise	Jan 3, 2023	Benchmarking, Classification	Unverified
NLPre: a revised approach towards language-centric benchmarking of Natural Language Preprocessing systems	Mar 7, 2024	Benchmarking, Dependency Parsing	Unverified
A CUDA-Based Real Parameter Optimization Benchmark	Jul 29, 2014	Benchmarking, CPU	Unverified
Benchmarking Collaborative Learning Methods Cost-Effectiveness for Prostate Segmentation	Sep 29, 2023	Benchmarking, Federated Learning	Unverified


Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified