Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2801–2850 of 5548 papers

Title	Date	Tasks	Status
DailyQA: A Benchmark to Evaluate Web Retrieval Augmented LLMs Based on Capturing Real-World Changes	May 22, 2025	BenchmarkingRAG	—Unverified
Danish Airs and Grounds: A Dataset for Aerial-to-Street-Level Place Recognition and Localization	Feb 3, 2022	3D ReconstructionBenchmarking	—Unverified
DarkBench: Benchmarking Dark Patterns in Large Language Models	Mar 13, 2025	Benchmarking	—Unverified
DASB -- Discrete Audio and Speech Benchmark	Jun 20, 2024	BenchmarkingEmotion Recognition	—Unverified
Data Analysis in the Era of Generative AI	Sep 27, 2024	Benchmarking	—Unverified
Data and its (dis)contents: A survey of dataset development and use in machine learning research	Dec 9, 2020	BenchmarkingBIG-bench Machine Learning	—Unverified
Data Augmentation for Continual RL via Adversarial Gradient Episodic Memory	Aug 24, 2024	BenchmarkingData Augmentation	—Unverified
Data Augmentation for Traffic Classification	Jan 19, 2024	BenchmarkingClassification	—Unverified
Data Collection of Real-Life Knowledge Work in Context: The RLKWiC Dataset	Apr 16, 2024	BenchmarkingManagement	—Unverified
Data-driven Approach for Static Hedging of Exchange Traded Options	Feb 1, 2023	BenchmarkingInterpretable Machine Learning	—Unverified
Data-driven inventory management for new products: An adjusted Dyna-Q approach with transfer learning	Jan 14, 2025	BenchmarkingManagement	—Unverified
Data-driven Power Flow Linearization: Simulation	Jun 10, 2024	BenchmarkingComputational Efficiency	—Unverified
Data-driven surrogate modelling and benchmarking for process equipment	Mar 13, 2020	Active LearningBenchmarking	—Unverified
Data-Driven Target Localization: Benchmarking Gradient Descent Using the Cramer-Rao Bound	Jan 20, 2024	Benchmarking	—Unverified
Data needs and challenges for quantum dot devices automation	Dec 21, 2023	Benchmarking	—Unverified
Multi-scale data reconstruction of turbulent rotating flows with Gappy POD, Extended POD and Generative Adversarial Networks	Oct 21, 2022	BenchmarkingGenerative Adversarial Network	—Unverified
Dataset and Benchmarking of Real-Time Embedded Object Detection for RoboCup SSL	Jun 28, 2021	BenchmarkingObject	—Unverified
DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition	Jun 11, 2024	BenchmarkingCross-corpus	—Unverified
DBsurf: A Discrepancy Based Method for Discrete Stochastic Gradient Estimation	Sep 7, 2023	BenchmarkingNeural Architecture Search	—Unverified
DDR-ID: Dual Deep Reconstruction Networks Based Image Decomposition for Anomaly Detection	Jul 18, 2020	Adversarial AttackAdversarial Attack Detection	—Unverified
DeAR: Debiasing Vision-Language Models with Additive Residuals	Mar 18, 2023	AttributeBenchmarking	—Unverified
DECASTE: Unveiling Caste Stereotypes in Large Language Models through Multi-Dimensional Bias Analysis	May 20, 2025	BenchmarkingFairness	—Unverified
Decentralized Federated Learning on the Edge over Wireless Mesh Networks	Nov 2, 2023	BenchmarkingFederated Learning	—Unverified
Decentralized Joint Beamforming, User Scheduling and QoS Management in 5G and Beyond Systems	Feb 23, 2021	BenchmarkingManagement	—Unverified
Decentralized Learning for Overparameterized Problems: A Multi-Agent Kernel Approximation Approach	Sep 29, 2021	Benchmarking	—Unverified
Deciphering the Definition of Adversarial Robustness for post-hoc OOD Detectors	Jun 21, 2024	Adversarial DefenseAdversarial Robustness	—Unverified
Decoding Complexity: Intelligent Pattern Exploration with CHPDA (Context Aware Hybrid Pattern Detection Algorithm)	Feb 9, 2025	BenchmarkingCPU	—Unverified
Decoding the Diversity: A Review of the Indic AI Research Landscape	Jun 13, 2024	BenchmarkingDiversity	—Unverified
Deep-6DPose: Recovering 6D Object Pose from a Single RGB Image	Feb 28, 2018	BenchmarkingInstance Segmentation	—Unverified
Deep Convolutional Generative Adversarial Network Based Food Recognition Using Partially Labeled Data	Dec 26, 2018	BenchmarkingFood Recognition	—Unverified
Deep Crowd Anomaly Detection: State-of-the-Art, Challenges, and Future Research Directions	Oct 25, 2022	Anomaly DetectionBenchmarking	—Unverified
Deep Diffusion Models and Unsupervised Hyperspectral Unmixing for Realistic Abundance Map Synthesis	Jun 16, 2025	BenchmarkingData Augmentation	—Unverified
DeepEdgeBench: Benchmarking Deep Neural Networks on Edge Devices	Aug 21, 2021	BenchmarkingEdge-computing	—Unverified
Deeper Insights into the Robustness of ViTs towards Common Corruptions	Apr 26, 2022	BenchmarkingData Augmentation	—Unverified
DeepFake Doctor: Diagnosing and Treating Audio-Video Fake Detection	Jun 6, 2025	BenchmarkingDeepFake Detection	—Unverified
Deep Feature Selection Using a Novel Complementary Feature Mask	Sep 25, 2022	Benchmarkingfeature selection	—Unverified
Deep filter banks for texture recognition, description, and segmentation	Jul 9, 2015	Benchmarking	—Unverified
Deep Generative Models for Physiological Signals: A Systematic Literature Review	Jul 12, 2023	BenchmarkingEEG	—Unverified
Deep Hedging of Long-Term Financial Derivatives	Jul 29, 2020	BenchmarkingDeep Reinforcement Learning	—Unverified
Deep Image Compositing	Mar 29, 2021	Benchmarking	—Unverified
Deep Imputation of Missing Values in Time Series Health Data: A Review with Benchmarking	Feb 10, 2023	BenchmarkingDeep Learning	—Unverified
Deep Learning and Computer Vision for Glaucoma Detection: A Review	Jul 31, 2023	BenchmarkingDeep Learning	—Unverified
Deep Learning and Knowledge-Based Methods for Computer Aided Molecular Design -- Toward a Unified Approach: State-of-the-Art and Future Directions	May 18, 2020	BenchmarkingDeep Learning	—Unverified
Deep Learning-Based Multiple Object Visual Tracking on Embedded System for IoT and Mobile Edge Computing Applications	Jul 31, 2018	BenchmarkingDeep Learning	—Unverified
Deep learning for action spotting in association football videos	Oct 2, 2024	Action SpottingBenchmarking	—Unverified
Deep learning for extracting protein-protein interactions from biomedical literature	Jun 5, 2017	BenchmarkingCross-corpus	—Unverified
Deep learning for molecular design - a review of the state of the art	Mar 11, 2019	Benchmarkingreinforcement-learning	—Unverified
Optimal Design of Volt/VAR Control Rules of Inverters using Deep Learning	Nov 17, 2022	BenchmarkingUnity	—Unverified
Deep Learning for Virtual Screening: Five Reasons to Use ROC Cost Functions	Jun 25, 2020	BenchmarkingDrug Discovery	—Unverified
Deep Learning Logo Detection with Data Expansion by Synthesising Context	Dec 29, 2016	BenchmarkingDeep Learning	—Unverified

Show:10 25 50

← PrevPage 57 of 111Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified