SOTAVerified

Benchmarking

Papers

Showing 28012850 of 5548 papers

TitleStatusHype
DailyQA: A Benchmark to Evaluate Web Retrieval Augmented LLMs Based on Capturing Real-World Changes0
Danish Airs and Grounds: A Dataset for Aerial-to-Street-Level Place Recognition and Localization0
DarkBench: Benchmarking Dark Patterns in Large Language Models0
DASB -- Discrete Audio and Speech Benchmark0
Data Analysis in the Era of Generative AI0
Data and its (dis)contents: A survey of dataset development and use in machine learning research0
Data Augmentation for Continual RL via Adversarial Gradient Episodic Memory0
Data Augmentation for Traffic Classification0
Data Collection of Real-Life Knowledge Work in Context: The RLKWiC Dataset0
Data-driven Approach for Static Hedging of Exchange Traded Options0
Data-driven inventory management for new products: An adjusted Dyna-Q approach with transfer learning0
Data-driven Power Flow Linearization: Simulation0
Data-driven surrogate modelling and benchmarking for process equipment0
Data-Driven Target Localization: Benchmarking Gradient Descent Using the Cramer-Rao Bound0
Data needs and challenges for quantum dot devices automation0
Multi-scale data reconstruction of turbulent rotating flows with Gappy POD, Extended POD and Generative Adversarial Networks0
Dataset and Benchmarking of Real-Time Embedded Object Detection for RoboCup SSL0
DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition0
DBsurf: A Discrepancy Based Method for Discrete Stochastic Gradient Estimation0
DDR-ID: Dual Deep Reconstruction Networks Based Image Decomposition for Anomaly Detection0
DeAR: Debiasing Vision-Language Models with Additive Residuals0
DECASTE: Unveiling Caste Stereotypes in Large Language Models through Multi-Dimensional Bias Analysis0
Decentralized Federated Learning on the Edge over Wireless Mesh Networks0
Decentralized Joint Beamforming, User Scheduling and QoS Management in 5G and Beyond Systems0
Decentralized Learning for Overparameterized Problems: A Multi-Agent Kernel Approximation Approach0
Deciphering the Definition of Adversarial Robustness for post-hoc OOD Detectors0
Decoding Complexity: Intelligent Pattern Exploration with CHPDA (Context Aware Hybrid Pattern Detection Algorithm)0
Decoding the Diversity: A Review of the Indic AI Research Landscape0
Deep-6DPose: Recovering 6D Object Pose from a Single RGB Image0
Deep Convolutional Generative Adversarial Network Based Food Recognition Using Partially Labeled Data0
Deep Crowd Anomaly Detection: State-of-the-Art, Challenges, and Future Research Directions0
Deep Diffusion Models and Unsupervised Hyperspectral Unmixing for Realistic Abundance Map Synthesis0
DeepEdgeBench: Benchmarking Deep Neural Networks on Edge Devices0
Deeper Insights into the Robustness of ViTs towards Common Corruptions0
DeepFake Doctor: Diagnosing and Treating Audio-Video Fake Detection0
Deep Feature Selection Using a Novel Complementary Feature Mask0
Deep filter banks for texture recognition, description, and segmentation0
Deep Generative Models for Physiological Signals: A Systematic Literature Review0
Deep Hedging of Long-Term Financial Derivatives0
Deep Image Compositing0
Deep Imputation of Missing Values in Time Series Health Data: A Review with Benchmarking0
Deep Learning and Computer Vision for Glaucoma Detection: A Review0
Deep Learning and Knowledge-Based Methods for Computer Aided Molecular Design -- Toward a Unified Approach: State-of-the-Art and Future Directions0
Deep Learning-Based Multiple Object Visual Tracking on Embedded System for IoT and Mobile Edge Computing Applications0
Deep learning for action spotting in association football videos0
Deep learning for extracting protein-protein interactions from biomedical literature0
Deep learning for molecular design - a review of the state of the art0
Optimal Design of Volt/VAR Control Rules of Inverters using Deep Learning0
Deep Learning for Virtual Screening: Five Reasons to Use ROC Cost Functions0
Deep Learning Logo Detection with Data Expansion by Synthesising Context0
Show:102550
← PrevPage 57 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified