SOTAVerified

Benchmarking

Papers

Showing 28512900 of 5548 papers

TitleStatusHype
Deep Learning Models for UAV-Assisted Bridge Inspection: A YOLO Benchmark Analysis0
Deep Learning of Intrinsically Motivated Options in the Arcade Learning Environment0
Deep Learning vs. Gradient Boosting: Benchmarking state-of-the-art machine learning algorithms for credit scoring0
Deeply Supervised Depth Map Super-Resolution as Novel View Synthesis0
Deep Nets: What have they ever done for Vision?0
Deep One-Class Hate Speech Detection Model0
Deep recommender engine based on efficient product embeddings neural pipeline0
Deep Recurrent Modelling of Stationary Bitcoin Price Formation Using the Order Flow0
Deep Reinforcement Learning Algorithms for Hybrid V2X Communication: A Benchmarking Study0
Deep Reinforcement Learning for Autonomous Cyber Defence: A Survey0
Deep Reinforcement Learning for Continuous Docking Control of Autonomous Underwater Vehicles: A Benchmarking Study0
DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research0
DeepSIC: Deep Semantic Image Compression0
Deep State-Space Model for Predicting Cryptocurrency Price0
Deep Unlearn: Benchmarking Machine Unlearning0
Deep Unsupervised Saliency Detection: A Multiple Noisy Labeling Perspective0
Defining and Evaluating Visual Language Models' Basic Spatial Abilities: A Perspective from Psychometrics0
DELAD: Deep Landweber-guided deconvolution with Hessian and sparse prior0
Demographic Parity: Mitigating Biases in Real-World Data0
Demonstrating Almost Linear Time Complexity of Bus Admittance Matrix-Based Distribution Network Power Flow: An Empirical Approach0
Depression Detection on Social Media with Large Language Models0
Design2Code: Benchmarking Multimodal Code Generation for Automated Front-End Engineering0
Design and benchmarking of a two degree of freedom tendon driver unit for cable-driven wearable technologies0
Design and Realization of a Benchmarking Testbed for Evaluating Autonomous Platooning Algorithms0
Design, Benchmarking and Explainability Analysis of a Game-Theoretic Framework towards Energy Efficiency in Smart Infrastructure0
Designing labeled graph classifiers by exploiting the Rényi entropy of the dissimilarity representation0
Design of intelligent proofreading system for English translation based on CNN and BERT0
Design of Supervision-Scalable Learning Systems: Methodology and Performance Benchmarking0
Design Target Achievement Index: A Differentiable Metric to Enhance Deep Generative Models in Multi-Objective Inverse Design0
Detecting Finger-Vein Presentation Attacks Using 3D Shape & Diffuse Reflectance Decomposition0
Detecting Out-Of-Distribution Samples Using Low-Order Deep Features Statistics0
Detection and Evaluation of Clusters within Sequential Data0
Detection of Adversarial Attacks and Characterization of Adversarial Subspace0
Determinants of Performance in European ATM -- How to Analyze a Diverse Industry0
DetoxBench: Benchmarking Large Language Models for Multitask Fraud & Abuse Detection0
detrex: Benchmarking Detection Transformers0
Development details and computational benchmarking of DEPAM0
Device Modeling Bias in ReRAM-based Neural Network Simulations0
DexGraspNet 2.0: Learning Generative Dexterous Grasping in Large-scale Synthetic Cluttered Scenes0
DFTR: Depth-supervised Fusion Transformer for Salient Object Detection0
DHP Benchmark: Are LLMs Good NLG Evaluators?0
Diagnosing and Addressing Pitfalls in KG-RAG Datasets: Toward More Reliable Benchmarking0
Dialogue Games for Benchmarking Language Understanding: Motivation, Taxonomy, Strategy0
DI-BENCH: Benchmarking Large Language Models on Dependency Inference with Testable Repositories at Scale0
DIF: A Framework for Benchmarking and Verifying Implicit Bias in LLMs0
Diff5T: Benchmarking Human Brain Diffusion MRI with an Extensive 5.0 Tesla K-Space and Spatial Dataset0
DiffBody: Human Body Restoration by Imagining with Generative Diffusion Prior0
Different Horses for Different Courses: Comparing Bias Mitigation Algorithms in ML0
Differential Privacy for Adaptive Weight Aggregation in Federated Tumor Segmentation0
Diffusion-Driven Domain Adaptation for Generating 3D Molecules0
Show:102550
← PrevPage 58 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified