SOTAVerified

Benchmarking

Papers

Showing 50015050 of 5548 papers

TitleStatusHype
Design2Code: Benchmarking Multimodal Code Generation for Automated Front-End Engineering0
Design and benchmarking of a two degree of freedom tendon driver unit for cable-driven wearable technologies0
Sum Secrecy Rate Maximization for Full Duplex ISAC Systems0
Design and Realization of a Benchmarking Testbed for Evaluating Autonomous Platooning Algorithms0
Super-Resolution via Deep Learning0
Design, Benchmarking and Explainability Analysis of a Game-Theoretic Framework towards Energy Efficiency in Smart Infrastructure0
Designing labeled graph classifiers by exploiting the Rényi entropy of the dissimilarity representation0
Design of intelligent proofreading system for English translation based on CNN and BERT0
Design of Supervision-Scalable Learning Systems: Methodology and Performance Benchmarking0
Design Target Achievement Index: A Differentiable Metric to Enhance Deep Generative Models in Multi-Objective Inverse Design0
Computational and Exploratory Landscape Analysis of the GKLS Generator0
A Meta-Engine Framework for Interleaved Task and Motion Planning using Topological Refinements0
Detecting Finger-Vein Presentation Attacks Using 3D Shape & Diffuse Reflectance Decomposition0
Detecting Out-Of-Distribution Samples Using Low-Order Deep Features Statistics0
Support Vector Machines and generalisation in HEP0
Surface Reconstruction from Point Clouds: A Survey and a Benchmark0
Detection and Evaluation of Clusters within Sequential Data0
Detection of Adversarial Attacks and Characterization of Adversarial Subspace0
Comprehensive Review and Empirical Evaluation of Causal Discovery Algorithms for Numerical Data0
Determinants of Performance in European ATM -- How to Analyze a Diverse Industry0
DetoxBench: Benchmarking Large Language Models for Multitask Fraud & Abuse Detection0
detrex: Benchmarking Detection Transformers0
Comprehensive Energy Footprint Benchmarking of Strong Parallel Electrified Powertrain0
Development details and computational benchmarking of DEPAM0
Device Modeling Bias in ReRAM-based Neural Network Simulations0
Comprehensive Energy Footprint Benchmarking Algorithm for Electrified Powertrains0
DexGraspNet 2.0: Learning Generative Dexterous Grasping in Large-scale Synthetic Cluttered Scenes0
SurgBench: A Unified Large-Scale Benchmark for Surgical Video Analysis0
Comprehensive Benchmarking of Machine Learning Methods for Risk Prediction Modelling from Large-Scale Survival Data: A UK Biobank Study0
Comprehensive Benchmarking of Entropy and Margin Based Scoring Metrics for Data Selection0
DFTR: Depth-supervised Fusion Transformer for Salient Object Detection0
DHP Benchmark: Are LLMs Good NLG Evaluators?0
Diagnosing and Addressing Pitfalls in KG-RAG Datasets: Toward More Reliable Benchmarking0
A Benchmarking Environment for Worker Flexibility in Flexible Job Shop Scheduling Problems0
Dialogue Games for Benchmarking Language Understanding: Motivation, Taxonomy, Strategy0
Comprehensive Benchmark Datasets for Amharic Scene Text Detection and Recognition0
A Metadata-Driven Approach to Understand Graph Neural Networks0
DI-BENCH: Benchmarking Large Language Models on Dependency Inference with Testable Repositories at Scale0
Surprise Potential as a Measure of Interactivity in Driving Scenarios0
DIF: A Framework for Benchmarking and Verifying Implicit Bias in LLMs0
Diff5T: Benchmarking Human Brain Diffusion MRI with an Extensive 5.0 Tesla K-Space and Spatial Dataset0
DiffBody: Human Body Restoration by Imagining with Generative Diffusion Prior0
Different Horses for Different Courses: Comparing Bias Mitigation Algorithms in ML0
Differential Privacy for Adaptive Weight Aggregation in Federated Tumor Segmentation0
Complexity of Representations in Deep Learning0
Diffusion-Driven Domain Adaptation for Generating 3D Molecules0
DIG: A Turnkey Library for Diving into Graph Deep Learning Research0
Complex Human Action Recognition in Live Videos Using Hybrid FR-DL Method0
Completing Spatial Transcriptomics Data for Gene Expression Prediction Benchmarking0
DiLiGenT102: A Photometric Stereo Benchmark Dataset With Controlled Shape and Material Variation0
Show:102550
← PrevPage 101 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified