SOTAVerified

Benchmarking

Papers

Showing 37013725 of 5548 papers

TitleStatusHype
Benchmarking Algorithmic Bias in Face Recognition: An Experimental Approach Using Synthetic Faces and Human Evaluation0
Spintronics for image recognition: performance benchmarking via ultrafast data-driven simulations0
Enhancing Architecture Frameworks by Including Modern Stakeholders and their Views/Viewpoints0
Benchmarking LLM powered Chatbots: Methods and Metrics0
RECipe: Does a Multi-Modal Recipe Knowledge Graph Fit a Multi-Purpose Recommendation System?0
Microvasculature Segmentation in Human BioMolecular Atlas Program (HuBMAP)0
Precise Benchmarking of Explainable AI Attribution MethodsCode0
A Survey of Spanish Clinical Language Models0
ChatGPT for GTFS: Benchmarking LLMs on GTFS Understanding and RetrievalCode0
RobustMQ: Benchmarking Robustness of Quantized Models0
Benchmarking Adaptative Variational Quantum Algorithms on QUBO Instances0
Differential Privacy for Adaptive Weight Aggregation in Federated Tumor Segmentation0
Capsa: A Unified Framework for Quantifying Risk in Deep Neural Networks0
CLAMS: A Cluster Ambiguity Measure for Estimating Perceptual Variability in Visual Clustering0
Benchmarking Ultra-High-Definition Image Reflection RemovalCode0
Deep Learning and Computer Vision for Glaucoma Detection: A Review0
TMPNN: High-Order Polynomial Regression Based on Taylor Map FactorizationCode0
Benchmarking Jetson Edge Devices with an End-to-end Video-based Anomaly Detection SystemCode0
Benchmarking Performance of Deep Learning Model for Material Segmentation on Two HPC Systems0
Quantitative Metrics for Benchmarking Human-Aware Robot NavigationCode0
Fluorescent Neuronal Cells v2: Multi-Task, Multi-Format Annotations for Deep Learning in Microscopy0
YOLOBench: Benchmarking Efficient Object Detectors on Embedded SystemsCode0
Towards an AI Accountability Policy0
Implementing and Benchmarking the Locally Competitive Algorithm on the Loihi 2 Neuromorphic Processor0
Towards Long-Term predictions of Turbulence using Neural Operators0
Show:102550
← PrevPage 149 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified