SOTAVerified

Benchmarking

Papers

Showing 3701–3750 of 5548 papers

Title | Status | Hype
Benchmarking Algorithmic Bias in Face Recognition: An Experimental Approach Using Synthetic Faces and Human Evaluation | | 0
Spintronics for image recognition: performance benchmarking via ultrafast data-driven simulations | | 0
Enhancing Architecture Frameworks by Including Modern Stakeholders and their Views/Viewpoints | | 0
Benchmarking LLM powered Chatbots: Methods and Metrics | | 0
RECipe: Does a Multi-Modal Recipe Knowledge Graph Fit a Multi-Purpose Recommendation System? | | 0
Microvasculature Segmentation in Human BioMolecular Atlas Program (HuBMAP) | | 0
Precise Benchmarking of Explainable AI Attribution Methods | Code | 0
A Survey of Spanish Clinical Language Models | | 0
ChatGPT for GTFS: Benchmarking LLMs on GTFS Understanding and Retrieval | Code | 0
RobustMQ: Benchmarking Robustness of Quantized Models | | 0
Benchmarking Adaptative Variational Quantum Algorithms on QUBO Instances | | 0
Differential Privacy for Adaptive Weight Aggregation in Federated Tumor Segmentation | | 0
Capsa: A Unified Framework for Quantifying Risk in Deep Neural Networks | | 0
CLAMS: A Cluster Ambiguity Measure for Estimating Perceptual Variability in Visual Clustering | | 0
Benchmarking Ultra-High-Definition Image Reflection Removal | Code | 0
Deep Learning and Computer Vision for Glaucoma Detection: A Review | | 0
TMPNN: High-Order Polynomial Regression Based on Taylor Map Factorization | Code | 0
Benchmarking Jetson Edge Devices with an End-to-end Video-based Anomaly Detection System | Code | 0
Benchmarking Performance of Deep Learning Model for Material Segmentation on Two HPC Systems | | 0
Quantitative Metrics for Benchmarking Human-Aware Robot Navigation | Code | 0
Fluorescent Neuronal Cells v2: Multi-Task, Multi-Format Annotations for Deep Learning in Microscopy | | 0
YOLOBench: Benchmarking Efficient Object Detectors on Embedded Systems | Code | 0
Towards an AI Accountability Policy | | 0
Implementing and Benchmarking the Locally Competitive Algorithm on the Loihi 2 Neuromorphic Processor | | 0
Towards Long-Term predictions of Turbulence using Neural Operators | | 0
Benchmarking and Analyzing Generative Data for Visual Recognition | | 0
When Multi-Task Learning Meets Partial Supervision: A Computer Vision Review | Code | 0
UPREVE: An End-to-End Causal Discovery Benchmarking System | | 0
The Impact of Genomic Variation on Function (IGVF) Consortium | | 0
Selecting the motion ground truth for loose-fitting wearables: benchmarking optical MoCap methods | Code | 0
The Extractive-Abstractive Axis: Measuring Content "Borrowing" in Generative Language Models | | 0
Efficient and Accurate Optimal Transport with Mirror Descent and Conjugate Gradients | Code | 0
Benchmarking fixed-length Fingerprint Representations across different Embedding Sizes and Sensor Types | | 0
Approaches for benchmarking single-cell gene regulatory network inference methods | | 0
On the Real-Time Semantic Segmentation of Aphid Clusters in the Wild | | 0
Machine Learning for Ranking f-wave Extraction Methods in Single-Lead ECGs | | 0
Revisiting Implicit Models: Sparsity Trade-offs Capability in Weight-tied Model for Vision Tasks | | 0
Benchmarking the Effectiveness of Classification Algorithms and SVM Kernels for Dry Beans | | 0
Joint Batching and Scheduling for High-Throughput Multiuser Edge AI with Asynchronous Task Arrivals | | 0
Benchmarking Explanatory Models for Inertia Forecasting using Public Data of the Nordic Area | | 0
Challenge Results Are Not Reproducible | | 0
Pathway: a fast and flexible unified stream data processing framework for analytical and Machine Learning applications | | 0
Deep Generative Models for Physiological Signals: A Systematic Literature Review | | 0
Benchmarking Bayesian Causal Discovery Methods for Downstream Treatment Effect Estimation | | 0
Temporal Graphs Anomaly Emergence Detection: Benchmarking For Social Media Interactions | | 0
Assessing the efficacy of large language models in generating accurate teacher responses | | 0
Fast Empirical Scenarios | | 0
Fairness-Aware Graph Neural Networks: A Survey | | 0
Performance Modeling of Data Storage Systems using Generative Models | Code | 0
Structural Property Prediction | | 0
Page 75 of 111

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified