SOTAVerified

Benchmarking

Papers

Showing 3076-3100 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| Benchmarking Performance of Deep Learning Model for Material Segmentation on Two HPC Systems | | 0 |
| Quantitative Metrics for Benchmarking Human-Aware Robot Navigation | Code | 0 |
| YOLOBench: Benchmarking Efficient Object Detectors on Embedded Systems | Code | 0 |
| Fluorescent Neuronal Cells v2: Multi-Task, Multi-Format Annotations for Deep Learning in Microscopy | | 0 |
| Foundational Models Defining a New Era in Vision: A Survey and Outlook | Code | 2 |
| Towards Long-Term predictions of Turbulence using Neural Operators | | 0 |
| When Multi-Task Learning Meets Partial Supervision: A Computer Vision Review | Code | 0 |
| UPREVE: An End-to-End Causal Discovery Benchmarking System | | 0 |
| Implementing and Benchmarking the Locally Competitive Algorithm on the Loihi 2 Neuromorphic Processor | | 0 |
| Benchmarking and Analyzing Generative Data for Visual Recognition | | 0 |
| Towards an AI Accountability Policy | | 0 |
| The Impact of Genomic Variation on Function (IGVF) Consortium | | 0 |
| Remote Bio-Sensing: Open Source Benchmark Framework for Fair Evaluation of rPPG | Code | 2 |
| PLANTAIN: Diffusion-inspired Pose Score Minimization for Fast and Accurate Molecular Docking | Code | 1 |
| Selecting the motion ground truth for loose-fitting wearables: benchmarking optical MoCap methods | Code | 0 |
| JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning | Code | 1 |
| Decoding the Enigma: Benchmarking Humans and AIs on the Many Facets of Working Memory | Code | 1 |
| The Extractive-Abstractive Axis: Measuring Content "Borrowing" in Generative Language Models | | 0 |
| SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models | Code | 1 |
| Benchmarking Potential Based Rewards for Learning Humanoid Locomotion | Code | 2 |
| On the Real-Time Semantic Segmentation of Aphid Clusters in the Wild | | 0 |
| Efficient Prediction of Peptide Self-assembly through Sequential and Graphical Encoding | Code | 1 |
| Examining the Effects of Degree Distribution and Homophily in Graph Learning Models | Code | 1 |
| Towards Heterogeneous Long-tailed Learning: Benchmarking, Metrics, and Toolbox | Code | 1 |
| Approaches for benchmarking single-cell gene regulatory network inference methods | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |