SOTAVerified

Benchmarking

Papers

Showing 4326–4350 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| Comparative Analysis of Packages and Algorithms for the Analysis of Spatially Resolved Transcriptomics Data | | 0 |
| Quantum machine learning of large datasets using randomized measurements | Code | 1 |
| CARLA: A Python Library to Benchmark Algorithmic Recourse and Counterfactual Explanation Algorithms | Code | 1 |
| Multilingual Protest News Detection - Shared Task 1, CASE 2021 | | 0 |
| Benchmarking: Past, Present and Future | Code | 1 |
| Benchmarking Neural Topic Models: An Empirical Study | | 0 |
| The Effect of Domain and Diacritics in Yoruba–English Neural Machine Translation | | 0 |
| Multimodal or Text? Retrieval or BERT? Benchmarking Classifiers for the Shared Task on Hateful Memes | | 0 |
| What Motivates You? Benchmarking Automatic Detection of Basic Needs from Short Posts | | 0 |
| Benchmarking Scalable Methods for Streaming Cross Document Entity Coreference | Code | 0 |
| Cross-replication Reliability - An Empirical Approach to Interpreting Inter-rater Reliability | | 0 |
| Improving Model Generalization: A Chinese Named Entity Recognition Case Study | | 0 |
| SignalGP-Lite: Event Driven Genetic Programming Library for Large-Scale Artificial Life Applications | Code | 0 |
| Contemporary Symbolic Regression Methods and their Relative Performance | Code | 1 |
| Reradiation and Scattering from a Reconfigurable Intelligent Surface: A General Macroscopic Model | | 0 |
| AA3DNet: Attention Augmented Real Time 3D Object Detection | | 0 |
| Benchmarking AutoML Frameworks for Disease Prediction Using Medical Claims | | 0 |
| 3D fluorescence microscopy data synthesis for segmentation and benchmarking | Code | 0 |
| An Exploration of Exploration: Measuring the ability of lexicase selection to find obscure pathways to optimality | Code | 0 |
| Learned Sorted Table Search and Static Indexes in Small Model Space | Code | 0 |
| PhD Thesis on Code Modulated Interferometric Imaging System using Phased Arrays | | 0 |
| Attribution of Predictive Uncertainties in Classification Models | Code | 0 |
| Better force fields start with better data -- A data set of cation dipeptide interactions | Code | 0 |
| ECG-Adv-GAN: Detecting ECG Adversarial Examples with Conditional Generative Adversarial Networks | | 0 |
| Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |