SOTAVerified

Benchmarking

Papers

Showing 4301-4350 of 5548 papers

Title | Status | Hype
Searching for an Effective Defender: Benchmarking Defense against Adversarial Word Substitution | Code | 1
Pulling Up by the Causal Bootstraps: Causal Data Augmentation for Pre-training Debiasing | Code | 1
Benchmarking high-fidelity pedestrian tracking systems for research, real-time monitoring and crowd control |  | 0
Technological Approaches to Detecting Online Disinformation and Manipulation |  | 0
A Unified Taxonomy and Multimodal Dataset for Events in Invasion Games | Code | 1
A Benchmark for Spray from Nearby Cutting Vehicles |  | 0
Evolving Evolutionary Algorithms using Linear Genetic Programming |  | 0
DeepEdgeBench: Benchmarking Deep Neural Networks on Edge Devices |  | 0
AutoLay: Benchmarking amodal layout estimation for autonomous driving |  | 0
Generative Wind Power Curve Modeling Via Machine Vision: A Self-learning Deep Convolutional Network Based Method | Code | 1
Drift in a Popular Metal Oxide Sensor Dataset Reveals Limitations for Gas Classification Benchmarks |  | 0
Discriminating modelling approaches for Point in Time Economic Scenario Generation |  | 0
SSH: A Self-Supervised Framework for Image Harmonization | Code | 1
SIAM: Chiplet-based Scalable In-Memory Acceleration with Mesh for Deep Neural Networks |  | 0
A Dataset for Answering Time-Sensitive Questions | Code | 1
A Systematic Benchmarking Analysis of Transfer Learning for Medical Image Analysis | Code | 1
Hatemoji: A Test Suite and Adversarially-Generated Dataset for Benchmarking and Detecting Emoji-based Hate | Code | 1
Distributional Depth-Based Estimation of Object Articulation Models | Code | 0
BenchENAS: A Benchmarking Platform for Evolutionary Neural Architecture Search | Code | 0
A Look at the Evaluation Setup of the M5 Forecasting Competition |  | 0
Secure Neuroimaging Analysis using Federated Learning with Homomorphic Encryption |  | 0
Intelligent Railway Foreign Object Detection: A Semi-supervised Convolutional Autoencoder Based Method |  | 0
Webly Supervised Fine-Grained Recognition: Benchmark Datasets and An Approach | Code | 1
Deep Reinforcement Learning for Continuous Docking Control of Autonomous Underwater Vehicles: A Benchmarking Study |  | 0
Terabyte-scale supervised 3D training and benchmarking dataset of the mouse kidney |  | 0
Comparative Analysis of Packages and Algorithms for the Analysis of Spatially Resolved Transcriptomics Data |  | 0
Quantum machine learning of large datasets using randomized measurements | Code | 1
CARLA: A Python Library to Benchmark Algorithmic Recourse and Counterfactual Explanation Algorithms | Code | 1
Multilingual Protest News Detection - Shared Task 1, CASE 2021 |  | 0
Benchmarking: Past, Present and Future | Code | 1
Benchmarking Neural Topic Models: An Empirical Study |  | 0
The Effect of Domain and Diacritics in Yoruba–English Neural Machine Translation |  | 0
Multimodal or Text? Retrieval or BERT? Benchmarking Classifiers for the Shared Task on Hateful Memes |  | 0
What Motivates You? Benchmarking Automatic Detection of Basic Needs from Short Posts |  | 0
Benchmarking Scalable Methods for Streaming Cross Document Entity Coreference | Code | 0
Cross-replication Reliability - An Empirical Approach to Interpreting Inter-rater Reliability |  | 0
Improving Model Generalization: A Chinese Named Entity Recognition Case Study |  | 0
SignalGP-Lite: Event Driven Genetic Programming Library for Large-Scale Artificial Life Applications | Code | 0
Contemporary Symbolic Regression Methods and their Relative Performance | Code | 1
Reradiation and Scattering from a Reconfigurable Intelligent Surface: A General Macroscopic Model |  | 0
AA3DNet: Attention Augmented Real Time 3D Object Detection |  | 0
Benchmarking AutoML Frameworks for Disease Prediction Using Medical Claims |  | 0
3D fluorescence microscopy data synthesis for segmentation and benchmarking | Code | 0
An Exploration of Exploration: Measuring the ability of lexicase selection to find obscure pathways to optimality | Code | 0
Learned Sorted Table Search and Static Indexes in Small Model Space | Code | 0
PhD Thesis on Code Modulated Interferometric Imaging System using Phased Arrays |  | 0
Attribution of Predictive Uncertainties in Classification Models | Code | 0
Better force fields start with better data -- A data set of cation dipeptide interactions | Code | 0
ECG-Adv-GAN: Detecting ECG Adversarial Examples with Conditional Generative Adversarial Networks |  | 0
Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi |  | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 |  | Unverified