SOTAVerified

Benchmarking

Papers

Showing 4301-4325 of 5548 papers

Title | Status | Hype
Searching for an Effective Defender: Benchmarking Defense against Adversarial Word Substitution | Code | 1
Pulling Up by the Causal Bootstraps: Causal Data Augmentation for Pre-training Debiasing | Code | 1
Benchmarking high-fidelity pedestrian tracking systems for research, real-time monitoring and crowd control | - | 0
Technological Approaches to Detecting Online Disinformation and Manipulation | - | 0
A Unified Taxonomy and Multimodal Dataset for Events in Invasion Games | Code | 1
A Benchmark for Spray from Nearby Cutting Vehicles | - | 0
Evolving Evolutionary Algorithms using Linear Genetic Programming | - | 0
DeepEdgeBench: Benchmarking Deep Neural Networks on Edge Devices | - | 0
AutoLay: Benchmarking amodal layout estimation for autonomous driving | - | 0
Generative Wind Power Curve Modeling Via Machine Vision: A Self-learning Deep Convolutional Network Based Method | Code | 1
Drift in a Popular Metal Oxide Sensor Dataset Reveals Limitations for Gas Classification Benchmarks | - | 0
Discriminating modelling approaches for Point in Time Economic Scenario Generation | - | 0
SSH: A Self-Supervised Framework for Image Harmonization | Code | 1
SIAM: Chiplet-based Scalable In-Memory Acceleration with Mesh for Deep Neural Networks | - | 0
A Dataset for Answering Time-Sensitive Questions | Code | 1
A Systematic Benchmarking Analysis of Transfer Learning for Medical Image Analysis | Code | 1
Hatemoji: A Test Suite and Adversarially-Generated Dataset for Benchmarking and Detecting Emoji-based Hate | Code | 1
Distributional Depth-Based Estimation of Object Articulation Models | Code | 0
BenchENAS: A Benchmarking Platform for Evolutionary Neural Architecture Search | Code | 0
A Look at the Evaluation Setup of the M5 Forecasting Competition | - | 0
Secure Neuroimaging Analysis using Federated Learning with Homomorphic Encryption | - | 0
Intelligent Railway Foreign Object Detection: A Semi-supervised Convolutional Autoencoder Based Method | - | 0
Webly Supervised Fine-Grained Recognition: Benchmark Datasets and An Approach | Code | 1
Deep Reinforcement Learning for Continuous Docking Control of Autonomous Underwater Vehicles: A Benchmarking Study | - | 0
Terabyte-scale supervised 3D training and benchmarking dataset of the mouse kidney | - | 0
Page 173 of 222

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | - | Unverified