SOTAVerified

Benchmarking

Papers

Showing 5301-5325 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| Tackling the Story Ending Biases in The Story Cloze Test | | 0 |
| NEWS 2018 Whitepaper | | 0 |
| Benchmarking the Hill-Valley Evolutionary Algorithm for the GECCO 2018 Competition on Niching Methods Multimodal Optimization | Code | 0 |
| Hyperspectral Image Dataset for Benchmarking on Salient Object Detection | Code | 0 |
| Learning a Saliency Evaluation Metric Using Crowdsourced Perceptual Judgments | | 0 |
| Person Re-Identification in Identity Regression Space | | 0 |
| End-to-End Neural Ranking for eCommerce Product Search: an application of task models and textual embeddings | | 0 |
| The Neural Painter: Multi-Turn Image Generation | | 0 |
| Real-time cryo-EM data pre-processing with Warp | Code | 0 |
| Benchmarking Evolutionary Algorithms For Single Objective Real-valued Constrained Optimization - A Critical Review | | 0 |
| Deep Reinforcement Learning for General Video Game AI | Code | 0 |
| Analysis of DAWNBench, a Time-to-Accuracy Machine Learning Performance Benchmark | | 0 |
| Adversarial Reinforcement Learning Framework for Benchmarking Collision Avoidance Mechanisms in Autonomous Vehicles | | 0 |
| A Dataset for Web-Scale Knowledge Base Population | Code | 0 |
| CVM-Net: Cross-View Matching Network for Image-Based Ground-to-Aerial Geo-Localization | Code | 0 |
| A Report on the 2018 VUA Metaphor Detection Shared Task | | 0 |
| Syntactically Aware Neural Architectures for Definition Extraction | | 0 |
| NengoDL: Combining deep learning and neuromorphic modelling methods | Code | 0 |
| Quantum classification of the MNIST dataset with Slow Feature Analysis | | 0 |
| Simulation of Large Scale Neural Networks for Evaluation Applications | | 0 |
| The EuroCity Persons Dataset: A Novel Benchmark for Object Detection | | 0 |
| Deep Nets: What have they ever done for Vision? | | 0 |
| Comparative evaluation of instrument segmentation and tracking methods in minimally invasive surgery | | 0 |
| Resource Interoperability for Sustainable Benchmarking: The Case of Events | Code | 0 |
| Le benchmarking de la reconnaissance d'entités nommées pour le français (Benchmarking for French NER) | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |