SOTAVerified

Benchmarking

Papers

Showing 381390 of 5548 papers

TitleStatusHype
MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement LearningCode2
Panoptic nuScenes: A Large-Scale Benchmark for LiDAR Panoptic Segmentation and TrackingCode2
BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval ModelsCode2
Learning to Fly -- a Gym Environment with PyBullet Physics for Reinforcement Learning of Multi-agent Quadcopter ControlCode2
Learning Transferable Visual Models From Natural Language SupervisionCode2
Evaluating Large-Vocabulary Object Detectors: The Devil is in the DetailsCode2
PyHealth: A Python Library for Health Predictive ModelsCode2
TadGAN: Time Series Anomaly Detection Using Generative Adversarial NetworksCode2
Searching for a Search Method: Benchmarking Search Algorithms for Generating NLP Adversarial ExamplesCode2
Bringing Light Into the Dark: A Large-scale Evaluation of Knowledge Graph Embedding Models Under a Unified FrameworkCode2
Show:102550
← PrevPage 39 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified