SOTAVerified

Benchmarking

Papers

Showing 51015125 of 5548 papers

TitleStatusHype
Safe Trajectory Generation for Complex Urban Environments Using Spatio-temporal Semantic CorridorCode0
Alchemy: A Quantum Chemistry Dataset for Benchmarking AI ModelsCode0
Exploring Model-based Planning with Policy NetworksCode0
Energy Models for Better Pseudo-Labels: Improving Semi-Supervised Classification with the 1-Laplacian Graph Energy0
Light Field Saliency Detection with Deep Convolutional NetworksCode0
Performance Evaluation Methodology for Long-Term Visual Object Tracking0
PyRobot: An Open-source Robotics Framework for Research and BenchmarkingCode1
Analysis | OPEN | Published: 17 June 2019 Multitask learning and benchmarking with clinical time series dataCode0
Benchmarking Neural Machine Translation for Southern African LanguagesCode0
MMDetection: Open MMLab Detection Toolbox and BenchmarkCode1
Hardware Aware Neural Network Architectures using FbNetCode0
Transfer Learning in Biomedical Natural Language Processing: An Evaluation of BERT and ELMo on Ten Benchmarking DatasetsCode1
Benchmarking Minimax LinkageCode0
Object Pose Estimation in Robotics Revisited0
MNIST-C: A Robustness Benchmark for Computer VisionCode1
Towards Fair and Privacy-Preserving Federated Deep ModelsCode0
RL-Based Method for Benchmarking the Adversarial Resilience and Robustness of Deep Reinforcement Learning Policies0
Air Learning: A Deep Reinforcement Learning Gym for Autonomous Aerial Robot Visual NavigationCode0
The Principle of Unchanged Optimality in Reinforcement Learning Generalization0
Show Some Love to Your n-grams: A Bit of Progress and Stronger n-gram Language Modeling Baselines0
Benchmarking Hierarchical Script KnowledgeCode0
Natural Image Noise DatasetCode0
MaxpoolNMS: Getting Rid of NMS Bottlenecks in Two-Stage Object Detectors0
Meta-Surrogate Benchmarking for Hyperparameter OptimizationCode1
Benchmarking Regression Methods: A comparison with CGANCode1
Show:102550
← PrevPage 205 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified