SOTAVerified

Benchmarking

Papers

Showing 4526–4550 of 5548 papers

Title | Status | Hype
The Dota 2 Bot Competition | - | 0
Learning to Fly -- a Gym Environment with PyBullet Physics for Reinforcement Learning of Multi-agent Quadcopter Control | Code | 2
Benchmarking Robustness of Deep Learning Classifiers Using Two-Factor Perturbation | Code | 0
Adversarial Environment Generation for Learning to Navigate the Web | Code | 0
Accounting for Variance in Machine Learning Benchmarks | - | 0
Towards Personalized Federated Learning | - | 0
Improving Medical Image Classification with Label Noise Using Dual-uncertainty Estimation | - | 0
OpenICS: Open Image Compressive Sensing Toolbox and Benchmark | Code | 1
Variational Laplace for Bayesian neural networks | - | 0
Connecting convex energy-based inference and optimal transport for domain adaptation | - | 0
Learning Transferable Visual Models From Natural Language Supervision | Code | 2
Benchmarking and Survey of Explanation Methods for Black Box Models | Code | 1
State-of-the-Art in Human Scanpath Prediction | - | 0
AutoAI-TS: AutoAI for Time Series Forecasting | - | 0
Benchmarking Graph Neural Networks on Link Prediction | - | 0
4D Panoptic LiDAR Segmentation | Code | 1
The Curious Case of Integrator Reach Sets, Part I: Basic Theory | - | 0
Decentralized Joint Beamforming, User Scheduling and QoS Management in 5G and Beyond Systems | - | 0
Deluca -- A Differentiable Control Library: Environments, Methods, and Benchmarking | Code | 1
NuCLS: A scalable crowdsourcing, deep learning approach and dataset for nucleus classification, localization and segmentation | Code | 1
A Review of Testing Object-Based Environment Perception for Safe Automated Driving | Code | 0
GraphGallery: A Platform for Fast Benchmarking and Easy Development of Graph Neural Networks Based Intelligent Software | Code | 1
Geometric feature performance under downsampling for EEG classification tasks | - | 0
HAWKS: Evolving Challenging Benchmark Sets for Cluster Analysis | Code | 1
Leveraging Benchmarking Data for Informed One-Shot Dynamic Algorithm Selection | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | - | Unverified