SOTAVerified

Benchmarking

Papers

Showing 51015150 of 5548 papers

TitleStatusHype
Safe Trajectory Generation for Complex Urban Environments Using Spatio-temporal Semantic CorridorCode0
Alchemy: A Quantum Chemistry Dataset for Benchmarking AI ModelsCode0
Exploring Model-based Planning with Policy NetworksCode0
Energy Models for Better Pseudo-Labels: Improving Semi-Supervised Classification with the 1-Laplacian Graph Energy0
Light Field Saliency Detection with Deep Convolutional NetworksCode0
Performance Evaluation Methodology for Long-Term Visual Object Tracking0
PyRobot: An Open-source Robotics Framework for Research and BenchmarkingCode1
Analysis | OPEN | Published: 17 June 2019 Multitask learning and benchmarking with clinical time series dataCode0
Benchmarking Neural Machine Translation for Southern African LanguagesCode0
MMDetection: Open MMLab Detection Toolbox and BenchmarkCode1
Hardware Aware Neural Network Architectures using FbNetCode0
Transfer Learning in Biomedical Natural Language Processing: An Evaluation of BERT and ELMo on Ten Benchmarking DatasetsCode1
Benchmarking Minimax LinkageCode0
Object Pose Estimation in Robotics Revisited0
MNIST-C: A Robustness Benchmark for Computer VisionCode1
Towards Fair and Privacy-Preserving Federated Deep ModelsCode0
RL-Based Method for Benchmarking the Adversarial Resilience and Robustness of Deep Reinforcement Learning Policies0
Air Learning: A Deep Reinforcement Learning Gym for Autonomous Aerial Robot Visual NavigationCode0
The Principle of Unchanged Optimality in Reinforcement Learning Generalization0
Show Some Love to Your n-grams: A Bit of Progress and Stronger n-gram Language Modeling Baselines0
Benchmarking Hierarchical Script KnowledgeCode0
Natural Image Noise DatasetCode0
MaxpoolNMS: Getting Rid of NMS Bottlenecks in Two-Stage Object Detectors0
Meta-Surrogate Benchmarking for Hyperparameter OptimizationCode1
Benchmarking Regression Methods: A comparison with CGANCode1
Non-linear Multitask Learning with Deep Gaussian Processes0
Matrix-Free Preconditioning in Online Learning0
Adaptive Deep Kernel Learning0
COSET: A Benchmark for Evaluating Neural Program Embeddings0
On Recurrent Neural Networks for Sequence-based Processing in CommunicationsCode0
NTP : A Neural Network Topology Profiler0
Cognitive Model Priors for Predicting Human Decisions0
Benchmarking Deep Learning Architectures for Predicting Readmission to the ICU and Describing Patients-at-RiskCode0
Characterizing SLAM Benchmarks and Methods for the Robust Perception AgeCode0
Robust measurement of innovation performances in Europe with a hierarchy of interacting composite indicators0
SAWNet: A Spatially Aware Deep Neural Network for 3D Point Cloud Processing0
REPLAB: A Reproducible Low-Cost Arm Benchmark Platform for Robotic Learning0
LEAF: A Benchmark for Federated Settings0
Simitate: A Hybrid Imitation Learning BenchmarkCode0
IPC: A Benchmark Data Set for Learning with Graph-Structured DataCode0
Strong and Simple Baselines for Multimodal Utterance EmbeddingsCode0
The Algonauts Project: A Platform for Communication between the Sciences of Biological and Artificial Intelligence0
VizNet: Towards A Large-Scale Visualization Learning and Benchmarking RepositoryCode0
Long Short-Term Memory with Gate and State Level Fusion for Light Field-Based Face Recognition0
Machine Learning Cryptanalysis of a Quantum Random Number GeneratorCode0
Scaling and Benchmarking Self-Supervised Visual Representation LearningCode0
Detecting Out-Of-Distribution Samples Using Low-Order Deep Features Statistics0
Evaluation Methodology for Attacks Against Confidence Thresholding Models0
On the Use of ArXiv as a DatasetCode0
A Deep Q-Learning Method for Downlink Power Allocation in Multi-Cell Networks0
Show:102550
← PrevPage 103 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified