SOTAVerified

Benchmarking

Papers

Showing 4751–4800 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| Classification of Single-View Object Point Clouds | | 0 |
| Calibrated Adaptive Probabilistic ODE Solvers | Code | 0 |
| Uncertainty Estimation with Deep Learning for Rainfall-Runoff Modelling | | 0 |
| Data and its (dis)contents: A survey of dataset development and use in machine learning research | | 0 |
| Hybrid Quantum Computing -- Tabu Search Algorithm for Partitioning Problems: preliminary study on the Traveling Salesman Problem | | 0 |
| JANUS: Benchmarking Commercial and Open-Source Cloud and Edge Platforms for Object and Anomaly Detection Workloads | | 0 |
| MOLTR: Multiple Object Localisation, Tracking, and Reconstruction from Monocular RGB Videos | | 0 |
| MultiON: Benchmarking Semantic Map Memory using Multi-Object Navigation | | 0 |
| Benchmarking Commercial Intent Detection Services with Practice-Driven Evaluations | Code | 0 |
| AuthNet: A Deep Learning based Authentication Mechanism using Temporal Facial Feature Movements | Code | 0 |
| Benchmarking Automated Clinical Language Simplification: Dataset, Algorithm, and Evaluation | Code | 0 |
| SMPLy Benchmarking 3D Human Pose Estimation in the Wild | | 0 |
| Benchmarking Energy-Conserving Neural Networks for Learning Dynamics from Data | | 0 |
| mlOSP: Towards a Unified Implementation of Regression Monte Carlo Algorithms | Code | 0 |
| ABSA-Bench: Towards the Unified Evaluation of Aspect-based Sentiment Analysis Research | | 0 |
| AraBench: Benchmarking Dialectal Arabic-English Machine Translation | | 0 |
| Benchmarking of Transformer-Based Pre-Trained Models on Social Media Text Classification Datasets | | 0 |
| A General Benchmarking Framework for Text Generation | Code | 0 |
| Benchmarking Automated Review Response Generation for the Hospitality Domain | | 0 |
| Bayesian Multi-type Mean Field Multi-agent Imitation Learning | | 0 |
| Meta learning to classify intent and slot labels with noisy few shot examples | | 0 |
| RealCause: Realistic Causal Inference Benchmarking | | 0 |
| Class-agnostic Object Detection | | 0 |
| A survey of benchmarking frameworks for reinforcement learning | | 0 |
| Improving Augmentation and Evaluation Schemes for Semantic Image Synthesis | | 0 |
| Cable Tree Wiring -- Benchmarking Solvers on a Real-World Scheduling Problem with a Variety of Precedence Constraints | Code | 0 |
| Benchmarking Inference Performance of Deep Learning Models on Analog Devices | | 0 |
| Spatially Correlated Patterns in Adversarial Images | | 0 |
| Variational Laplace for Bayesian neural networks | | 0 |
| FedEval: A Holistic Evaluation Framework for Federated Learning | | 0 |
| Automatic Microprocessor Performance Bug Detection | | 0 |
| Benchmarking Domain Randomisation for Visual Sim-to-Real Transfer | | 0 |
| Cryo-RALib -- a modular library for accelerating alignment in cryo-EM | Code | 0 |
| Perturbation-based exploration methods in deep reinforcement learning | | 0 |
| Benchmarking LF-MMI, CTC and RNN-T Criteria for Streaming ASR | | 0 |
| Characterizing Transactional Databases for Frequent Itemset Mining | | 0 |
| A Comprehensive Comparison of Multi-Dimensional Image Denoising Methods | Code | 0 |
| Beyond Marginal Uncertainty: How Accurately can Bayesian Regression Models Estimate Posterior Predictive Correlations? | Code | 0 |
| Defense-friendly Images in Adversarial Attacks: Dataset and Metrics for Perturbation Difficulty | Code | 0 |
| InferBench: Understanding Deep Learning Inference Serving with an Automatic Benchmarking System | | 0 |
| The Forchheim Image Database for Camera Identification in the Wild | | 0 |
| EEGS: A Transparent Model of Emotions | | 0 |
| Face Morphing Attack Generation & Detection: A Comprehensive Survey | | 0 |
| Rearrangement: A Challenge for Embodied AI | | 0 |
| IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP | | 0 |
| Neural Network Design: Learning from Neural Architecture Search | Code | 0 |
| Alibaba’s Submission for the WMT 2020 APE Shared Task: Improving Automatic Post-Editing with Pre-trained Conditional Cross-Lingual BERT | | 0 |
| Cross-lingual sentiment classification in low-resource Bengali language | Code | 0 |
| On the Reliability and Validity of Detecting Approval of Political Actors in Tweets | | 0 |
| Is Transfer Learning Necessary for Protein Landscape Prediction? | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |