SOTAVerified

Benchmarking

Papers

Showing 46264650 of 5548 papers

TitleStatusHype
ABSA-Bench: Towards the Unified Evaluation of Aspect-based Sentiment Analysis Research0
Benchmarking Automated Review Response Generation for the Hospitality Domain0
AraBench: Benchmarking Dialectal Arabic-English Machine Translation0
mlOSP: Towards a Unified Implementation of Regression Monte Carlo AlgorithmsCode0
Evaluating Attribution for Graph Neural NetworksCode1
Bayesian Multi-type Mean Field Multi-agent Imitation Learning0
Meta learning to classify intent and slot labels with noisy few shot examples0
PMLB v1.0: An open source dataset collection for benchmarking machine learning methodsCode1
RealCause: Realistic Causal Inference Benchmarking0
Class-agnostic Object Detection0
A survey of benchmarking frameworks for reinforcement learning0
Improving Augmentation and Evaluation Schemes for Semantic Image Synthesis0
Cable Tree Wiring -- Benchmarking Solvers on a Real-World Scheduling Problem with a Variety of Precedence ConstraintsCode0
Benchmarking Image Retrieval for Visual LocalizationCode1
Benchmarking Inference Performance of Deep Learning Models on Analog Devices0
RobustPointSet: A Dataset for Benchmarking Robustness of Point Cloud ClassifiersCode1
Spatially Correlated Patterns in Adversarial Images0
Variational Laplace for Bayesian neural networks0
FedEval: A Holistic Evaluation Framework for Federated Learning0
Automatic Microprocessor Performance Bug Detection0
Real-Time Polyp Detection, Localization and Segmentation in Colonoscopy Using Deep LearningCode1
Tonic: A Deep Reinforcement Learning Library for Fast Prototyping and BenchmarkingCode1
SoftGym: Benchmarking Deep Reinforcement Learning for Deformable Object ManipulationCode1
Benchmarking Domain Randomisation for Visual Sim-to-Real Transfer0
tvopt: A Python Framework for Time-Varying OptimizationCode1
Show:102550
← PrevPage 186 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified