SOTAVerified

Benchmarking

Papers

Showing 37763800 of 5548 papers

TitleStatusHype
Diverse Community Data for Benchmarking Data Privacy Algorithms0
Did the Models Understand Documents? Benchmarking Models for Language Understanding in Document-Level Relation ExtractionCode0
Benchmarking Robustness of Deep Reinforcement Learning approaches to Online Portfolio Management0
Fairness Index Measures to Evaluate Bias in Biometric Recognition0
Using Motif Transitions for Temporal Graph GenerationCode0
Formal Covariate Benchmarking to Bound Omitted Variable Bias0
MA-BBOB: Many-Affine Combinations of BBOB Functions for Evaluating AutoML Approaches in Noiseless Numerical Black-Box Optimization Contexts0
Benchmarking Deep Learning Architectures for Urban Vegetation Point Cloud Semantic Segmentation from MLS0
Framework and Benchmarks for Combinatorial and Mixed-variable Bayesian Optimization0
ALP: Action-Aware Embodied Learning for Perception0
Acoustic Identification of Ae. aegypti Mosquitoes using Smartphone Apps and Residual Convolutional Neural NetworksCode0
Convolutional and Deep Learning based techniques for Time Series Ordinal Classification0
Dissecting Multimodality in VideoQA Transformer Models by Impairing Modality Fusion0
One Law, Many Languages: Benchmarking Multilingual Legal Reasoning for Judicial SupportCode0
Large-Scale Quantum Separability Through a Reproducible Machine Learning Lens0
DISC: a Dataset for Integrated Sensing and Communication in mmWave Systems0
DiPlomat: A Dialogue Dataset for Situated Pragmatic Reasoning0
BED: Bi-Encoder-Based Detectors for Out-of-Distribution DetectionCode0
Re-Benchmarking Pool-Based Active Learning for Binary ClassificationCode0
RRSIS: Referring Remote Sensing Image Segmentation0
MUBen: Benchmarking the Uncertainty of Molecular Representation ModelsCode0
A Cloud-based Machine Learning Pipeline for the Efficient Extraction of Insights from Customer Reviews0
detrex: Benchmarking Detection Transformers0
Contribution à l'Optimisation d'un Comportement Collectif pour un Groupe de Robots Autonomes0
A Large-Scale Analysis on Self-Supervised Video Representation Learning0
Show:102550
← PrevPage 152 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified