SOTAVerified

Benchmarking

Papers

Showing 4576-4600 of 5548 papers

| Title | Status | Hype |
| --- | --- | --- |
| Inverse Contextual Bandits: Learning How Behavior Evolves over Time | Code | 0 |
| R3L: Connecting Deep Reinforcement Learning to Recurrent Neural Networks for Image Denoising via Residual Recovery | | 0 |
| A Framework and Benchmarking Study for Counterfactual Generating Methods on Tabular Data | | 0 |
| Intrinsic uncertainties and where to find them | | 0 |
| Connectivity Matters: Neural Network Pruning Through the Lens of Effective Sparsity | Code | 0 |
| SocialAI: Benchmarking Socio-Cognitive Abilities in Deep Reinforcement Learning Agents | | 0 |
| Modelling Neuronal Behaviour with Time Series Regression: Recurrent Neural Networks on C. Elegans Data | | 0 |
| Benchmarking ASR Systems Based on Post-Editing Effort and Error Analysis | | 0 |
| CityNet: A Comprehensive Multi-Modal Urban Dataset for Advanced Research in Urban Computing | Code | 0 |
| Exploring Context Generalizability in Citywide Crowd Mobility Prediction: An Analytic Framework and Benchmark | Code | 0 |
| On the Interaction of Belief Bias and Explanations | | 0 |
| Dataset and Benchmarking of Real-Time Embedded Object Detection for RoboCup SSL | | 0 |
| Efficient Realistic Data Generation Framework leveraging Deep Learning-based Human Digitization | Code | 0 |
| Rail-5k: a Real-World Dataset for Rail Surface Defects Detection | | 0 |
| Mitigating severe over-parameterization in deep convolutional neural networks through forced feature abstraction and compression with an entropy-based heuristic | | 0 |
| PatentNet: A Large-Scale Incomplete Multiview, Multimodal, Multilabel Industrial Goods Image Database | | 0 |
| CUDA-GHR: Controllable Unsupervised Domain Adaptation for Gaze and Head Redirection | Code | 0 |
| Learning Graphs for Knowledge Transfer With Limited Labels | | 0 |
| A Survey on Semi-Supervised Learning for Delayed Partially Labelled Data Streams | | 0 |
| A Spiking Neural Network for Image Segmentation | | 0 |
| Effective Evaluation of Deep Active Learning on Image Classification Tasks | | 0 |
| Hotel Recognition via Latent Image Embedding | | 0 |
| Node Classification Meets Link Prediction on Knowledge Graphs | | 0 |
| On the Convergence of Differentially Private Federated Learning on Non-Lipschitz Objectives, and with Normalized Client Updates | | 0 |
| Cross-replication Reliability -- An Empirical Approach to Interpreting Inter-rater Reliability | | 0 |
Page 184 of 222

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |