SOTAVerified

Benchmarking

Papers

Showing 4551–4600 of 5548 papers

| Title | Status | Hype |
| --- | --- | --- |
| Deep Reinforcement Learning for Continuous Docking Control of Autonomous Underwater Vehicles: A Benchmarking Study |  | 0 |
| Intelligent Railway Foreign Object Detection: A Semi-supervised Convolutional Autoencoder Based Method |  | 0 |
| Terabyte-scale supervised 3D training and benchmarking dataset of the mouse kidney |  | 0 |
| Comparative Analysis of Packages and Algorithms for the Analysis of Spatially Resolved Transcriptomics Data |  | 0 |
| The Effect of Domain and Diacritics in Yoruba–English Neural Machine Translation |  | 0 |
| Improving Model Generalization: A Chinese Named Entity Recognition Case Study |  | 0 |
| What Motivates You? Benchmarking Automatic Detection of Basic Needs from Short Posts |  | 0 |
| Benchmarking Scalable Methods for Streaming Cross Document Entity Coreference | Code | 0 |
| Multimodal or Text? Retrieval or BERT? Benchmarking Classifiers for the Shared Task on Hateful Memes |  | 0 |
| Multilingual Protest News Detection - Shared Task 1, CASE 2021 |  | 0 |
| Benchmarking Neural Topic Models: An Empirical Study |  | 0 |
| Cross-replication Reliability - An Empirical Approach to Interpreting Inter-rater Reliability |  | 0 |
| SignalGP-Lite: Event Driven Genetic Programming Library for Large-Scale Artificial Life Applications | Code | 0 |
| Reradiation and Scattering from a Reconfigurable Intelligent Surface: A General Macroscopic Model |  | 0 |
| AA3DNet: Attention Augmented Real Time 3D Object Detection |  | 0 |
| Benchmarking AutoML Frameworks for Disease Prediction Using Medical Claims |  | 0 |
| 3D fluorescence microscopy data synthesis for segmentation and benchmarking | Code | 0 |
| An Exploration of Exploration: Measuring the ability of lexicase selection to find obscure pathways to optimality | Code | 0 |
| PhD Thesis on Code Modulated Interferometric Imaging System using Phased Arrays |  | 0 |
| Attribution of Predictive Uncertainties in Classification Models | Code | 0 |
| Learned Sorted Table Search and Static Indexes in Small Model Space | Code | 0 |
| Better force fields start with better data -- A data set of cation dipeptide interactions | Code | 0 |
| ECG-Adv-GAN: Detecting ECG Adversarial Examples with Conditional Generative Adversarial Networks |  | 0 |
| Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi |  | 0 |
| The Benchmark Lottery |  | 0 |
| Inverse Contextual Bandits: Learning How Behavior Evolves over Time | Code | 0 |
| R3L: Connecting Deep Reinforcement Learning to Recurrent Neural Networks for Image Denoising via Residual Recovery |  | 0 |
| A Framework and Benchmarking Study for Counterfactual Generating Methods on Tabular Data |  | 0 |
| Intrinsic uncertainties and where to find them |  | 0 |
| Connectivity Matters: Neural Network Pruning Through the Lens of Effective Sparsity | Code | 0 |
| SocialAI: Benchmarking Socio-Cognitive Abilities in Deep Reinforcement Learning Agents |  | 0 |
| Modelling Neuronal Behaviour with Time Series Regression: Recurrent Neural Networks on C. Elegans Data |  | 0 |
| Benchmarking ASR Systems Based on Post-Editing Effort and Error Analysis |  | 0 |
| CityNet: A Comprehensive Multi-Modal Urban Dataset for Advanced Research in Urban Computing | Code | 0 |
| Exploring Context Generalizability in Citywide Crowd Mobility Prediction: An Analytic Framework and Benchmark | Code | 0 |
| On the Interaction of Belief Bias and Explanations |  | 0 |
| Dataset and Benchmarking of Real-Time Embedded Object Detection for RoboCup SSL |  | 0 |
| Efficient Realistic Data Generation Framework leveraging Deep Learning-based Human Digitization | Code | 0 |
| Rail-5k: a Real-World Dataset for Rail Surface Defects Detection |  | 0 |
| Mitigating severe over-parameterization in deep convolutional neural networks through forced feature abstraction and compression with an entropy-based heuristic |  | 0 |
| PatentNet: A Large-Scale Incomplete Multiview, Multimodal, Multilabel Industrial Goods Image Database |  | 0 |
| CUDA-GHR: Controllable Unsupervised Domain Adaptation for Gaze and Head Redirection | Code | 0 |
| Learning Graphs for Knowledge Transfer With Limited Labels |  | 0 |
| A Survey on Semi-Supervised Learning for Delayed Partially Labelled Data Streams |  | 0 |
| A Spiking Neural Network for Image Segmentation |  | 0 |
| Effective Evaluation of Deep Active Learning on Image Classification Tasks |  | 0 |
| Hotel Recognition via Latent Image Embedding |  | 0 |
| Node Classification Meets Link Prediction on Knowledge Graphs |  | 0 |
| On the Convergence of Differentially Private Federated Learning on Non-Lipschitz Objectives, and with Normalized Client Updates |  | 0 |
| Cross-replication Reliability -- An Empirical Approach to Interpreting Inter-rater Reliability |  | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 Turbo | ACC | 0.56 |  | Unverified |