SOTAVerified

Benchmarking

Papers

Showing 4551–4600 of 5548 papers

| Title | Status | Hype |
| --- | --- | --- |
| Towards Large Scale Automated Algorithm Design by Integrating Modular Benchmarking Frameworks | Code | 1 |
| Nonstochastic Bandits with Infinitely Many Experts |  | 0 |
| Benchmarking Deep Graph Generative Models for Optimizing New Drug Molecules for COVID-19 | Code | 1 |
| Performance Evaluation of Transcriptomics Data Normalization for Survival Risk Prediction |  | 0 |
| Benchmarking of eight recurrent neural network variants for breath phase and adventitious sound detection on a self-developed open-access lung sound database-HF_Lung_V1 |  | 0 |
| Learning Conjoint Attentions for Graph Neural Nets | Code | 0 |
| Airport Capacity and Performance in Europe -- A study of transport economics, service quality and sustainability |  | 0 |
| One Label, One Billion Faces: Usage and Consistency of Racial Categories in Computer Vision |  | 0 |
| Benchmarking Quantized Neural Networks on FPGAs with FINN | Code | 1 |
| VIPPrint: A Large Scale Dataset of Printed and Scanned Images for Synthetic Face Images Detection and Source Linking |  | 0 |
| Evaluating Large-Vocabulary Object Detectors: The Devil is in the Details | Code | 2 |
| Benchmarking of Deep Learning Irradiance Forecasting Models from Sky Images -- an in-depth Analysis |  | 0 |
| DRIV100: In-The-Wild Multi-Domain Dataset and Evaluation for Real-World Domain Adaptation of Semantic Segmentation |  | 0 |
| Benchmarking real-time monitoring strategies for ethanol production from lignocellulosic biomass |  | 0 |
| BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation | Code | 0 |
| Benchmarking Invertible Architectures on Inverse Problems |  | 0 |
| How Good is a Video Summary? A New Benchmarking Dataset and Evaluation Framework Towards Realistic Video Summarization |  | 0 |
| Generating a Doppelganger Graph: Resembling but Distinct | Code | 1 |
| A Closer Look at Temporal Sentence Grounding in Videos: Dataset and Metric | Code | 0 |
| Protocol for Executing and Benchmarking Eight Computational Doublet-Detection Methods in Single-Cell RNA Sequencing Data Analysis |  | 0 |
| Noisy intermediate-scale quantum (NISQ) algorithms |  | 0 |
| Arabic Speech Recognition by End-to-End, Modular Systems and Human | Code | 0 |
| Benchmarking Perturbation-based Saliency Maps for Explaining Atari Agents | Code | 0 |
| Label-Efficient Point Cloud Semantic Segmentation: An Active Learning Approach |  | 0 |
| Latent Variable Models for Visual Question Answering |  | 0 |
| COSMOS: Catching Out-of-Context Misinformation with Self-Supervised Learning | Code | 1 |
| Grid Search Hyperparameter Benchmarking of BERT, ALBERT, and LongFormer on DuoRC |  | 0 |
| Automated Model Design and Benchmarking of 3D Deep Learning Models for COVID-19 Detection with Chest CT Scans | Code | 1 |
| Benchmarking Simulation-Based Inference | Code | 1 |
| Quantum Cognitively Motivated Decision Fusion for Video Sentiment Analysis |  | 0 |
| PyHealth: A Python Library for Health Predictive Models | Code | 2 |
| BERT-GT: Cross-sentence n-ary relation extraction with BERT and Graph Transformer |  | 0 |
| Investigating the Vision Transformer Model for Image Retrieval Tasks |  | 0 |
| RISEdb: a Novel Indoor Localization Dataset |  | 0 |
| Machine learning classification of non-Markovian noise disturbing quantum dynamics | Code | 0 |
| Benchmarking Machine Learning: How Fast Can Your Algorithms Go? |  | 0 |
| Shallow-UWnet : Compressed Model for Underwater Image Enhancement | Code | 1 |
| Benchmarking Knowledge-Enhanced Commonsense Question Answering via Knowledge-to-Text Transformation |  | 0 |
| KArSL: Arabic Sign Language Database | Code | 0 |
| Robust 2D/3D Vehicle Parsing in Arbitrary Camera Views for CVIS | Code | 0 |
| Benchmarking Ultra-High-Definition Image Super-Resolution |  | 0 |
| Long Range Arena : A Benchmark for Efficient Transformers |  | 0 |
| Don't stack layers in graph neural networks, wire them randomly |  | 0 |
| Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers | Code | 1 |
| Accurate and fast detection of copy number variations from short-read whole-genome sequencing with deep convolutional neural network |  | 0 |
| Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms |  | 0 |
| A Large-scale Study on Training Sample Memorization in Generative Modeling |  | 0 |
| SACoD: Sensor Algorithm Co-Design Towards Efficient CNN-powered Intelligent PhlatCam | Code | 0 |
| Anomaly detection in dynamical systems from measured time series |  | 0 |
| Maximum Categorical Cross Entropy (MCCE): A noise-robust alternative loss function to mitigate racial bias in Convolutional Neural Networks (CNNs) by reducing overfitting |  | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 Turbo | ACC | 0.56 |  | Unverified |