SOTAVerified

Benchmarking

Papers

Showing 41514200 of 5548 papers

TitleStatusHype
TEP-GNN: Accurate Execution Time Prediction of Functional Tests using Graph Neural Networks0
Towards Benchmarking Explainable Artificial Intelligence Methods0
Bugs in the Data: How ImageNet Misrepresents BiodiversityCode0
StEduCov: An Explored and Benchmarked Dataset on Stance Detection in Tweets towards Online Education during COVID-19 Pandemic0
MechProNet: Machine Learning Prediction of Mechanical Properties in Metal Additive Manufacturing0
SIM2E: Benchmarking the Group Equivariant Capability of Correspondence Matching Algorithms0
A biologically-inspired multi-modal evaluation of molecular generative machine learning0
Wildfire Forecasting with Satellite Images and Deep Generative Model0
The Low Emission Oil&Gas Open (LEOGO) Reference Platform of an Off-Grid Energy System for Renewable Integration Studies0
Unsupervised machine learning approach for building composite indicators with fuzzy metrics0
Sensitivity analysis and experimental evaluation of PID-like continuous sliding mode control0
Benchmarking Joint Face Spoofing and Forgery Detection with Visual and Physiological Cues0
Exact lattice-based stochastic cell culture simulation algorithms incorporating spontaneous and contact-dependent reactions0
fMRI-S4: learning short- and long-range dynamic fMRI dependencies using 1D Convolutions and State Space ModelsCode0
QSAM-Net: Rain streak removal by quaternion neural network with self-attention module0
SOMPT22: A Surveillance Oriented Multi-Pedestrian Tracking Dataset0
AstroVision: Towards Autonomous Feature Detection and Description for Missions to Small Bodies Using Deep LearningCode0
Benchmarking zero-shot and few-shot approaches for tokenization, tagging, and dependency parsing of Tagalog text0
Binary Classification with Positive Labeling Sources0
ferret: a Framework for Benchmarking Explainers on TransformersCode0
On the role of benchmarking data sets and simulations in method comparison studies0
Benchmarking Visual-Inertial Deep Multimodal Fusion for Relative Pose Regression and Odometry-aided Absolute Pose Regression0
A Case for Dataset Specific Profiling0
On the Evaluation of User Privacy in Deep Neural Networks using Timing Side Channel0
Vector-Based Data Improves Left-Right Eye-Tracking Classifier Performance After a Covariate Distributional ShiftCode0
PASTA: A Dataset for Modeling Participant States in Narratives0
Benchmarking Azerbaijani Neural Machine Translation0
Content-Aware Differential Privacy with Conditional Invertible Neural NetworksCode0
Towards Large-Scale Small Object Detection: Survey and Benchmarks0
Toward Transparent AI: A Survey on Interpreting the Inner Structures of Deep Neural Networks0
3DOS: Towards 3D Open Set Learning -- Benchmarking and Understanding Semantic Novelty Detection on Point CloudsCode0
Rethinking the Reference-based Distinctive Image CaptioningCode0
PieTrack: An MOT solution based on synthetic data training and self-supervised domain adaptation0
Benchmarking tools for a priori identifiability analysisCode0
Operation-Level Performance Benchmarking of Graph Neural Networks for Scientific ApplicationsCode0
Benchmarking Transformers-based models on French Spoken Language Understanding tasks0
The Multiple Subnetwork Hypothesis: Enabling Multidomain Learning by Isolating Task-Specific Subnetworks in Feedforward Neural NetworksCode0
Benchmarking Machine Learning Robustness in Covid-19 Genome Sequence ClassificationCode0
GOAL: Towards Benchmarking Few-Shot Sports Game SummarizationCode0
Bias Mitigation for Machine Learning Classifiers: A Comprehensive Survey0
Immunofluorescence Capillary Imaging Segmentation: Cases StudyCode0
Automated Detection of Label Errors in Semantic Segmentation Datasets via Deep Learning and Uncertainty QuantificationCode0
Slot Filling for Extracting Reskilling and Upskilling Options from the WebCode0
A novel evaluation methodology for supervised Feature Ranking algorithmsCode0
Ensemble random forest filter: An alternative to the ensemble Kalman filter for inverse modeling0
OVQA: A Clinically Generated Visual Question Answering Dataset0
Benefits and Challenges of Dynamic Modelling of Cascading Failures in Power Systems0
Identifying the Context Shift between Test Benchmarks and Production Data0
Towards Toxic Positivity Detection0
DACSA: A large-scale Dataset for Automatic summarization of Catalan and Spanish newspaper Articles0
Show:102550
← PrevPage 84 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified