SOTAVerified

Benchmarking

Papers

Showing 41764200 of 5548 papers

TitleStatusHype
PASTA: A Dataset for Modeling Participant States in Narratives0
Benchmarking Azerbaijani Neural Machine Translation0
Content-Aware Differential Privacy with Conditional Invertible Neural NetworksCode0
Towards Large-Scale Small Object Detection: Survey and Benchmarks0
Toward Transparent AI: A Survey on Interpreting the Inner Structures of Deep Neural Networks0
3DOS: Towards 3D Open Set Learning -- Benchmarking and Understanding Semantic Novelty Detection on Point CloudsCode0
Rethinking the Reference-based Distinctive Image CaptioningCode0
PieTrack: An MOT solution based on synthetic data training and self-supervised domain adaptation0
Benchmarking tools for a priori identifiability analysisCode0
Operation-Level Performance Benchmarking of Graph Neural Networks for Scientific ApplicationsCode0
Benchmarking Transformers-based models on French Spoken Language Understanding tasks0
The Multiple Subnetwork Hypothesis: Enabling Multidomain Learning by Isolating Task-Specific Subnetworks in Feedforward Neural NetworksCode0
Benchmarking Machine Learning Robustness in Covid-19 Genome Sequence ClassificationCode0
GOAL: Towards Benchmarking Few-Shot Sports Game SummarizationCode0
Bias Mitigation for Machine Learning Classifiers: A Comprehensive Survey0
Immunofluorescence Capillary Imaging Segmentation: Cases StudyCode0
Automated Detection of Label Errors in Semantic Segmentation Datasets via Deep Learning and Uncertainty QuantificationCode0
Slot Filling for Extracting Reskilling and Upskilling Options from the WebCode0
A novel evaluation methodology for supervised Feature Ranking algorithmsCode0
Ensemble random forest filter: An alternative to the ensemble Kalman filter for inverse modeling0
OVQA: A Clinically Generated Visual Question Answering Dataset0
Benefits and Challenges of Dynamic Modelling of Cascading Failures in Power Systems0
Identifying the Context Shift between Test Benchmarks and Production Data0
Towards Toxic Positivity Detection0
DACSA: A large-scale Dataset for Automatic summarization of Catalan and Spanish newspaper Articles0
Show:102550
← PrevPage 168 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified