SOTAVerified

Benchmarking

Papers

Showing 41914200 of 5548 papers

TitleStatusHype
Immunofluorescence Capillary Imaging Segmentation: Cases StudyCode0
Automated Detection of Label Errors in Semantic Segmentation Datasets via Deep Learning and Uncertainty QuantificationCode0
Slot Filling for Extracting Reskilling and Upskilling Options from the WebCode0
A novel evaluation methodology for supervised Feature Ranking algorithmsCode0
Ensemble random forest filter: An alternative to the ensemble Kalman filter for inverse modeling0
OVQA: A Clinically Generated Visual Question Answering Dataset0
Benefits and Challenges of Dynamic Modelling of Cascading Failures in Power Systems0
Identifying the Context Shift between Test Benchmarks and Production Data0
Towards Toxic Positivity Detection0
DACSA: A large-scale Dataset for Automatic summarization of Catalan and Spanish newspaper Articles0
Show:102550
← PrevPage 420 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified