SOTAVerified

Binary Classification

Papers

Showing 21512160 of 2574 papers

TitleStatusHype
JARVix at SemEval-2022 Task 2: It Takes One to Know One? Idiomaticity Detection using Zero and One-Shot LearningCode0
SoftPQ: Robust Instance Segmentation Evaluation via Soft Matching and Tunable ThresholdsCode0
ECG-Based Electrolyte Prediction: Evaluating Regression and Probabilistic MethodsCode0
Towards adversarial robustness with 01 loss neural networksCode0
Okapi: Generalising Better by Making Statistical Matches MatchCode0
THaMES: An End-to-End Tool for Hallucination Mitigation and Evaluation in Large Language ModelsCode0
"John is 50 years old, can his son be 65?" Evaluating NLP Models' Understanding of FeasibilityCode0
Solution Path Algorithm for Twin Multi-class Support Vector MachineCode0
Effect of the output activation function on the probabilities and errors in medical image segmentationCode0
Deep Generative Learning via Variational Gradient FlowCode0
Show:102550
← PrevPage 216 of 258Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Trompt + OpenAI embeddingAUROC0.98Unverified
2LightGBM + OpenAI embeddingAUROC0.97Unverified
3FTTransformer + RoBERTa fintuneAUROC0.96Unverified
4LightGBM + RoBERTa embeddingAUROC0.95Unverified
5FTTransformer + RoBERTa embeddingAUROC0.94Unverified
6ResNet + RoBERTa embeddingAUROC0.93Unverified
7ResNet + OpenAI embeddingAUROC0.92Unverified
8FTTransformer + OpenAI embeddingAUROC0.91Unverified
#ModelMetricClaimedVerifiedStatus
1Trompt + OpenAI embeddingAUROC0.81Unverified
2Multimodal-Net All-TextAUROC0.8Unverified
3ResNet + RoBERTa finetuneAUROC0.79Unverified
4LightGBM + RoBERTa embeddingAUROC0.77Unverified
#ModelMetricClaimedVerifiedStatus
1Attention based CNNF1 score0.93Unverified
#ModelMetricClaimedVerifiedStatus
1XGBoostF1-Score98.79Unverified