SOTAVerified

Binary text classification

Papers

Showing 110 of 20 papers

TitleStatusHype
MAGE: Machine-generated Text Detection in the WildCode2
LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected?Code2
TweepFake: about Detecting Deepfake TweetsCode1
Ghostbuster: Detecting Text Ghostwritten by Large Language ModelsCode1
TURINGBENCH: A Benchmark Environment for Turing Test in the Age of Neural Text GenerationCode1
Active Learning for BERT: An Empirical StudyCode1
GigaCheck: Detecting LLM-generated Content0
Analyzing the Generalizability of Deep Contextualized Language Representations For Text Classification0
Neural Legal Judgment Prediction in English0
Calibrated Large Language Models for Binary Question Answering0
Show:102550
← PrevPage 1 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GigaCheck (Mistral-7B)F1 score1Unverified
2GhostbusterF1 score0.99Unverified
#ModelMetricClaimedVerifiedStatus
1GigaCheck (Mistral-7B)Average Recall0.96Unverified
2LongformerAverage Recall0.91Unverified
#ModelMetricClaimedVerifiedStatus
1GigaCheck (Mistral-7B)F1 score0.99Unverified
2RadarF1 score0.88Unverified
#ModelMetricClaimedVerifiedStatus
1GigaCheck (Mistral-7B)F1 score1Unverified
2RoBERTaF1 score0.45Unverified
#ModelMetricClaimedVerifiedStatus
1GigaCheck (Mistral-7B)F1 score0.97Unverified
2RoBERTaF1 score0.52Unverified
#ModelMetricClaimedVerifiedStatus
1GigaCheck (Mistral-7B)F1 score0.94Unverified
2XLNetF1 score0.88Unverified
#ModelMetricClaimedVerifiedStatus
1HIER-BERTMacro F182Unverified