SOTAVerified

Binary text classification

Papers

Showing 1–20 of 20 papers

Title | Status | Hype
MAGE: Machine-generated Text Detection in the Wild | Code | 2
LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected? | Code | 2
TweepFake: about Detecting Deepfake Tweets | Code | 1
Ghostbuster: Detecting Text Ghostwritten by Large Language Models | Code | 1
TURINGBENCH: A Benchmark Environment for Turing Test in the Age of Neural Text Generation | Code | 1
Active Learning for BERT: An Empirical Study | Code | 1
DACCORD : un jeu de données pour la Détection Automatique d'énonCés COntRaDictoires en français | Code | 0
Evaluating shallow and deep learning strategies for the 2018 n2c2 shared task on clinical text classification | Code | 0
Expectation Backpropagation: Parameter-Free Training of Multilayer Neural Networks with Continuous or Discrete Weights | Code | 0
Hoaxpedia: A Unified Wikipedia Hoax Articles Dataset | Code | 0
Identification of the Relevance of Comments in Codes Using Bag of Words and Transformer Based Models | Code | 0
Learning Representations for Soft Skill Matching | Code | 0
Calibrated Large Language Models for Binary Question Answering | - | 0
CIA_NITT at WNUT-2020 Task 2: Classification of COVID-19 Tweets Using Pre-trained Language Models | - | 0
Deception Detection for the Russian Language: Lexical and Syntactic Parameters | - | 0
Argumentative Explanations for Pattern-Based Text Classifiers | - | 0
Analyzing the Generalizability of Deep Contextualized Language Representations For Text Classification | - | 0
Neural Legal Judgment Prediction in English | - | 0
Reliable Decision Support with LLMs: A Framework for Evaluating Consistency in Binary Text Classification Applications | - | 0
GigaCheck: Detecting LLM-generated Content | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GigaCheck (Mistral-7B) | F1 score | 1 | - | Unverified
2 | Ghostbuster | F1 score | 0.99 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GigaCheck (Mistral-7B) | Average Recall | 0.96 | - | Unverified
2 | Longformer | Average Recall | 0.91 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GigaCheck (Mistral-7B) | F1 score | 0.99 | - | Unverified
2 | Radar | F1 score | 0.88 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GigaCheck (Mistral-7B) | F1 score | 1 | - | Unverified
2 | RoBERTa | F1 score | 0.45 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GigaCheck (Mistral-7B) | F1 score | 0.97 | - | Unverified
2 | RoBERTa | F1 score | 0.52 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GigaCheck (Mistral-7B) | F1 score | 0.94 | - | Unverified
2 | XLNet | F1 score | 0.88 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | HIER-BERT | Macro F1 | 82 | - | Unverified