SOTAVerified

Binary text classification

Papers

Showing 120 of 20 papers

TitleStatusHype
MAGE: Machine-generated Text Detection in the WildCode2
LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected?Code2
TweepFake: about Detecting Deepfake TweetsCode1
Ghostbuster: Detecting Text Ghostwritten by Large Language ModelsCode1
TURINGBENCH: A Benchmark Environment for Turing Test in the Age of Neural Text GenerationCode1
Active Learning for BERT: An Empirical StudyCode1
GigaCheck: Detecting LLM-generated Content0
Analyzing the Generalizability of Deep Contextualized Language Representations For Text Classification0
Neural Legal Judgment Prediction in English0
Calibrated Large Language Models for Binary Question Answering0
CIA_NITT at WNUT-2020 Task 2: Classification of COVID-19 Tweets Using Pre-trained Language Models0
Deception Detection for the Russian Language: Lexical and Syntactic Parameters0
Argumentative Explanations for Pattern-Based Text Classifiers0
Reliable Decision Support with LLMs: A Framework for Evaluating Consistency in Binary Text Classification Applications0
DACCORD : un jeu de données pour la Détection Automatique d'énonCés COntRaDictoires en françaisCode0
Evaluating shallow and deep learning strategies for the 2018 n2c2 shared task on clinical text classificationCode0
Expectation Backpropagation: Parameter-Free Training of Multilayer Neural Networks with Continuous or Discrete WeightsCode0
Hoaxpedia: A Unified Wikipedia Hoax Articles DatasetCode0
Identification of the Relevance of Comments in Codes Using Bag of Words and Transformer Based ModelsCode0
Learning Representations for Soft Skill MatchingCode0
Show:102550

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GigaCheck (Mistral-7B)F1 score1Unverified
2GhostbusterF1 score0.99Unverified
#ModelMetricClaimedVerifiedStatus
1GigaCheck (Mistral-7B)Average Recall0.96Unverified
2LongformerAverage Recall0.91Unverified
#ModelMetricClaimedVerifiedStatus
1GigaCheck (Mistral-7B)F1 score0.99Unverified
2RadarF1 score0.88Unverified
#ModelMetricClaimedVerifiedStatus
1GigaCheck (Mistral-7B)F1 score1Unverified
2RoBERTaF1 score0.45Unverified
#ModelMetricClaimedVerifiedStatus
1GigaCheck (Mistral-7B)F1 score0.97Unverified
2RoBERTaF1 score0.52Unverified
#ModelMetricClaimedVerifiedStatus
1GigaCheck (Mistral-7B)F1 score0.94Unverified
2XLNetF1 score0.88Unverified
#ModelMetricClaimedVerifiedStatus
1HIER-BERTMacro F182Unverified