SOTAVerified

Hate Speech Detection

Hate speech detection is the task of determining whether a communication, such as text or audio, expresses hatred or encourages violence towards a person or a group of people. Such content is usually rooted in prejudice against 'protected characteristics' such as ethnicity, gender, sexual orientation, religion, or age. Example benchmarks include ETHOS and HateXplain. Models are commonly evaluated with metrics such as the F-score (also called the F-measure).
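As a rough illustration of the F-score mentioned above, the sketch below computes precision, recall, and the F-beta score for binary labels in plain Python. The function name `f_beta` and the toy labels are assumptions made for this example; they do not come from ETHOS, HateXplain, or any listed paper.

```python
def f_beta(y_true, y_pred, beta=1.0):
    """F-beta score for binary labels (1 = hateful, 0 = not hateful).

    Hypothetical helper for illustration; beta=1.0 gives the usual F1-score.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    if tp == 0:
        # No correct positive predictions: define the score as 0.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Toy labels, invented for illustration only.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 1, 0, 1, 0]

f1 = f_beta(y_true, y_pred)
print(f"F1 = {f1:.2f}")  # prints "F1 = 0.75"
```

Here precision and recall are both 3/4, so their harmonic mean is 0.75. Because hate speech datasets are often imbalanced, an F-score tends to be more informative than plain accuracy, which a classifier can inflate by always predicting the majority class.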

Papers

Showing 301–325 of 507 papers

| Title | Status | Hype |
|---|---|---|
| Analyzing the Real Vulnerability of Hate Speech Detection Systems against Targeted Intentional Noise | | 0 |
| Know-Center at SemEval-2019 Task 5: Multilingual Hate Speech Detection on Twitter using CNNs | | 0 |
| L3Cube-MahaHate: A Tweet-based Marathi Hate Speech Detection Dataset and BERT models | | 0 |
| L3Cube-MahaNLP: Marathi Natural Language Processing Datasets, Models, and Library | | 0 |
| ToKen: Task Decomposition and Knowledge Infusion for Few-Shot Hate Speech Detection | | 0 |
| LAHM : Large Annotated Dataset for Multi-Domain and Multilingual Hate Speech Identification | | 0 |
| LAMPO: Large Language Models as Preference Machines for Few-shot Ordinal Classification | | 0 |
| Large-Scale Hate Speech Detection with Cross-Domain Transfer | | 0 |
| Analysis and Detection of Multilingual Hate Speech Using Transformer Based Deep Learning | | 0 |
| A multilingual dataset for offensive language and hate speech detection for hausa, yoruba and igbo languages | | 0 |
| Towards A Multi-agent System for Online Hate Speech Detection | | 0 |
| Learning Domain Terms - Empirical Methods to Enhance Enterprise Text Analytics Performance | | 0 |
| American Hate Crime Trends Prediction with Event Extraction | | 0 |
| Leveraging Affective Bidirectional Transformers for Offensive Language Detection | | 0 |
| Leveraging Annotator Disagreement for Text Classification | | 0 |
| Leveraging cross-platform data to improve automated hate speech detection | | 0 |
| Leveraging Intra-User and Inter-User Representation Learning for Automated Hate Speech Detection | | 0 |
| Wisdom of Instruction-Tuned Language Model Crowds. Exploring Model Label Variation | | 0 |
| Leveraging Language Identification to Enhance Code-Mixed Text Classification | | 0 |
| Towards Argument Mining for Social Good: A Survey | | 0 |
| Watching the Watchers: A Comparative Fairness Audit of Cloud-based Content Moderation Services | | 0 |
| Leveraging Transformers for Hate Speech Detection in Conversational Code-Mixed Tweets | | 0 |
| Leveraging Weakly Annotated Data for Hate Speech Detection in Code-Mixed Hinglish: A Feasibility-Driven Transfer Learning Approach with Large Language Models | | 0 |
| Leveraging World Knowledge in Implicit Hate Speech Detection | | 0 |
| Towards Code-switched Classification Exploiting Constituent Language Resources | | 0 |

Benchmark Results

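| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BiLSTM + static BE | F1-score | 0.8 | | Unverified |
| 2 | BERT | F1-score | 0.79 | | Unverified |
| 3 | BiLSTM+Attention+FT | F1-score | 0.77 | | Unverified |
| 4 | OPT-175B (few-shot) | F1-score | 0.76 | | Unverified |
| 5 | CNN+Attention+FT+GV | F1-score | 0.74 | | Unverified |
| 6 | OPT-175B (one-shot) | F1-score | 0.71 | | Unverified |
| 7 | OPT-175B (zero-shot) | F1-score | 0.67 | | Unverified |
| 8 | SVM | F1-score | 0.66 | | Unverified |
| 9 | Random Forests | F1-score | 0.64 | | Unverified |
| 10 | Davinci (zero-shot) | F1-score | 0.63 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BERT-MRP | AUROC | 0.86 | | Unverified |
| 2 | BERT-RP | AUROC | 0.85 | | Unverified |
| 3 | BERT-HateXplain [Attn] | AUROC | 0.85 | | Unverified |
| 4 | BERT-HateXplain [LIME] | AUROC | 0.85 | | Unverified |
| 5 | BERT [Attn] | AUROC | 0.84 | | Unverified |
| 6 | BiRNN-HateXplain [Attn] | AUROC | 0.81 | | Unverified |
| 7 | BiRNN-Attn [Attn] | AUROC | 0.8 | | Unverified |
| 8 | CNN-GRU [LIME] | AUROC | 0.79 | | Unverified |
| 9 | BiRNN [LIME] | AUROC | 0.77 | | Unverified |
| 10 | XG-HSI-BERT | Accuracy | 0.75 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MLARAM | Hamming Loss | 0.29 | | Unverified |
| 2 | MLkNN | Hamming Loss | 0.16 | | Unverified |
| 3 | Binary Relevance | Hamming Loss | 0.14 | | Unverified |
| 4 | Neural Classifier Chains | Hamming Loss | 0.13 | | Unverified |
| 5 | Neural Binary Relevance | Hamming Loss | 0.11 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Mozafari et al., 2019 | AAA | 50.94 | | Unverified |
| 2 | SVM | AAA | 46.51 | | Unverified |
| 3 | Kennedy et al., 2020 | AAA | 45.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | HateBERT | Macro F1 | 0.74 | | Unverified |
| 2 | BERT | Macro F1 | 0.72 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | mBert | Accuracy | 0.83 | | Unverified |
| 2 | Logistic Regression | Accuracy | 0.7 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | HXP + CLAP + CLIP | TEST F1 (macro) | 0.85 | | Unverified |
| 2 | BERT + ViT + MFCC | TEST F1 (macro) | 0.79 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | HateBERT | Macro F1 | 0.49 | | Unverified |
| 2 | BERT | Macro F1 | 0.48 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | HateBERT | Macro F1 | 0.81 | | Unverified |
| 2 | BERT | Macro F1 | 0.8 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Multilingual BERT | F1-score | 0.75 | | Unverified |
| 2 | AutoML | F1-score | 0.74 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | AOM mBERT | F1 | 0.85 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Baseline | F1 | 0.7 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | RoBERTa-large-ST | Macro F1 | 80.7 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Baseline BERT (task A) | F1 | 0.77 | | Unverified |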
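
Two of the metrics reported in these results, Hamming Loss and Macro F1, are less familiar than accuracy. The sketch below shows one common way to compute each in plain Python. The helper names, label set, and toy data are assumptions made for illustration and are not taken from any of the listed benchmarks. Note that Hamming Loss is an error rate, so lower values are better.

```python
def hamming_loss(y_true, y_pred):
    """Fraction of individual label slots predicted incorrectly.

    Each row is a multi-label indicator vector, e.g. [1, 0, 1].
    Hypothetical helper for illustration; lower is better.
    """
    total = sum(len(row) for row in y_true)
    wrong = sum(
        1
        for true_row, pred_row in zip(y_true, y_pred)
        for t, p in zip(true_row, pred_row)
        if t != p
    )
    return wrong / total


def macro_f1(y_true, y_pred, labels):
    """Unweighted mean of per-class F1 scores (single-label, multi-class)."""
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never occurs.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy multi-label data (invented): 4 samples, 3 label slots each.
ml_true = [[1, 0, 1], [0, 1, 0], [1, 1, 0], [0, 0, 1]]
ml_pred = [[1, 0, 0], [0, 1, 0], [1, 0, 0], [0, 0, 1]]
hl = hamming_loss(ml_true, ml_pred)  # 2 wrong slots out of 12

# Toy multi-class data (invented), using an assumed three-way label set.
labels = ["hate", "offensive", "normal"]
mc_true = ["hate", "normal", "offensive", "hate", "normal", "offensive"]
mc_pred = ["hate", "normal", "hate", "hate", "offensive", "offensive"]
mf = macro_f1(mc_true, mc_pred, labels)

print(f"Hamming loss = {hl:.3f}")
print(f"Macro F1 = {mf:.3f}")
```

Macro averaging gives every class equal weight regardless of how many examples it has, which is why it is often preferred when the hateful class is rare.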