Hate Speech Detection

Hate speech detection is the task of detecting if communication such as text, audio, and so on contains hatred and or encourages violence towards a person or a group of people. This is usually based on prejudice against 'protected characteristics' such as their ethnicity, gender, sexual orientation, religion, age et al. Some example benchmarks are ETHOS and HateXplain. Models can be evaluated with metrics like the F-score or F-measure.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 351–400 of 507 papers

Title	Date	Tasks	Status	Hype
HONEST: Measuring Hurtful Sentence Completion in Language Models	Jun 1, 2021	Hate Speech DetectionHurtful Sentence Completion	CodeCode Available	1
Improving Automatic Hate Speech Detection with Multiword Expression Features	Jun 1, 2021	Hate Speech DetectionSentence	—Unverified	0
Online Hate: Behavioural Dynamics and Relationship with Misinformation	May 28, 2021	Hate Speech DetectionMisinformation	—Unverified	0
Data Expansion using Back Translation and Paraphrasing for Hate Speech Detection	May 25, 2021	Data AugmentationDecoder	—Unverified	0
A systematic review of Hate Speech automatic detection using Natural Language Processing	May 22, 2021	Deep LearningHate Speech Detection	—Unverified	0
A Legal Approach to Hate Speech – Operationalizing the EU’s Legal Framework against the Expression of Hatred as an NLP Task	May 16, 2021	Decision MakingHate Speech Detection	—Unverified	0
Role of Artificial Intelligence in Detection of Hateful Speech for Hinglish Data on Social Media	May 11, 2021	Hate Speech Detection	—Unverified	0
AraCOVID19-MFH: Arabic COVID-19 Multi-label Fake News and Hate Speech Detection Dataset	May 7, 2021	ArticlesDialect Identification	CodeCode Available	1
Towards A Multi-agent System for Online Hate Speech Detection	May 3, 2021	Hate Speech Detection	—Unverified	0
Cross-lingual hate speech detection based on multilingual domain-specific word embeddings	Apr 30, 2021	Hate Speech DetectionTransfer Learning	—Unverified	0
Contextual-Lexicon Approach for Abusive Language Detection	Apr 25, 2021	Abusive LanguageHate Speech Detection	—Unverified	0
Sexism detection: The first corpus in Algerian dialect with a code-switching in Arabic/ French and English	Apr 3, 2021	Hate Speech Detection	—Unverified	0
Exploring Stylometric and Emotion-Based Features for Multilingual Cross-Domain Hate Speech Detection	Apr 1, 2021	Hate Speech Detection	—Unverified	0
Zero-shot Cross-lingual Content Filtering: Offensive Language and Hate Speech Detection	Apr 1, 2021	Hate Speech Detection	—Unverified	0
Cross-Lingual Transfer Learning for Hate Speech Detection	Apr 1, 2021	Cross-Lingual TransferHate Speech Detection	—Unverified	0
HateBR: A Large Expert Annotated Corpus of Brazilian Instagram Comments for Offensive Language and Hate Speech Detection	Mar 27, 2021	BIG-bench Machine LearningBinary Classification	—Unverified	0
Detecting Hate Speech with GPT-3	Mar 23, 2021	Few-Shot LearningHate Speech Detection	CodeCode Available	1
Leveraging Multi-domain, Heterogeneous Data using Deep Multitask Learning for Hate Speech Detection	Mar 23, 2021	Hate Speech DetectionMulti-Task Learning	CodeCode Available	0
A Large-scale Dataset for Hate Speech Detection on Vietnamese Social Media Texts	Mar 22, 2021	Hate Speech DetectionVietnamese Hate Speech Detection	CodeCode Available	1
DeepHate: Hate Speech Detection via Multi-Faceted Text Representations	Mar 14, 2021	Hate Speech DetectionWord Embeddings	—Unverified	0
AngryBERT: Joint Learning Target and Emotion for Hate Speech Detection	Mar 14, 2021	Hate Speech DetectionSentiment Analysis	—Unverified	0
Interpretable Multi-Modal Hate Speech Detection	Mar 2, 2021	Hate Speech Detection	—Unverified	0
From Universal Language Model to Downstream Task: Improving RoBERTa-Based Vietnamese Hate Speech Detection	Feb 24, 2021	Hate Speech DetectionLanguage Modeling	—Unverified	0
Towards generalisable hate speech detection: a review on obstacles and solutions	Feb 17, 2021	Hate Speech Detection	—Unverified	0
Emoji-Based Transfer Learning for Sentiment Tasks	Feb 12, 2021	Hate Speech DetectionSentiment Analysis	CodeCode Available	0
Leveraging cross-platform data to improve automated hate speech detection	Feb 9, 2021	Hate Speech Detection	—Unverified	0
A study of text representations in Hate Speech Detection	Feb 8, 2021	Abusive LanguageHate Speech Detection	CodeCode Available	0
HASOCOne@FIRE-HASOC2020: Using BERT and Multilingual BERT models for Hate Speech Detection	Jan 22, 2021	Hate Speech DetectionTransfer Learning	CodeCode Available	0
Hostility Detection in Hindi leveraging Pre-Trained Language Models	Jan 14, 2021	Fake News DetectionHate Speech Detection	CodeCode Available	0
Leveraging Multilingual Transformers for Hate Speech Detection	Jan 8, 2021	feature selectionGeneral Classification	CodeCode Available	0
Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate Detection	Dec 31, 2020	Hate Speech Detection	CodeCode Available	1
HateCheck: Functional Tests for Hate Speech Detection Models	Dec 31, 2020	DiagnosticHate Speech Detection	CodeCode Available	1
Detecting Hate Speech in Multi-modal Memes	Dec 29, 2020	Binary ClassificationHate Speech Detection	CodeCode Available	1
DeepHateExplainer: Explainable Hate Speech Detection in Under-resourced Bengali Language	Dec 28, 2020	Hate Speech DetectionWord Embeddings	CodeCode Available	0
HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection	Dec 18, 2020	Hate Speech DetectionText Classification	CodeCode Available	1
Hate Speech detection in the Bengali language: A dataset and its baseline evaluation	Dec 17, 2020	Hate Speech Detection	—Unverified	0
You Are What You Tweet: Profiling Users by Past Tweets to Improve Hate Speech Detection	Dec 16, 2020	Hate Speech Detection	—Unverified	0
Hate Speech Detection in Saudi Twittersphere: A Deep Learning Approach	Dec 1, 2020	Deep LearningHate Speech Detection	—Unverified	0
HateGAN: Adversarial Generative-Based Data Augmentation for Hate Speech Detection	Dec 1, 2020	Data AugmentationHate Speech Detection	—Unverified	0
Learning Domain Terms - Empirical Methods to Enhance Enterprise Text Analytics Performance	Dec 1, 2020	Hate Speech Detection	—Unverified	0
DAPPER: Learning Domain-Adapted Persona Representation Using Pretrained BERT and External Memory	Dec 1, 2020	Hate Speech DetectionLanguage Modeling	—Unverified	0
Ssn\_nlp at SemEval 2020 Task 12: Offense Target Identification in Social Media Using Traditional and Deep Machine Learning Approaches	Dec 1, 2020	Hate Speech DetectionLanguage Identification	—Unverified	0
TheNorth at SemEval-2020 Task 12: Hate Speech Detection Using RoBERTa	Dec 1, 2020	Hate Speech Detection	—Unverified	0
Effect of Word Embedding Models on Hate and Offensive Speech Detection	Nov 23, 2020	Hate Speech Detection	—Unverified	0
An Online Multilingual Hate speech Recognition System	Nov 23, 2020	Hate Speech Detectionspeech-recognition	CodeCode Available	0
DeL-haTE: A Deep Learning Tunable Ensemble for Hate Speech Detection	Nov 3, 2020	Hate Speech DetectionTransfer Learning	CodeCode Available	0
Towards Code-switched Classification Exploiting Constituent Language Resources	Nov 3, 2020	ClassificationGeneral Classification	—Unverified	0
Impact of Politically Biased Data on Hate Speech Classification	Nov 1, 2020	ClassificationHate Speech Detection	CodeCode Available	0
In Data We Trust: A Critical Analysis of Hate Speech Detection Datasets	Nov 1, 2020	Hate Speech Detection	—Unverified	0
Investigating Annotator Bias with a Graph-Based Approach	Nov 1, 2020	BIG-bench Machine LearningCommunity Detection	CodeCode Available	0

Show:10 25 50

← PrevPage 8 of 11Next →

All datasets Ethos Binary HateXplain Ethos MultiLabel Waseem et al., 2018 AbusEval Automatic Misogynistic Identification HateMM HatEval OffensEval 2019 ToLD-Br bajer_danish_misogyny DKhate

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	BiLSTM + static BE	F1-score	0.8	—	Unverified
2	BERT	F1-score	0.79	—	Unverified
3	BiLSTM+Attention+FT	F1-score	0.77	—	Unverified
4	OPT-175B (few-shot)	F1-score	0.76	—	Unverified
5	CNN+Attention+FT+GV	F1-score	0.74	—	Unverified
6	OPT-175B (one-shot)	F1-score	0.71	—	Unverified
7	OPT-175B (zero-shot)	F1-score	0.67	—	Unverified
8	SVM	F1-score	0.66	—	Unverified
9	Random Forests	F1-score	0.64	—	Unverified
10	Davinci (zero-shot)	F1-score	0.63	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BERT-MRP	AUROC	0.86	—	Unverified
2	BERT-RP	AUROC	0.85	—	Unverified
3	BERT-HateXplain [LIME]	AUROC	0.85	—	Unverified
4	BERT-HateXplain [Attn]	AUROC	0.85	—	Unverified
5	BERT [Attn]	AUROC	0.84	—	Unverified
6	BiRNN-HateXplain [Attn]	AUROC	0.81	—	Unverified
7	BiRNN-Attn [Attn]	AUROC	0.8	—	Unverified
8	CNN-GRU [LIME]	AUROC	0.79	—	Unverified
9	BiRNN [LIME]	AUROC	0.77	—	Unverified
10	XG-HSI-BERT	Accuracy	0.75	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MLARAM	Hamming Loss	0.29	—	Unverified
2	MLkNN	Hamming Loss	0.16	—	Unverified
3	Binary Relevance	Hamming Loss	0.14	—	Unverified
4	Neural Classifier Chains	Hamming Loss	0.13	—	Unverified
5	Neural Binary Relevance	Hamming Loss	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Mozafari et al., 2019	AAA	50.94	—	Unverified
2	SVM	AAA	46.51	—	Unverified
3	Kennedy et al., 2020	AAA	45.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HateBERT	Macro F1	0.74	—	Unverified
2	BERT	Macro F1	0.72	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	mBert	Accuracy	0.83	—	Unverified
2	Logistic Regression	Accuracy	0.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HXP + CLAP + CLIP	TEST F1 (macro)	0.85	—	Unverified
2	BERT + ViT + MFCC	TEST F1 (macro)	0.79	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HateBERT	Macro F1	0.49	—	Unverified
2	BERT	Macro F1	0.48	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HateBERT	Macro F1	0.81	—	Unverified
2	BERT	Macro F1	0.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Multilingual BERT	F1-score	0.75	—	Unverified
2	AutoML	F1-score	0.74	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AOM mBERT	F1	0.85	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Baseline	F1	0.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RoBERTa-large-ST	Macro F1	80.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Baseline BERT (task A)	F1	0.77	—	Unverified