Hate Speech Detection

Hate speech detection is the task of determining whether communication, such as text or audio, expresses hatred or encourages violence towards a person or a group of people. Such content is usually based on prejudice against 'protected characteristics' such as ethnicity, gender, sexual orientation, religion, or age. Example benchmarks include ETHOS and HateXplain. Models are typically evaluated with metrics such as the F-score (also called the F-measure).
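As a rough illustration of the F-score mentioned above, the sketch below computes the F1-score (the harmonic mean of precision and recall) for a binary hateful / not-hateful labelling. The function name `f1_score` and the label lists are hypothetical and are not taken from any benchmark listed on this page.

```python
def f1_score(y_true, y_pred, positive=1):
    """Return the F1-score of the positive class for two equal-length label lists."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)  # true positives
    fp = sum(1 for t, p in pairs if t != positive and p == positive)  # false positives
    fn = sum(1 for t, p in pairs if t == positive and p != positive)  # false negatives
    if tp == 0:
        # No correct positive prediction: precision or recall is zero (or undefined).
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical gold labels and predictions (1 = hateful, 0 = not hateful).
gold = [1, 0, 1, 1, 0, 0, 1, 0]
pred = [1, 0, 0, 1, 1, 0, 1, 0]
score = f1_score(gold, pred)
print(round(score, 2))  # 3 TP, 1 FP, 1 FN -> precision = recall = 0.75 -> F1 = 0.75
```

Macro F1, which several of the benchmark tables report, would average this per-class score over all classes instead of using only the positive class.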

Papers

Showing 351–400 of 507 papers

Title	Date	Tasks	Status
Hate Speech Detection in Roman Urdu	Aug 5, 2021	Hate Speech Detection	Unverified
Improving Counterfactual Generation for Fair Hate Speech Detection	Aug 3, 2021	counterfactual, Fairness	Unverified
You too Brutus! Trapping Hateful Users in Social Media: Challenges, Solutions & Insights	Aug 1, 2021	Hate Speech Detection	Code Available
Exposing the limits of Zero-shot Cross-lingual Hate Speech Detection	Aug 1, 2021	Cross-Lingual Transfer, Hate Speech Detection	Unverified
Annotating Online Misogyny	Aug 1, 2021	Abusive Language, Hate Speech Detection	Code Available
Towards Argument Mining for Social Good: A Survey	Aug 1, 2021	Argument Mining, Fact Checking	Unverified
Detecting Abusive Albanian	Jul 28, 2021	Hate Speech Detection	Unverified
Unsupervised Domain Adaptation for Hate Speech Detection Using a Data Augmentation Approach	Jul 27, 2021	Cultural Vocal Bursts Intensity Prediction, Data Augmentation	Unverified
Independent Ethical Assessment of Text Classification Models: A Hate Speech Detection Case Study	Jul 19, 2021	counterfactual, Hate Speech Detection	Unverified
Hate speech detection using static BERT embeddings	Jun 29, 2021	Hate Speech Detection, Specificity	Unverified
Hate Speech Detection in Clubhouse	Jun 24, 2021	Hate Speech Detection	Unverified
Statistical Analysis of Perspective Scores on Hate Speech Detection	Jun 22, 2021	Hate Speech Detection	Unverified
AAA: Fair Evaluation for Abuse Detection Systems Wanted	Jun 21, 2021	Abuse Detection, Abusive Language	Code Available
An Information Retrieval Approach to Building Datasets for Hate Speech Detection	Jun 17, 2021	Active Learning, Explanation Generation	Code Available
Multitask Learning for Emotionally Analyzing Sexual Abuse Disclosures	Jun 1, 2021	Classification, Emotion Classification	Code Available
Improving Automatic Hate Speech Detection with Multiword Expression Features	Jun 1, 2021	Hate Speech Detection, Sentence	Unverified
Understanding and Interpreting the Impact of User Context in Hate Speech Detection	Jun 1, 2021	Hate Speech Detection	Unverified
Improving Cross-Domain Hate Speech Detection by Reducing the False Positive Rate	Jun 1, 2021	Blocking, Deep Learning	Unverified
Online Hate: Behavioural Dynamics and Relationship with Misinformation	May 28, 2021	Hate Speech Detection, Misinformation	Unverified
Data Expansion using Back Translation and Paraphrasing for Hate Speech Detection	May 25, 2021	Data Augmentation, Decoder	Unverified
A systematic review of Hate Speech automatic detection using Natural Language Processing	May 22, 2021	Deep Learning, Hate Speech Detection	Unverified
A Legal Approach to Hate Speech – Operationalizing the EU’s Legal Framework against the Expression of Hatred as an NLP Task	May 16, 2021	Decision Making, Hate Speech Detection	Unverified
Role of Artificial Intelligence in Detection of Hateful Speech for Hinglish Data on Social Media	May 11, 2021	Hate Speech Detection	Unverified
Towards A Multi-agent System for Online Hate Speech Detection	May 3, 2021	Hate Speech Detection	Unverified
Cross-lingual hate speech detection based on multilingual domain-specific word embeddings	Apr 30, 2021	Hate Speech Detection, Transfer Learning	Unverified
Contextual-Lexicon Approach for Abusive Language Detection	Apr 25, 2021	Abusive Language, Hate Speech Detection	Unverified
Sexism detection: The first corpus in Algerian dialect with a code-switching in Arabic/ French and English	Apr 3, 2021	Hate Speech Detection	Unverified
Cross-Lingual Transfer Learning for Hate Speech Detection	Apr 1, 2021	Cross-Lingual Transfer, Hate Speech Detection	Unverified
Zero-shot Cross-lingual Content Filtering: Offensive Language and Hate Speech Detection	Apr 1, 2021	Hate Speech Detection	Unverified
Exploring Stylometric and Emotion-Based Features for Multilingual Cross-Domain Hate Speech Detection	Apr 1, 2021	Hate Speech Detection	Unverified
HateBR: A Large Expert Annotated Corpus of Brazilian Instagram Comments for Offensive Language and Hate Speech Detection	Mar 27, 2021	BIG-bench Machine Learning, Binary Classification	Unverified
Leveraging Multi-domain, Heterogeneous Data using Deep Multitask Learning for Hate Speech Detection	Mar 23, 2021	Hate Speech Detection, Multi-Task Learning	Code Available
AngryBERT: Joint Learning Target and Emotion for Hate Speech Detection	Mar 14, 2021	Hate Speech Detection, Sentiment Analysis	Unverified
DeepHate: Hate Speech Detection via Multi-Faceted Text Representations	Mar 14, 2021	Hate Speech Detection, Word Embeddings	Unverified
Interpretable Multi-Modal Hate Speech Detection	Mar 2, 2021	Hate Speech Detection	Unverified
From Universal Language Model to Downstream Task: Improving RoBERTa-Based Vietnamese Hate Speech Detection	Feb 24, 2021	Hate Speech Detection, Language Modeling	Unverified
Towards generalisable hate speech detection: a review on obstacles and solutions	Feb 17, 2021	Hate Speech Detection	Unverified
Emoji-Based Transfer Learning for Sentiment Tasks	Feb 12, 2021	Hate Speech Detection, Sentiment Analysis	Code Available
Leveraging cross-platform data to improve automated hate speech detection	Feb 9, 2021	Hate Speech Detection	Unverified
A study of text representations in Hate Speech Detection	Feb 8, 2021	Abusive Language, Hate Speech Detection	Code Available
HASOCOne@FIRE-HASOC2020: Using BERT and Multilingual BERT models for Hate Speech Detection	Jan 22, 2021	Hate Speech Detection, Transfer Learning	Code Available
Hostility Detection in Hindi leveraging Pre-Trained Language Models	Jan 14, 2021	Fake News Detection, Hate Speech Detection	Code Available
Leveraging Multilingual Transformers for Hate Speech Detection	Jan 8, 2021	feature selection, General Classification	Code Available
DeepHateExplainer: Explainable Hate Speech Detection in Under-resourced Bengali Language	Dec 28, 2020	Hate Speech Detection, Word Embeddings	Code Available
Hate Speech detection in the Bengali language: A dataset and its baseline evaluation	Dec 17, 2020	Hate Speech Detection	Unverified
You Are What You Tweet: Profiling Users by Past Tweets to Improve Hate Speech Detection	Dec 16, 2020	Hate Speech Detection	Unverified
TheNorth at SemEval-2020 Task 12: Hate Speech Detection Using RoBERTa	Dec 1, 2020	Hate Speech Detection	Unverified
Ssn_nlp at SemEval 2020 Task 12: Offense Target Identification in Social Media Using Traditional and Deep Machine Learning Approaches	Dec 1, 2020	Hate Speech Detection, Language Identification	Unverified
DAPPER: Learning Domain-Adapted Persona Representation Using Pretrained BERT and External Memory	Dec 1, 2020	Hate Speech Detection, Language Modeling	Unverified
HateGAN: Adversarial Generative-Based Data Augmentation for Hate Speech Detection	Dec 1, 2020	Data Augmentation, Hate Speech Detection	Unverified

Datasets: Ethos Binary; HateXplain; Ethos MultiLabel; Waseem et al., 2018; AbusEval; Automatic Misogynistic Identification; HateMM; HatEval; OffensEval 2019; ToLD-Br; bajer_danish_misogyny; DKhate

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	BiLSTM + static BE	F1-score	0.8	—	Unverified
2	BERT	F1-score	0.79	—	Unverified
3	BiLSTM+Attention+FT	F1-score	0.77	—	Unverified
4	OPT-175B (few-shot)	F1-score	0.76	—	Unverified
5	CNN+Attention+FT+GV	F1-score	0.74	—	Unverified
6	OPT-175B (one-shot)	F1-score	0.71	—	Unverified
7	OPT-175B (zero-shot)	F1-score	0.67	—	Unverified
8	SVM	F1-score	0.66	—	Unverified
9	Random Forests	F1-score	0.64	—	Unverified
10	Davinci (zero-shot)	F1-score	0.63	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BERT-MRP	AUROC	0.86	—	Unverified
2	BERT-RP	AUROC	0.85	—	Unverified
3	BERT-HateXplain [LIME]	AUROC	0.85	—	Unverified
4	BERT-HateXplain [Attn]	AUROC	0.85	—	Unverified
5	BERT [Attn]	AUROC	0.84	—	Unverified
6	BiRNN-HateXplain [Attn]	AUROC	0.81	—	Unverified
7	BiRNN-Attn [Attn]	AUROC	0.8	—	Unverified
8	CNN-GRU [LIME]	AUROC	0.79	—	Unverified
9	BiRNN [LIME]	AUROC	0.77	—	Unverified
10	XG-HSI-BERT	Accuracy	0.75	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MLARAM	Hamming Loss	0.29	—	Unverified
2	MLkNN	Hamming Loss	0.16	—	Unverified
3	Binary Relevance	Hamming Loss	0.14	—	Unverified
4	Neural Classifier Chains	Hamming Loss	0.13	—	Unverified
5	Neural Binary Relevance	Hamming Loss	0.11	—	Unverified
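
For reading the Hamming Loss figures above: Hamming loss is the fraction of individual label slots predicted incorrectly in a multi-label task, so lower values are better. The sketch below is a minimal illustration with hypothetical data. The function name `hamming_loss` and the label matrices are assumptions and do not reproduce any of the listed models.

```python
def hamming_loss(y_true, y_pred):
    """Fraction of label slots that differ between two lists of binary label rows."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must contain the same number of rows")
    wrong = 0
    total = 0
    for true_row, pred_row in zip(y_true, y_pred):
        if len(true_row) != len(pred_row):
            raise ValueError("each pair of rows must have the same number of labels")
        wrong += sum(1 for t, p in zip(true_row, pred_row) if t != p)
        total += len(true_row)
    if total == 0:
        raise ValueError("no labels to compare")
    return wrong / total


# Hypothetical multi-label annotations: 3 comments, 4 possible labels each.
gold = [[1, 0, 0, 1], [0, 1, 0, 0], [1, 1, 0, 0]]
pred = [[1, 0, 1, 1], [0, 1, 0, 0], [0, 1, 0, 1]]
loss = hamming_loss(gold, pred)
print(loss)  # 3 wrong slots out of 12 -> 0.25
```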

#	Model	Metric	Claimed	Verified	Status
1	Mozafari et al., 2019	AAA	50.94	—	Unverified
2	SVM	AAA	46.51	—	Unverified
3	Kennedy et al., 2020	AAA	45.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HateBERT	Macro F1	0.74	—	Unverified
2	BERT	Macro F1	0.72	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	mBert	Accuracy	0.83	—	Unverified
2	Logistic Regression	Accuracy	0.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HXP + CLAP + CLIP	TEST F1 (macro)	0.85	—	Unverified
2	BERT + ViT + MFCC	TEST F1 (macro)	0.79	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HateBERT	Macro F1	0.49	—	Unverified
2	BERT	Macro F1	0.48	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HateBERT	Macro F1	0.81	—	Unverified
2	BERT	Macro F1	0.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Multilingual BERT	F1-score	0.75	—	Unverified
2	AutoML	F1-score	0.74	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AOM mBERT	F1	0.85	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Baseline	F1	0.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RoBERTa-large-ST	Macro F1	80.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Baseline BERT (task A)	F1	0.77	—	Unverified