OPT: Open Pre-trained Transformer Language Models | May 2, 2022 | Decoder, Hate Speech Detection
Bryndza at ClimateActivism 2024: Stance, Target and Hate Event Detection via Retrieval-Augmented GPT-4 and LLaMA | Feb 9, 2024 | Event Detection, Hate Speech Detection
ToxiGen: A Large-Scale Machine-Generated Dataset for Adversarial and Implicit Hate Speech Detection | Mar 17, 2022 | Hate Speech Detection, Language Modelling
Personalized Large Language Models | Feb 14, 2024 | Emotion Recognition, Hate Speech Detection
Public Wisdom Matters! Discourse-Aware Hyperbolic Fourier Co-Attention for Social-Text Classification | Sep 15, 2022 | Abstract Meaning Representation, Fake News Detection
TurkishBERTweet: Fast and Reliable Large Language Model for Social Media Analysis | Nov 29, 2023 | Hate Speech Detection, Language Modeling
Vietnamese Hate and Offensive Detection using PhoBERT-CNN and Social Media Streaming Data | Jun 1, 2022 | Hate Speech Detection, Vietnamese Hate Speech Detection
Interpretable Unified Language Checking | Apr 7, 2023 | Fact Checking, Fairness
KoMultiText: Large-Scale Korean Text Dataset for Classifying Biased Speech in Real-World Online Services | Oct 6, 2023 | Hate Speech Detection, Multi-Task Learning
Large-Scale Hate Speech Detection with Cross-Domain Transfer | Mar 2, 2022 | Hate Speech Detection, Transfer Learning
Multilingual Twitter Corpus and Baselines for Evaluating Demographic Bias in Hate Speech Recognition | Feb 24, 2020 | Document Classification, Fairness
NLPositionality: Characterizing Design Biases of Datasets and Models | Jun 2, 2023 | Hate Speech Detection
STATE ToxiCN: A Benchmark for Span-level Target-Aware Toxicity Extraction in Chinese Hate Speech Detection | Jan 26, 2025 | Hate Speech Detection
A Federated Approach for Hate Speech Detection | Feb 18, 2023 | Hate Speech Detection
Facilitating Fine-grained Detection of Chinese Toxic Language: Hierarchical Taxonomy, Resources, and Benchmarks | May 8, 2023 | Hate Speech Detection
HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning | Nov 1, 2023 | Hate Speech Detection
Enhancing Social Network Hate Detection Using Back Translation and GPT-3 Augmentations During Training and Test-Time | Jun 13, 2023 | Hate Speech Detection, Hate Speech Normalization
Generalizable Implicit Hate Speech Detection Using Contrastive Learning | Oct 1, 2022 | Contrastive Learning, Hate Speech Detection
Hate Speech Detection Based on Sentiment Knowledge Sharing | Aug 1, 2021 | Hate Speech Detection, Sentence
HONEST: Measuring Hurtful Sentence Completion in Language Models | Jun 1, 2021 | Hate Speech Detection, Hurtful Sentence Completion
K-HATERS: A Hate Speech Detection Corpus in Korean with Target-Specific Ratings | Oct 24, 2023 | Hate Speech Detection
K-MHaS: A Multi-label Hate Speech Detection Dataset in Korean Online News Comment | Aug 23, 2022 | Hate Speech Detection, Multi-Label Classification
Multi3Hate: Multimodal, Multilingual, and Multicultural Hate Speech Detection with Vision-Language Models | Nov 6, 2024 | Hate Speech Detection, Navigate
Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models | Jun 20, 2022 | Diagnostic, Hate Speech Detection
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter | Oct 2, 2019 | Hate Speech Detection, Knowledge Distillation
Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists | Mar 17, 2022 | Abuse Detection, Bias Detection
Deep Learning Models for Multilingual Hate Speech Detection | Apr 14, 2020 | Deep Learning, Hate Speech Detection
Detecting Hate Speech in Multi-modal Memes | Dec 29, 2020 | Binary Classification, Hate Speech Detection
HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns | Jan 28, 2025 | Adversarial Attack, Benchmarking
Toxic Language Detection in Social Media for Brazilian Portuguese: New Dataset and Multilingual Analysis | Oct 9, 2020 | Hate Speech Detection, Multi-Label Classification
Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate Detection | Dec 31, 2020 | Hate Speech Detection
Comparative Studies of Detecting Abusive Language on Twitter | Aug 30, 2018 | Abuse Detection, Abusive Language
Aggression and Misogyny Detection using BERT: A Multi-Task Approach | May 1, 2020 | Abusive Language, Aggression Identification
Enhancing social network hate detection using back translation and GPT-3 augmentations during training and test-time | Jun 17, 2023 | Hate Speech Detection
A Comprehensive Dataset for German Offensive Language and Conversation Analysis | Jul 1, 2022 | Hate Speech Detection
ETHOS: an Online Hate Speech Detection Dataset | Jun 11, 2020 | Hate Speech Detection
BEEP! Korean Corpus of Online News Comments for Toxic Speech Detection | May 26, 2020 | Hate Speech Detection
Few-shot Learning with Multilingual Language Models | Dec 20, 2021 | Cross-Lingual Transfer, Few-Shot Learning
HateCheck: Functional Tests for Hate Speech Detection Models | Dec 31, 2020 | Diagnostic, Hate Speech Detection
HateMM: A Multi-Modal Dataset for Hate Video Classification | May 6, 2023 | Classification, Hate Speech Detection
A Large-scale Dataset for Hate Speech Detection on Vietnamese Social Media Texts | Mar 22, 2021 | Hate Speech Detection, Vietnamese Hate Speech Detection
HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection | Dec 18, 2020 | Hate Speech Detection, Text Classification
APEACH: Attacking Pejorative Expressions with Analysis on Crowd-Generated Hate Speech Evaluation Datasets | Feb 25, 2022 | Hate Speech Detection
CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean | Mar 11, 2024 | Hate Speech Detection
AraCOVID19-MFH: Arabic COVID-19 Multi-label Fake News and Hate Speech Detection Dataset | May 7, 2021 | Articles, Dialect Identification
Classification Benchmarks for Under-resourced Bengali Language based on Multichannel Convolutional-LSTM Network | Apr 11, 2020 | Articles, Classification
Deep Learning for Hate Speech Detection: A Comparative Study | Feb 19, 2022 | Computational Efficiency, Deep Learning
Detecting Hate Speech with GPT-3 | Mar 23, 2021 | Few-Shot Learning, Hate Speech Detection
MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube and Bilibili | Jul 28, 2024 | Hate Speech Detection, Video Classification
Why Is It Hate Speech? Masked Rationale Prediction for Explainable Hate Speech Detection | Nov 1, 2022 | Hate Speech Detection, Sentence