Two Heads are Better than One: Nested PoE for Robust Defense Against Multi-Backdoors | Apr 2, 2024 | Data Poisoning, Hate Speech Detection | Code Available 0
NLP Systems That Can't Tell Use from Mention Censor Counterspeech, but Teaching the Distinction Helps | Apr 2, 2024 | Hate Speech Detection, Misinformation | Code Available 0
Securing Social Spaces: Harnessing Deep Learning to Eradicate Cyberbullying | Apr 1, 2024 | Deep Learning, Hate Speech Detection | Unverified 0
A Comprehensive Study on NLP Data Augmentation for Hate Speech Detection: Legacy Methods, BERT, and LLMs | Mar 30, 2024 | Data Augmentation, Hate Speech Detection | Unverified 0
Improving Adversarial Data Collection by Supporting Annotators: Lessons from GAHD, a German Hate Speech Dataset | Mar 28, 2024 | Hate Speech Detection | Code Available 0
NaijaHate: Evaluating Hate Speech Detection on Nigerian Twitter Using Representative Data | Mar 28, 2024 | Hate Speech Detection | Code Available 0
Towards Interpretable Hate Speech Detection using Large Language Model-extracted Rationales | Mar 19, 2024 | Hate Speech Detection, Language Modeling | Code Available 0
Exploring Tokenization Strategies and Vocabulary Sizes for Enhanced Arabic Language Models | Mar 17, 2024 | Computational Efficiency, Hate Speech Detection | Code Available 0
Harnessing Artificial Intelligence to Combat Online Hate: Exploring the Challenges and Opportunities of Large Language Models in Hate Speech Detection | Mar 12, 2024 | Hate Speech Detection, Sentiment Analysis | Unverified 0
CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean | Mar 11, 2024 | Hate Speech Detection | Code Available 1
Leveraging Weakly Annotated Data for Hate Speech Detection in Code-Mixed Hinglish: A Feasibility-Driven Transfer Learning Approach with Large Language Models | Mar 4, 2024 | Few-Shot Learning, Hate Speech Detection | Unverified 0
Subjective Isms? On the Danger of Conflating Hate and Offence in Abusive Language Detection | Mar 4, 2024 | Abusive Language, Hate Speech Detection | Unverified 0
Z-AGI Labs at ClimateActivism 2024: Stance and Hate Event Detection on Social Media | Feb 26, 2024 | Event Detection, Hate Speech Detection | Unverified 0
GPT-HateCheck: Can LLMs Write Better Functional Tests for Hate Speech Detection? | Feb 23, 2024 | Diagnostic, Hate Speech Detection | Code Available 0
MM-Soc: Benchmarking Multimodal Large Language Models in Social Media Platforms | Feb 21, 2024 | Benchmarking, Hate Speech Detection | Code Available 0
Don't Go To Extremes: Revealing the Excessive Sensitivity and Calibration Limitations of LLMs in Implicit Hate Speech Detection | Feb 18, 2024 | Fairness, Hate Speech Detection | Unverified 0
Whose Emotions and Moral Sentiments Do Language Models Reflect? | Feb 16, 2024 | Hate Speech Detection | Unverified 0
Exploring the Adversarial Capabilities of Large Language Models | Feb 14, 2024 | Hate Speech Detection, Text Generation | Unverified 0
Personalized Large Language Models | Feb 14, 2024 | Emotion Recognition, Hate Speech Detection | Code Available 2
Bryndza at ClimateActivism 2024: Stance, Target and Hate Event Detection via Retrieval-Augmented GPT-4 and LLaMA | Feb 9, 2024 | Event Detection, Hate Speech Detection | Code Available 4
Probing Critical Learning Dynamics of PLMs for Hate Speech Detection | Feb 3, 2024 | Benchmarking, Hate Speech Detection | Code Available 0
Identifying False Content and Hate Speech in Sinhala YouTube Videos by Analyzing the Audio | Jan 30, 2024 | Hate Speech Detection, Misinformation | Unverified 0
Analysis and Detection of Multilingual Hate Speech Using Transformer Based Deep Learning | Jan 19, 2024 | Hate Speech Detection | Unverified 0
Attentive Fusion: A Transformer-based Approach to Multimodal Hate Speech Detection | Jan 19, 2024 | Hate Speech Detection | Code Available 0
Multilingual acoustic word embeddings for zero-resource languages | Jan 19, 2024 | Hate Speech Detection, Keyword Spotting | Unverified 0
MetaHate: A Dataset for Unifying Efforts on Hate Speech Detection | Jan 12, 2024 | Hate Speech Detection | Code Available 0
An Investigation of Large Language Models for Real-World Hate Speech Detection | Jan 7, 2024 | Hate Speech Detection | Unverified 0
TuPy-E: detecting hate speech in Brazilian Portuguese social media with a novel dataset and comprehensive analysis of models | Dec 29, 2023 | Hate Speech Detection | Code Available 0
HCDIR: End-to-end Hate Context Detection, and Intensity Reduction model for online comments | Dec 20, 2023 | Hate Speech Detection, Language Modeling | Unverified 0
Hate Speech and Offensive Content Detection in Indo-Aryan Languages: A Battle of LSTM and Transformers | Dec 9, 2023 | Hate Speech Detection, Model Selection | Unverified 0
TurkishBERTweet: Fast and Reliable Large Language Model for Social Media Analysis | Nov 29, 2023 | Hate Speech Detection, Language Modeling | Code Available 1
Improving Cross-Domain Hate Speech Generalizability with Emotion Knowledge | Nov 24, 2023 | Hate Speech Detection | Code Available 0
Contextualizing Internet Memes Across Social Media Platforms | Nov 18, 2023 | Hate Speech Detection | Unverified 0
Latent Feature-based Data Splits to Improve Generalisation Evaluation: A Hate Speech Detection Case Study | Nov 16, 2023 | Hate Speech Detection | Code Available 0
Generative AI for Hate Speech Detection: Evaluation and Findings | Nov 16, 2023 | Hate Speech Detection, Text Generation | Unverified 0
GPT-4V(ision) as A Social Media Analysis Engine | Nov 13, 2023 | Hallucination, Hate Speech Detection | Code Available 0
Automatic Textual Normalization for Hate Speech Detection | Nov 12, 2023 | Hate Speech Detection, Lexical Normalization | Code Available 0
mahaNLP: A Marathi Natural Language Processing Library | Nov 5, 2023 | Hate Speech Detection, NER | Code Available 0
Explainable Identification of Hate Speech towards Islam using Graph Neural Networks | Nov 2, 2023 | Decoder, Hate Speech Detection | Unverified 0
HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning | Nov 1, 2023 | Hate Speech Detection | Code Available 1
LLMs and Finetuning: Benchmarking cross-domain performance for hate speech detection | Oct 29, 2023 | Benchmarking, Diversity | Unverified 0
Break it, Imitate it, Fix it: Robustness by Generating Human-Like Attacks | Oct 25, 2023 | Hate Speech Detection | Unverified 0
K-HATERS: A Hate Speech Detection Corpus in Korean with Target-Specific Ratings | Oct 24, 2023 | Hate Speech Detection | Code Available 1
GASCOM: Graph-based Attentive Semantic Context Modeling for Online Conversation Understanding | Oct 21, 2023 | Graph Attention, Hate Speech Detection | Unverified 0
Probing LLMs for hate speech detection: strengths and vulnerabilities | Oct 19, 2023 | Hate Speech Detection | Unverified 0
InterroLang: Exploring NLP Models and Datasets through Dialogue-based Explanations | Oct 9, 2023 | Dialogue Act Classification, Hate Speech Detection | Code Available 0
KoMultiText: Large-Scale Korean Text Dataset for Classifying Biased Speech in Real-World Online Services | Oct 6, 2023 | Hate Speech Detection, Multi-Task Learning | Code Available 1
Hate Speech Detection in Limited Data Contexts using Synthetic Data Generation | Oct 4, 2023 | Data Augmentation, Hate Speech Detection | Unverified 0
Harnessing Pre-Trained Sentence Transformers for Offensive Language Detection in Indian Languages | Oct 3, 2023 | Hate Speech Detection, Sentence | Unverified 0
Cordyceps@LT-EDI: Patching Language-Specific Homophobia/Transphobia Classifiers with a Multilingual Understanding | Sep 24, 2023 | Hate Speech Detection | Unverified 0