SOTAVerified

Abuse Detection

Abuse detection is the task of identifying abusive behaviors, such as hate speech, offensive language, sexism and racism, in utterances from social media platforms (Source: https://arxiv.org/abs/1802.00385).

Papers

Showing 1–50 of 73 papers

Title | Status | Hype
Creating and Evaluating Code-Mixed Nepali-English and Telugu-English Datasets for Abusive Language Detection Using Traditional and Deep Learning Models | - | 0
Predictive Response Optimization: Using Reinforcement Learning to Fight Online Social Network Abuse | - | 0
A survey of textual cyber abuse detection using cutting-edge language models and large language models | - | 0
HP-BERT: A framework for longitudinal study of Hinduphobia on social media via LLMs | Code | 0
Towards Cross-Lingual Audio Abuse Detection in Low-Resource Settings with Few-Shot Learning | Code | 0
DetoxBench: Benchmarking Large Language Models for Multitask Fraud & Abuse Detection | - | 0
CoLLAB: A Collaborative Approach for Multilingual Abuse Detection | - | 0
Breaking the Silence Detecting and Mitigating Gendered Abuse in Hindi, Tamil, and Indian English Online Spaces | Code | 0
Overview of the 2023 ICON Shared Task on Gendered Abuse Detection in Indic Languages | - | 0
Voucher Abuse Detection with Prompt-based Fine-tuning on Graph Neural Networks | - | 0
Detection of Children Abuse by Voice and Audio Classification by Short-Time Fourier Transform Machine Learning implemented on Nvidia Edge GPU device | - | 0
TCAB: A Large-Scale Text Classification Attack Benchmark | Code | 0
Machine Generated Text: A Comprehensive Survey of Threat Models and Detection Methods | - | 0
Explainable Abuse Detection as Intent Classification and Slot Filling | Code | 0
Adversarial Robustness for Tabular Data through Cost and Utility Awareness | - | 0
Enriching Abusive Language Detection with Community Context | - | 0
Darkness can not drive out darkness: Investigating Bias in Hate Speech Detection Models | - | 0
DE-ABUSE@TamilNLP-ACL 2022: Transliteration as Data Augmentation for Abuse Detection in Tamil | - | 0
Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation Vectors | Code | 0
Multilingual and Multimodal Abuse Detection | - | 0
The Online Behaviour of the Algerian Abusers in Social Media Networks | - | 0
Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists | Code | 1
Abuse and Fraud Detection in Streaming Services Using Heuristic-Aware Machine Learning | - | 0
ADIMA: Abuse Detection In Multilingual Audio | Code | 0
Identifying Adversarial Attacks on Text Classifiers | - | 0
Toxicity Detection for Indic Multilingual Social Media Content | - | 0
Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation Vectors | - | 0
What Models Know About Their Attackers: Deriving Attacker Information From Latent Representations | - | 0
ConvAbuse: Data, Analysis, and Benchmarks for Nuanced Abuse Detection in Conversational AI | Code | 1
A Large-Scale English Multi-Label Twitter Dataset for Cyberbullying and Online Abuse Detection | - | 0
AAA: Fair Evaluation for Abuse Detection Systems Wanted | Code | 0
Generalisability of Topic Models in Cross-corpora Abusive Language Detection | - | 0
Modeling Users and Online Communities for Abuse Detection: A Position on Ethics and Explainability | - | 0
Confronting Abusive Language Online: A Survey from the Ethical and Human Rights Perspective | - | 0
AbuseAnalyzer: Abuse Detection, Severity and Target Prediction for Gab Posts | Code | 1
UoB at SemEval-2020 Task 12: Boosting BERT with Corpus Level Information | - | 0
KUISAIL at SemEval-2020 Task 12: BERT-CNN for Offensive Speech Identification in Social Media | Code | 1
Joint Modelling of Emotion and Abusive Language Detection | - | 0
Evaluating Performance of an Adult Pornography Classifier for Child Sexual Abuse Detection | Code | 0
Intersectional Bias in Hate Speech and Abusive Language Datasets | Code | 1
LIIR at SemEval-2020 Task 12: A Cross-Lingual Augmentation Approach for Multilingual Offensive Language Identification | - | 0
Multimodal Meme Dataset (MultiOFF) for Identifying Offensive Content in Image and Text | Code | 1
Kungfupanda at SemEval-2020 Task 12: BERT-Based Multi-Task Learning for Offensive Language Detection | Code | 1
WAC: A Corpus of Wikipedia Conversations for Online Abuse Detection | Code | 0
Stereotypical Bias Removal for Hate Speech Detection Task using Knowledge-based Generalizations | - | 0
HateMonitors: Language Agnostic Abuse Detection in Social Media | Code | 0
Tackling Online Abuse: A Survey of Automated Abuse Detection Methods | - | 0
Pay "Attention" to your Context when Classifying Abusive Language | Code | 0
Multi-label Hate Speech and Abusive Language Detection in Indonesian Twitter | Code | 0
Challenges and frontiers in abusive content detection | Code | 0

No leaderboard results yet.