Hate Speech Detection
Hate speech detection is the task of determining whether a communication, such as text or audio, expresses hatred or encourages violence towards a person or a group of people. Such hatred is usually based on prejudice against 'protected characteristics' such as ethnicity, gender, sexual orientation, religion, or age. Example benchmarks include ETHOS and HateXplain. Models are commonly evaluated with metrics such as the F-score (also called the F-measure).
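To make the F-score concrete, here is a minimal sketch of how it could be computed for a binary hate / not-hate classifier. The function name `f_beta`, the label encoding (1 = hateful, 0 = not hateful), and the toy labels are assumptions made for illustration; they are not taken from ETHOS, HateXplain, or any particular benchmark's evaluation script.

```python
def f_beta(y_true, y_pred, beta=1.0, positive=1):
    """F-beta score for one positive class; beta=1 gives the usual F1."""
    pairs = list(zip(y_true, y_pred))
    # Count true positives, false positives and false negatives.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No correct positive predictions: precision or recall is zero.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    b2 = beta ** 2
    # Weighted harmonic mean of precision and recall.
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Hypothetical gold labels and predictions (1 = hateful, 0 = not hateful).
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 1, 0, 1, 0]

print(f"F1 = {f_beta(y_true, y_pred):.2f}")
```

With these toy labels there are 3 true positives, 1 false positive, and 1 false negative, so precision and recall are both 0.75 and so is F1. Benchmarks with more than two classes typically report a macro- or weighted-average of per-class scores, which this sketch does not cover.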
Papers
Datasets: Ethos Binary; HateXplain; Ethos MultiLabel; Waseem et al., 2018; AbusEval; Automatic Misogynistic Identification; HateMM; HatEval; OffensEval 2019; ToLD-Br; bajer_danish_misogyny; DKhate
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | AOM mBERT | F1 | 0.85 | — | Unverified |