SOTAVerified

Hate Speech Detection

Hate speech detection is the task of detecting if communication such as text, audio, and so on contains hatred and or encourages violence towards a person or a group of people. This is usually based on prejudice against 'protected characteristics' such as their ethnicity, gender, sexual orientation, religion, age et al. Some example benchmarks are ETHOS and HateXplain. Models can be evaluated with metrics like the F-score or F-measure.

Papers

Showing 110 of 507 papers

Show:102550
← PrevPage 1 of 51Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1BiLSTM + static BEF1-score0.8Unverified
2BERTF1-score0.79Unverified
3BiLSTM+Attention+FTF1-score0.77Unverified
4OPT-175B (few-shot)F1-score0.76Unverified
5CNN+Attention+FT+GVF1-score0.74Unverified
6OPT-175B (one-shot)F1-score0.71Unverified
7OPT-175B (zero-shot)F1-score0.67Unverified
8SVMF1-score0.66Unverified
9Random ForestsF1-score0.64Unverified
10Davinci (zero-shot)F1-score0.63Unverified