SOTAVerified

Bias Detection

Bias detection is the task of detecting and measuring racism, sexism, and other discriminatory behavior in a model (Source: https://stereoset.mit.edu/).

Papers

Showing 101-150 of 199 papers

Title | Status | Hype
Decoding News Narratives: A Critical Analysis of Large Language Models in Framing Detection | - | 0
IndiVec: An Exploration of Leveraging Large Language Models for Media Bias Detection with Fine-Grained Bias Indicators | Code | 0
The Media Bias Taxonomy: A Systematic Literature Review on the Forms and Automated Detection of Media Bias | Code | 0
Multilingual Bias Detection and Mitigation for Indian Languages | - | 0
Large Language Model (LLM) Bias Index -- LLMBI | - | 0
Extending Variability-Aware Model Selection with Bias Detection in Machine Learning Projects | - | 0
Current Topological and Machine Learning Applications for Bias Detection in Text | - | 0
Subtle Misogyny Detection and Mitigation: An Expert-Annotated Dataset | - | 0
Unmasking Bias in AI: A Systematic Review of Bias Detection and Mitigation Strategies in Electronic Health Record-based Models | - | 0
Target-Aware Contextual Political Bias Detection in News | - | 0
Unlocking Bias Detection: Leveraging Transformer-Based Models for Content Analysis | - | 0
Investigating Subtler Biases in LLMs: Ageism, Beauty, Institutional, and Nationality Bias in Generative Models | Code | 0
Unsupervised Bias Detection in College Student Newspapers | - | 0
LUCID-GAN: Conditional Generative Models to Locate Unfairness | Code | 0
Auditing Predictive Models for Intersectional Biases | - | 0
Trade-Offs Between Fairness and Privacy in Language Modeling | Code | 0
Language-Agnostic Bias Detection in Language Models with Bias Probing | Code | 0
Bipol: A Novel Multi-Axes Bias Evaluation Metric with Explainability for NLP | Code | 0
Disentangling Structure and Style: Political Bias Detection in News by Inducing Document Hierarchy | Code | 0
Adding Instructions during Pretraining: Effective Way of Controlling Toxicity in Language Models | - | 0
Bipol: Multi-axes Evaluation of Bias with Explainability in Benchmark Datasets | Code | 0
Auditing Algorithmic Fairness in Machine Learning for Health with Severity-Based LOGAN | - | 0
Mind Your Bias: A Critical Review of Bias Detection Methods for Contextual Language Models | Code | 0
Exploiting Transformer-based Multitask Learning for the Detection of Media Bias in News Articles | - | 0
A Keyword Based Approach to Understanding the Overpenalization of Marginalized Groups by English Marginal Abuse Models on Twitter | - | 0
Efficient Gender Debiasing of Pre-trained Indic Language Models | - | 0
A Unified Comparison of User Modeling Techniques for Predicting Data Interaction and Detecting Exploration Bias | Code | 0
Robots Enact Malignant Stereotypes | - | 0
MRCLens: an MRC Dataset Bias Detection Toolkit | - | 0
A methodology to characterize bias and harmful stereotypes in natural language processing in Latin America | Code | 0
Incorporating Subjectivity into Gendered Ambiguous Pronoun (GAP) Resolution using Style Transfer | - | 0
HeteroCorpus: A Corpus for Heteronormative Language Detection | Code | 0
Personalized Detection of Cognitive Biases in Actions of Users from Their Logs: Anchoring and Recency Biases | - | 0
Towards WinoQueer: Developing a Benchmark for Anti-Queer Bias in Large Language Models | - | 0
Uncovering bias in the PlantVillage dataset | Code | 0
Beyond Explanation: A Case for Exploratory Text Visualizations of Non-Aggregated, Annotated Datasets | - | 0
A Domain-adaptive Pre-training Approach for Language Bias Detection in News | Code | 0
How sensitive are translation systems to extra contexts? Mitigating gender bias in Neural Machine Translation models through relevant contexts | Code | 0
Constructive Interpretability with CoLabel: Corroborative Integration, Complementary Features, and Collaborative Learning | - | 0
Towards Detecting Political Bias in Hindi News Articles | - | 0
A Meta Survey of Quality Evaluation Criteria in Explanation Methods | - | 0
Towards Identifying Social Bias in Dialog Systems: Frame, Datasets, and Benchmarks | - | 0
Modeling Multi-level Context for Informational Bias Detection by Contrastive Learning and Sentential Graph Network | - | 0
An Interdisciplinary Approach for the Automated Detection and Visualization of Media Bias in News Articles | - | 0
Forward Composition Propagation for Explainable Neural Reasoning | Code | 0
Quantifying Gender Biases Towards Politicians on Reddit | Code | 0
Towards A Reliable Ground-Truth For Biased Language Detection | - | 0
Sparse Interventions in Language Models with Differentiable Masking | - | 0
Anatomizing Bias in Facial Analysis | - | 0
MRCLens: an MRC Dataset Bias Detection Toolkit | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-2 (small) | ICAT Score | 72.97 | - | Unverified
2 | XLNet (large) | ICAT Score | 72.03 | - | Unverified
3 | GPT-2 (medium) | ICAT Score | 71.73 | - | Unverified
4 | BERT (base) | ICAT Score | 71.21 | - | Unverified
5 | GPT-2 (large) | ICAT Score | 70.54 | - | Unverified
6 | BERT (large) | ICAT Score | 69.89 | - | Unverified
7 | RoBERTa (base) | ICAT Score | 67.5 | - | Unverified
8 | GAL 120B | ICAT Score | 65.6 | - | Unverified
9 | XLNet (base) | ICAT Score | 62.1 | - | Unverified
10 | GPT-3 (text-davinci-002) | ICAT Score | 60.8 | - | Unverified
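
The ICAT Score reported above is not defined on this page. Assuming it is the Idealized CAT score from the StereoSet benchmark cited at the top, it combines a language modeling score (lms) and a stereotype score (ss), both on a 0-100 scale, as lms * min(ss, 100 - ss) / 50. The function name and the input values below are illustrative only and are not taken from the tables on this page; this is a minimal sketch, not the benchmark's own evaluation code.

```python
def icat_score(lms: float, ss: float) -> float:
    """Idealized CAT score, assuming the StereoSet definition.

    lms: language modeling score in [0, 100] (higher is better).
    ss:  stereotype score in [0, 100] (50 means no measured preference
         between stereotypical and anti-stereotypical options).
    """
    if not (0.0 <= lms <= 100.0 and 0.0 <= ss <= 100.0):
        raise ValueError("lms and ss must both lie in [0, 100]")
    # The score is highest when ss is exactly 50 and falls to 0
    # as ss approaches either 0 or 100.
    return lms * min(ss, 100.0 - ss) / 50.0


if __name__ == "__main__":
    # Hypothetical inputs, chosen only to show how the score behaves.
    print(icat_score(100.0, 50.0))   # ideal model
    print(icat_score(100.0, 100.0))  # always picks the stereotype
    print(icat_score(80.0, 60.0))    # mild stereotypical preference
```

Under this definition, a model scores well only if it both models language accurately and shows no systematic preference for stereotypical completions.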

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 | Best-of | 0.5 | - | Unverified
2 | Baseline | Best-of | 0.41 | - | Unverified
3 | Gemma | Best-of | 0.41 | - | Unverified
4 | Mistral | Best-of | 0.36 | - | Unverified
5 | Llama2 | Best-of | 0.34 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BAD | ICAT Score | 23.44 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RandomForest_default_hyperparameters | Accuracy (%) | 49 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RoBERTa+ALBERT | F1 | 70.4 | - | Unverified