SOTAVerified

Bias Detection

Bias detection is the task of detecting and measuring racism, sexism and otherwise discriminatory behavior in a model (Source: https://stereoset.mit.edu/)

Papers

Showing 51100 of 199 papers

TitleStatusHype
GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative ModelsCode0
Uncovering Biases with Reflective Large Language Models0
Unboxing Occupational Bias: Grounded Debiasing of LLMs with U.S. Labor Data0
A Study on Bias Detection and Classification in Natural Language Processing0
Decoding Biases: Automated Methods and LLM Judges for Gender Bias Detection in Language Models0
The BIAS Detection Framework: Bias Detection in Word Embeddings and Language Models for European LanguagesCode0
BiasScanner: Automatic Detection and Classification of News Bias to Strengthen Democracy0
BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs0
A Review of the Challenges with Massive Web-mined Corpora Used in Large Language Models Pre-Training0
Epistemological Bias As a Means for the Automated Detection of Injustices in Text0
Seeing Like an AI: How LLMs Apply (and Misapply) Wikipedia Neutrality Norms0
Social Bias in Large Language Models For Bangla: An Empirical Study on Gender and Religious BiasCode0
A Study of Nationality Bias in Names and Perplexity using Off-the-Shelf Affect-related Tweet ClassifiersCode0
DocNet: Semantic Structure in Inductive Bias Detection Models0
Experiments in News Bias Detection with Pre-Trained Neural Transformers0
BEADs: Bias Evaluation Across Domains0
Evaluating AI fairness in credit scoring with the BRIO tool0
Gender Bias Detection in Court Decisions: A Brazilian Case StudyCode0
The Point of View of a Sentiment: Towards Clinician Bias Detection in Psychiatric Notes0
A Novel Method for News Article Event-Based Embedding0
DispaRisk: Auditing Fairness Through Usable InformationCode0
SynthesizRR: Generating Diverse Datasets with Retrieval AugmentationCode1
A Deep Dive into Effects of Structural Bias on CMA-ES Performance along Affine Trajectories0
Reinforcement Learning from Multi-role Debates as Feedback for Bias Mitigation in LLMs0
Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for HallucinationsCode1
OpenBias: Open-set Bias Detection in Text-to-Image Generative ModelsCode1
The Impact of Unstated Norms in Bias Analysis of Language Models0
Implications of the AI Act for Non-Discrimination Law and Algorithmic Fairness0
ChatGPT v.s. Media Bias: A Comparative Study of GPT-3.5 and Fine-tuned Language Models0
DeNetDM: Debiasing by Network Depth ModulationCode0
RuBia: A Russian Language Bias Detection DatasetCode0
MAGPIE: Multi-Task Media-Bias Analysis Generalization for Pre-Trained Identification of ExpressionsCode0
Decoding News Narratives: A Critical Analysis of Large Language Models in Framing Detection0
IndiVec: An Exploration of Leveraging Large Language Models for Media Bias Detection with Fine-Grained Bias IndicatorsCode0
New Job, New Gender? Measuring the Social Bias in Image Generation ModelsCode1
The Media Bias Taxonomy: A Systematic Literature Review on the Forms and Automated Detection of Media BiasCode0
Multilingual Bias Detection and Mitigation for Indian Languages0
Large Language Model (LLM) Bias Index -- LLMBI0
Extending Variability-Aware Model Selection with Bias Detection in Machine Learning Projects0
Current Topological and Machine Learning Applications for Bias Detection in Text0
Subtle Misogyny Detection and Mitigation: An Expert-Annotated Dataset0
Unmasking Bias in AI: A Systematic Review of Bias Detection and Mitigation Strategies in Electronic Health Record-based Models0
Target-Aware Contextual Political Bias Detection in News0
Unlocking Bias Detection: Leveraging Transformer-Based Models for Content Analysis0
Investigating Subtler Biases in LLMs: Ageism, Beauty, Institutional, and Nationality Bias in Generative ModelsCode0
Unsupervised Bias Detection in College Student Newspapers0
LUCID-GAN: Conditional Generative Models to Locate UnfairnessCode0
Auditing Predictive Models for Intersectional Biases0
The Hidden Language of Diffusion ModelsCode1
A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark DatasetsCode1
Show:102550
← PrevPage 2 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-2 (small)ICAT Score72.97Unverified
2XLNet (large)ICAT Score72.03Unverified
3GPT-2 (medium)ICAT Score71.73Unverified
4BERT (base)ICAT Score71.21Unverified
5GPT-2 (large)ICAT Score70.54Unverified
6BERT (large)ICAT Score69.89Unverified
7RoBERTa (base)ICAT Score67.5Unverified
8GAL 120BICAT Score65.6Unverified
9XLNet (base)ICAT Score62.1Unverified
10GPT-3 (text-davinci-002)ICAT Score60.8Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4Best-of0.5Unverified
2GemmaBest-of0.41Unverified
3BaselineBest-of0.41Unverified
4MistralBest-of0.36Unverified
5Llama2Best-of0.34Unverified
#ModelMetricClaimedVerifiedStatus
1BADICAT Score23.44Unverified
#ModelMetricClaimedVerifiedStatus
1RandomForest_default_hyperparametersAccuracy (%)49Unverified
#ModelMetricClaimedVerifiedStatus
1RoBERTa+ALBERTF170.4Unverified