SOTAVerified

Bias Detection

Bias detection is the task of detecting and measuring racism, sexism, and other discriminatory behavior in a model. (Source: https://stereoset.mit.edu/)

Papers

Showing 51-100 of 199 papers

| Title | Status | Hype |
| --- | --- | --- |
| How Neural Networks Organize Concepts: Introducing Concept Trajectory Analysis for Deep Learning Interpretability | Code | 0 |
| How sensitive are translation systems to extra contexts? Mitigating gender bias in Neural Machine Translation models through relevant contexts | Code | 0 |
| IFBiD: Inference-Free Bias Detection | Code | 0 |
| IndiVec: An Exploration of Leveraging Large Language Models for Media Bias Detection with Fine-Grained Bias Indicators | Code | 0 |
| Investigating Subtler Biases in LLMs: Ageism, Beauty, Institutional, and Nationality Bias in Generative Models | Code | 0 |
| Language-Agnostic Bias Detection in Language Models with Bias Probing | Code | 0 |
| LOGAN: Local Group Bias Detection by Clustering | Code | 0 |
| LUCID-GAN: Conditional Generative Models to Locate Unfairness | Code | 0 |
| debiaSAE: Benchmarking and Mitigating Vision-Language Model Bias | Code | 0 |
| Measuring Gender Bias in Word Embeddings across Domains and Discovering New Gender Bias Word Categories | Code | 0 |
| Mind Your Bias: A Critical Review of Bias Detection Methods for Contextual Language Models | Code | 0 |
| Mitigating Bias in Queer Representation within Large Language Models: A Collaborative Agent Approach | Code | 0 |
| Multilingual sentence-level bias detection in Wikipedia | Code | 0 |
| MAGPIE: Multi-Task Media-Bias Analysis Generalization for Pre-Trained Identification of Expressions | Code | 0 |
| My Approach = Your Apparatus? Entropy-Based Topic Modeling on Multiple Domain-Specific Text Collections | Code | 0 |
| NewB: 200,000+ Sentences for Political Bias Detection | Code | 0 |
| Predicting the Leading Political Ideology of YouTube Channels Using Acoustic, Textual, and Metadata Information | Code | 0 |
| Quantifying Gender Biases Towards Politicians on Reddit | Code | 0 |
| Robust Bias Detection in MLMs and its Application to Human Trait Ratings | Code | 0 |
| RuBia: A Russian Language Bias Detection Dataset | Code | 0 |
| Second Order WinoBias (SoWinoBias) Test Set for Latent Gender Bias Detection in Coreference Resolution | Code | 0 |
| Social Bias in Large Language Models For Bangla: An Empirical Study on Gender and Religious Bias | Code | 0 |
| The BIAS Detection Framework: Bias Detection in Word Embeddings and Language Models for European Languages | Code | 0 |
| The Media Bias Taxonomy: A Systematic Literature Review on the Forms and Automated Detection of Media Bias | Code | 0 |
| The Promises and Pitfalls of LLM Annotations in Dataset Labeling: a Case Study on Media Bias Detection | Code | 0 |
| TinyEmo: Scaling down Emotional Reasoning via Metric Projection | Code | 0 |
| To Bias or Not to Bias: Detecting bias in News with bias-detector | Code | 0 |
| Towards Automatic Bias Detection in Knowledge Graphs | Code | 0 |
| Towards Detection of Subjective Bias using Contextualized Word Embeddings | Code | 0 |
| Towards Implicit Bias Detection and Mitigation in Multi-Agent LLM Interactions | Code | 0 |
| Trade-Offs Between Fairness and Privacy in Language Modeling | Code | 0 |
| Uncovering bias in the PlantVillage dataset | Code | 0 |
| ViLBias: A Comprehensive Framework for Bias Detection through Linguistic and Visual Cues, presenting Annotation Strategies, Evaluation, and Key Challenges | Code | 0 |
| Evaluating Fairness Metrics in the Presence of Dataset Bias | | 0 |
| Experiments in News Bias Detection with Pre-Trained Neural Transformers | | 0 |
| Auditing Algorithmic Fairness in Machine Learning for Health with Severity-Based LOGAN | | 0 |
| Auditing a Dutch Public Sector Risk Profiling Algorithm Using an Unsupervised Bias Detection Tool | | 0 |
| Exploiting Transformer-based Multitask Learning for the Detection of Media Bias in News Articles | | 0 |
| A Survey on Predicting the Factuality and the Bias of News Media | | 0 |
| Extending Variability-Aware Model Selection with Bias Detection in Machine Learning Projects | | 0 |
| Fair Is Better than Sensational: Man Is to Doctor as Woman Is to Doctor | | 0 |
| Sexism in the Judiciary | | 0 |
| Sexism in the Judiciary: The Importance of Bias Definition in NLP and In Our Courts | | 0 |
| Fairness via AI: Bias Reduction in Medical Information | | 0 |
| FairT2I: Mitigating Social Bias in Text-to-Image Generation via Large Language Model-Assisted Detection and Attribute Rebalancing | | 0 |
| Fine-Grained Bias Detection in LLM: Enhancing detection mechanisms for nuanced biases | | 0 |
| Towards WinoQueer: Developing a Benchmark for Anti-Queer Bias in Large Language Models | | 0 |
| Sparse Interventions in Language Models with Differentiable Masking | | 0 |
| A Study on Bias Detection and Classification in Natural Language Processing | | 0 |
| A Deep Dive into Effects of Structural Bias on CMA-ES Performance along Affine Trajectories | | 0 |

Benchmark Results

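| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-2 (small) | ICAT Score | 72.97 | | Unverified |
| 2 | XLNet (large) | ICAT Score | 72.03 | | Unverified |
| 3 | GPT-2 (medium) | ICAT Score | 71.73 | | Unverified |
| 4 | BERT (base) | ICAT Score | 71.21 | | Unverified |
| 5 | GPT-2 (large) | ICAT Score | 70.54 | | Unverified |
| 6 | BERT (large) | ICAT Score | 69.89 | | Unverified |
| 7 | RoBERTa (base) | ICAT Score | 67.5 | | Unverified |
| 8 | GAL 120B | ICAT Score | 65.6 | | Unverified |
| 9 | XLNet (base) | ICAT Score | 62.1 | | Unverified |
| 10 | GPT-3 (text-davinci-002) | ICAT Score | 60.8 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 | Best-of | 0.5 | | Unverified |
| 2 | Baseline | Best-of | 0.41 | | Unverified |
| 3 | Gemma | Best-of | 0.41 | | Unverified |
| 4 | Mistral | Best-of | 0.36 | | Unverified |
| 5 | Llama2 | Best-of | 0.34 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | BAD | ICAT Score | 23.44 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | RandomForest_default_hyperparameters | Accuracy (%) | 49 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | RoBERTa+ALBERT | F1 | 70.4 | | Unverified |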
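
Two of the tables above report an ICAT Score. In the StereoSet benchmark cited in the task description, the Idealized CAT score combines a language modeling score (lms) and a stereotype score (ss), both percentages, as lms * min(ss, 100 - ss) / 50. The sketch below shows that arithmetic only. The function name `icat_score` and the input validation are assumptions for illustration, not part of any official tooling, and this page does not state how the listed scores were computed.

```python
def icat_score(lms: float, ss: float) -> float:
    """Idealized CAT score: lms * min(ss, 100 - ss) / 50.

    lms: language modeling score, a percentage in [0, 100].
    ss:  stereotype score, a percentage in [0, 100]; 50 means the model
         shows no preference between stereotypical and anti-stereotypical
         options.
    """
    # Hypothetical guard: both inputs are assumed to be percentages.
    if not (0.0 <= lms <= 100.0 and 0.0 <= ss <= 100.0):
        raise ValueError("lms and ss must be percentages in [0, 100]")
    # min(ss, 100 - ss) peaks at ss = 50, so bias in either direction
    # lowers the score; dividing by 50 rescales the result to [0, 100].
    return lms * min(ss, 100.0 - ss) / 50.0


if __name__ == "__main__":
    print(icat_score(100.0, 50.0))  # ideal model
    print(icat_score(80.0, 60.0))   # decent language model, some bias
```

An ideal model (lms = 100, ss = 50) scores 100, while a model that always picks the stereotypical option (ss = 100) scores 0 regardless of its language modeling ability.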