Part I: Background on LMs
- Root Causes of Bias in LMs
Language Models (LMs) have demonstrated remarkable success across various domains over the years. However, despite their promising performance on various real world tasks, most of these algorithms lack fairness considerations, potentially leading to discriminatory outcomes against marginalized demographic groups and individuals. Many recent publications have explored ways to mitigate bias in LMs. Nevertheless, a comprehensive understanding of the root causes of bias, their effects, and possible limitations of LMs from the perspective of fairness is still in its early stages. To bridge this gap, this tutorial provides a systematic overview of recent advances in fair LMs, beginning with real-world case studies, followed by an analysis of bias causes. We then explore fairness concepts specific to LMs, summarizing bias evaluation strategies and algorithms designed to promote fairness. Finally, we analyze bias in LM datasets and discuss current research challenges and open questions in the field.
Check out our survey on fairness in large language models.
This tutorial is grounded in our surveys and established benchmarks, all available as open-source resources.