arXiv:2409.09569 Abstract | arXiv Analytics

arXiv:2409.09569 [cs.LG]Abstract References Reviews Resources

Bias Begets Bias: The Impact of Biased Embeddings on Diffusion Models

Sahil Kuchlous, Marvin Li, Jeffrey G. Wang

Published 2024-09-15Version 1

With the growing adoption of Text-to-Image (TTI) systems, the social biases of these models have come under increased scrutiny. Herein we conduct a systematic investigation of one such source of bias for diffusion models: embedding spaces. First, because traditional classifier-based fairness definitions require true labels not present in generative modeling, we propose statistical group fairness criteria based on a model's internal representation of the world. Using these definitions, we demonstrate theoretically and empirically that an unbiased text embedding space for input prompts is a necessary condition for representationally balanced diffusion models, meaning the distribution of generated images satisfy diversity requirements with respect to protected attributes. Next, we investigate the impact of biased embeddings on evaluating the alignment between generated images and prompts, a process which is commonly used to assess diffusion models. We find that biased multimodal embeddings like CLIP can result in lower alignment scores for representationally balanced TTI models, thus rewarding unfair behavior. Finally, we develop a theoretical framework through which biases in alignment evaluation can be studied and propose bias mitigation methods. By specifically adapting the perspective of embedding spaces, we establish new fairness conditions for diffusion model development and evaluation.

Comments: 19 pages, 4 figures

Categories: cs.LG, cs.CV, cs.CY

Keywords: diffusion model, bias begets bias, biased embeddings, embedding space, generated images satisfy diversity requirements

Related articles: Most relevant | Search more

arXiv:2211.13449 [cs.LG] (Published 2022-11-24)

Fast Sampling of Diffusion Models via Operator Learning

Hongkai Zheng, Weili Nie, Arash Vahdat, Kamyar Azizzadenesheli, Anima Anandkumar

arXiv:2407.03153 [cs.LG] (Published 2024-06-09)

Efficient Shapley Values for Attributing Global Properties of Diffusion Models to Data Group

Chris Lin, Mingyu Lu, Chanwoo Kim, Su-In Lee

arXiv:2405.20971 [cs.LG] (Published 2024-05-31)

Amortizing intractable inference in diffusion models for vision, language, and control

Siddarth Venkatraman et al.

arXiv Analytics

arXiv:2409.09569 [cs.LG]Abstract References Reviews Resources

Bias Begets Bias: The Impact of Biased Embeddings on Diffusion Models

Links

Toolbox

arXiv:2409.09569 [cs.LG]AbstractReferencesReviewsResources

Bias Begets Bias: The Impact of Biased Embeddings on Diffusion Models

Links

Toolbox

arXiv:2409.09569 [cs.LG]Abstract References Reviews Resources