arXiv:2407.09485 Abstract | arXiv Analytics

arXiv:2407.09485 [cs.HC]Abstract References Reviews Resources

Representation Debiasing of Generated Data Involving Domain Experts

Aditya Bhattacharya, Simone Stumpf, Katrien Verbert

Published 2024-05-17Version 1

Biases in Artificial Intelligence (AI) or Machine Learning (ML) systems due to skewed datasets problematise the application of prediction models in practice. Representation bias is a prevalent form of bias found in the majority of datasets. This bias arises when training data inadequately represents certain segments of the data space, resulting in poor generalisation of prediction models. Despite AI practitioners employing various methods to mitigate representation bias, their effectiveness is often limited due to a lack of thorough domain knowledge. To address this limitation, this paper introduces human-in-the-loop interaction approaches for representation debiasing of generated data involving domain experts. Our work advocates for a controlled data generation process involving domain experts to effectively mitigate the effects of representation bias. We argue that domain experts can leverage their expertise to assess how representation bias affects prediction models. Moreover, our interaction approaches can facilitate domain experts in steering data augmentation algorithms to produce debiased augmented data and validate or refine the generated samples to reduce representation bias. We also discuss how these approaches can be leveraged for designing and developing user-centred AI systems to mitigate the impact of representation bias through effective collaboration between domain experts and AI.

Comments: Pre-print of a paper accepted for ACM UMAP 2024

Journal: Adjunct Proceedings of the 32nd ACM Conference on User Modeling, Adaptation and Personalization (UMAP Adjunct '24), July 1--4, 2024, Cagliari, Italy

DOI: 10.1145/3631700.3664910

Categories: cs.HC

Keywords: domain experts, generated data, representation debiasing, representation bias affects prediction models, interaction approaches

Tags: journal article

Related articles: Most relevant | Search more

arXiv:2501.01441 [cs.HC] (Published 2024-12-26)

Explanatory Debiasing: Involving Domain Experts in the Data Generation Process to Mitigate Representation Bias in AI Systems

Aditya Bhattacharya, Simone Stumpf, Robin De Croon, Katrien Verbert

arXiv:2412.01024 [cs.HC] (Published 2024-12-02, updated 2024-12-03)

Towards Understanding the Impact of Guidance in Data Visualization Systems for Domain Experts

Sherry Qiu, Holly Rushmeier, Kim R. M. Blenman

arXiv:2308.10795 [cs.HC] (Published 2023-08-21)