Loading...
Mitigating implicit and explicit bias in structured data without sacrificing accuracy in pattern classification
Hoitsma,Fabian ; Nápoles,Gonzalo ; Güven,Çiçek ; Salgueiro,Yamisleydi
Hoitsma,Fabian
Nápoles,Gonzalo
Güven,Çiçek
Salgueiro,Yamisleydi
Abstract
Using biased data to train Artificial Intelligence (AI) algorithms will lead to biased decisions, discriminating against certain groups or individuals. Bias can be explicit (one or several protected features directly influence the decisions) or implicit (one or several protected features indirectly influence the decisions). Unsurprisingly, biased patterns are difficult to detect and mitigate. This paper investigates the extent to which explicit and implicit against one or more protected features in structured classification data sets can be mitigated simultaneously while retaining the data’s discriminatory power. The main contribution of this paper concerns an optimization-based bias mitigation method that reweights the training instances. The algorithm operates with numerical and nominal data and can mitigate implicit and explicit bias against several protected features simultaneously. The trade-off between bias mitigation and accuracy loss can be controlled using parameters in the objective function. The numerical simulations using real-world data sets show a reduction of up to 77% of implicit bias and a complete removal of explicit bias against protected features at no cost of accuracy of a wrapper classifier trained on the data. Overall, the proposed method outperforms the state-of-the-art bias mitigation methods for the selected data sets.
Description
Publisher Copyright: © The Author(s) 2024.
Date
2024-07-10
Journal Title
Journal ISSN
Volume Title
Publisher
Files
Loading...
mitigating_implicit.pdf
Adobe PDF, 1.98 MB
Research Projects
Organizational Units
Journal Issue
Keywords
Bias mitigation, Fair machine learning, Instance reweighting
Citation
Hoitsma, F, Nápoles, G, Güven, Ç & Salgueiro, Y 2024, 'Mitigating implicit and explicit bias in structured data without sacrificing accuracy in pattern classification', AI & Society: Knowledge, Culture and Communication - Springer Nature. https://doi.org/10.1007/s00146-024-02003-0
