Poisoning attacks on algorithmic fairness

dc.contributor.authorSolans, David
dc.contributor.authorBiggio, Battista
dc.contributor.authorCastillo, Carlos
dc.date.accessioned2021-05-20T08:35:52Z
dc.date.available2021-05-20T08:35:52Z
dc.date.issued2020
dc.descriptionComunicació presentada al ECML PKDD 2020: Machine Learning and Knowledge Discovery in Databases, celebrat del 14 al 18 de setembre de 2020 a Gant, Bèlgica.
dc.description.abstractResearch in adversarial machine learning has shown how the performance of machine learning models can be seriously compromised by injecting even a small fraction of poisoning points into the training data. While the effects on model accuracy of such poisoning attacks have been widely studied, their potential effects on other model performance metrics remain to be evaluated. In this work, we introduce an optimization framework for poisoning attacks against algorithmic fairness, and develop a gradient-based poisoning attack aimed at introducing classification disparities among different groups in the data. We empirically show that our attack is effective not only in the white-box setting, in which the attacker has full access to the target model, but also in a more challenging black-box scenario in which the attacks are optimized against a substitute model and then transferred to the target model. We believe that our findings pave the way towards the definition of an entirely novel set of adversarial attacks targeting algorithmic fairness in different scenarios, and that investigating such vulnerabilities will help design more robust algorithms and countermeasures in the future.en
dc.description.sponsorshipThis research was supported by the European Commission through the ALOHAH2020 project. Also, we wish to acknowledge the usefulness of the Sec-ML library [17] for the execution of the experiments of this paper. C. Castillo thanks La Caixa project LCF/PR/PR16/11110009 for partial support. B. Biggio acknowledges that this work has been partly funded by BMK, BMDW, and the Province of Upper Austria in the frame of the COMET Programme managed by FFG in the COMET Module S3AI.
dc.format.mimetypeapplication/pdf
dc.identifier.citationSolans D, Biggio B, Castillo C. Poisoning attacks on algorithmic fairness. In: Hutter F, Kersting K, Lijffijt J, Valera I, editors. ECML PKDD 2020: Machine Learning and Knowledge Discovery in Databases; 2020 Sep 14-18; Ghent, Belgium. Cham: Springer; 2020. p. 162-77. (LNCS; no. 12457). DOI: 10.1007/978-3-030-67658-2_10
dc.identifier.doihttp://dx.doi.org/10.1007/978-3-030-67658-2_10
dc.identifier.urihttp://hdl.handle.net/10230/47626
dc.language.isoeng
dc.publisherSpringer
dc.relation.ispartofHutter F, Kersting K, Lijffijt J, Valera I, editors. ECML PKDD 2020: Machine Learning and Knowledge Discovery in Databases; 2020 Sep 14-18; Ghent, Belgium. Cham: Springer; 2020. p. 162-77. (LNCS; no. 12457)
dc.relation.projectIDinfo:eu-repo/grantAgreement/EC/H2020/780788
dc.rights© Springer The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-030-67658-2_10
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.subject.keywordAlgorithmic discriminationen
dc.subject.keywordAlgorithmic fairnessen
dc.subject.keywordPoisoning attacksen
dc.subject.keywordAdversarial machine learningen
dc.subject.keywordMachine learning securityen
dc.titlePoisoning attacks on algorithmic fairnessen
dc.typeinfo:eu-repo/semantics/conferenceObject
dc.type.versioninfo:eu-repo/semantics/acceptedVersion

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
solans_ecmlpkdd_poiso.pdf
Size:
957.71 KB
Format:
Adobe Portable Document Format