Privacy-preserving machine learning with tensor networks

ORAL

Abstract

Vast amounts of data are routinely processed in machine learning pipelines, every time covering more aspects of our interactions with the world. When the models are made public, is the safety of the data used for training them guaranteed? This is a crucial question, especially when processing sensitive data such as medical records. The state-of-the-art protection techniques, despite being deployed commercially, consist in adding noise at some stage during the training process, and thus imply a tradeoff between privacy protection and performance.

In this talk I will argue and practically illustrate that insights coming from the tensor network representations of quantum many-body states can help in devising better privacy-preserving machine learning algorithms. First, I will show that standard neural networks are vulnerable to a type of privacy leak that, notably, is a priori resistant to the standard protection mechanisms. Then, I will show that tensor networks, when used as machine learning architectures, are invulnerable to this leak. The proof of the resilience is based on the existence of canonical forms for such architectures. Given that tensor networks are recently showing to compete and even surpass traditional machine learning architectures in certain cases, these results imply that one may not have to be forced to make a choice between accuracy in prediction and ensuring the privacy of the information processed when using machine learning on sensitive data.

*This work is supported by the European Union (Horizon 2020 research and innovation programme-grant agreement No. 648913 and ERDF "A way of making Europe"), the Spanish Ministry of Science and Innovation ("Severo Ochoa Programme for Centres of Excellence in R&D" CEX2019-000904-S and ICMAT Severo Ochoa project SEV-2015-0554, and grants CEX2019-000904-S-20-4, MTM2014-54240-P, MTM2017-88385-P, PGC2018-098321-B-I00 and PID2020-113523GB-I00), the Spanish Ministry of Economic Affairs and Digital Transformation (project QUANTUM ENIA, as part of the Recovery, Transformation and Resilience Plan, funded by EU program NextGenerationEU), Comunidad de Madrid (PEJ-2021-AI/TIC-23267 and QUITEMAD-CM P2018/TCS-4342), and the CSIC Quantum Technologies Platform PTI-001.

Publication: arXiv:2202.12319

Presenters

  • Alejandro Pozas-Kerstjens

    • Institute of Mathematical Sciences

Authors

  • Alejandro Pozas-Kerstjens

    • Institute of Mathematical Sciences