Model Inversion Robustness: Can Transfer Learning Help?

Sy-Tuyen Ho1, Koh Jun Hao1, Keshigeyan Chandrasegaran2, Ngoc-Bao Nguyen1, Ngai-Man Cheung1
1Singapore University of Technology and Design 2Stanford University
CVPR 2024

Abstract

TL;DR: Transfer learning can serve as a defense against model inversion attacks: pre-train on public data and fine-tune only the last layers on the private dataset, so that fewer layers encode sensitive information, degrading MI attacks while preserving model utility.
Model Inversion (MI) attacks aim to reconstruct private training data by abusing access to machine learning models. Contemporary MI attacks have achieved impressive attack performance, posing serious threats to privacy. Meanwhile, all existing MI defense methods rely on regularization that is in direct conflict with the training objective, resulting in noticeable degradation in model utility. In this work, we take a different perspective and propose a novel and simple Transfer Learning-based Defense against Model Inversion (TL-DMI) to render MI-robust models. In particular, by leveraging TL, we limit the number of layers that encode sensitive information from the private training dataset, thereby degrading the performance of MI attacks. We conduct an analysis using Fisher Information to justify our method. Our defense is remarkably simple to implement. Without bells and whistles, we show in extensive experiments that TL-DMI achieves state-of-the-art (SOTA) MI robustness.
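
To make the recipe concrete, here is a minimal PyTorch sketch of the TL-DMI idea, under assumed choices that are ours rather than the paper's (ResNet-18 backbone, ImageNet pre-training as the public data, and fine-tuning only the last block plus the classifier head on the private dataset; the paper's exact freeze boundary and training setup may differ):

import torch
import torch.nn as nn
from torchvision import models

NUM_PRIVATE_CLASSES = 1000  # e.g., identities in the private task (assumed value)

# 1) Start from a backbone pre-trained on public data (here: ImageNet weights).
model = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)

# 2) Freeze all parameters so the early layers are never updated on private data.
for p in model.parameters():
    p.requires_grad = False

# 3) Unfreeze only the last block and a fresh classifier head: these are the
#    only layers that will encode information from the private training set.
for p in model.layer4.parameters():
    p.requires_grad = True
model.fc = nn.Linear(model.fc.in_features, NUM_PRIVATE_CLASSES)  # trainable by default

# 4) Fine-tune only the trainable parameters on the private dataset.
optimizer = torch.optim.SGD(
    (p for p in model.parameters() if p.requires_grad), lr=1e-2, momentum=0.9
)
criterion = nn.CrossEntropyLoss()

def finetune_step(images, labels):
    """One fine-tuning step on a private-data mini-batch."""
    optimizer.zero_grad()
    loss = criterion(model(images), labels)
    loss.backward()
    optimizer.step()
    return loss.item()

The key design choice is where to place the freeze boundary: the fewer layers fine-tuned on private data, the less sensitive information the model can encode, at some cost in utility.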

Method Overview

Figure: TL-DMI method overview, showing the transfer learning-based defense against model inversion.
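
The abstract mentions a Fisher Information (FI) analysis to justify the defense. As a rough illustration (our own construction, not the paper's code, and a crude batch-level approximation: the true empirical Fisher uses per-sample gradients), one can estimate a layer-wise empirical Fisher as the average squared gradient of the negative log-likelihood over private data:

import torch
import torch.nn.functional as F
from collections import defaultdict

def layerwise_fisher(model, loader, device="cpu"):
    """Batch-level empirical Fisher estimate, aggregated per top-level module."""
    model.eval()  # disable dropout/batch-norm updates; gradients still flow
    for p in model.parameters():
        p.requires_grad_(True)  # enable grads everywhere for analysis (no optimizer step is taken)
    fisher = defaultdict(float)
    num_batches = 0
    for images, labels in loader:
        images, labels = images.to(device), labels.to(device)
        model.zero_grad()
        loss = F.nll_loss(F.log_softmax(model(images), dim=1), labels)
        loss.backward()
        for name, p in model.named_parameters():
            if p.grad is not None:
                layer = name.split(".")[0]  # group parameters by top-level module
                fisher[layer] += p.grad.detach().pow(2).sum().item()
        num_batches += 1
    return {layer: v / num_batches for layer, v in fisher.items()}

Intuitively, layers whose parameters carry high FI with respect to the private data are the ones that encode it most strongly; TL-DMI confines such encoding to the fine-tuned tail of the network.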

BibTeX

@inproceedings{ho2024model,
  title={Model inversion robustness: Can transfer learning help?},
  author={Ho, Sy-Tuyen and Hao, Koh Jun and Chandrasegaran, Keshigeyan and Nguyen, Ngoc-Bao and Cheung, Ngai-Man},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={12183--12193},
  year={2024}
}