MAAG Maria
Supervision : Patrick GALLINARI
Co-supervision : DENOYER Ludovic
Automatic Learning of Anonymization Functions for Graphs and Dynamic Graphs
Data privacy is a major concern that must be addressed before releasing datasets to the public, or even to a partner company that would compute statistics or perform an in-depth analysis of the data. Privacy is ensured by anonymizing the data, as required by legislation. Many anonymization techniques have been proposed in the literature, but they are difficult to apply in a general setting where attacks can be of different types and where the measures applied to the data are not known to the anonymizer. Generic methods able to adapt to different situations are therefore desirable.
We address the privacy problem for graph data that must, for various reasons, be made publicly available; this is the anonymized graph data publishing problem. We take the perspective of an anonymizer who does not have access to the methods used to analyze the data. We propose a generic methodology based on machine learning that learns an anonymization function directly from a set of training data so as to optimize a tradeoff between privacy risk and utility loss. The method thus yields a good anonymization procedure for any kind of attack and any characteristic in a given set. The methodology is instantiated for simple graphs and for complex timestamped graphs. A tool implementing the method has been developed and successfully tested on real anonymized datasets from Twitter, Enron and Amazon. Results are compared with baselines, and the proposed method is shown to be generic and to adapt automatically to different anonymization contexts.
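To make the privacy/utility tradeoff concrete, here is a minimal sketch in Python. It is not the method developed in the thesis: the anonymization function (random edge deletion with probability p), the degree-based attack, the edge-count utility measure, the weight lam and all function names are assumptions chosen for illustration, and a simple grid search over p stands in for the learning procedure.

```python
import random
from collections import Counter

def perturb(edges, p, rng):
    """Hypothetical anonymization function: drop each edge with probability p."""
    return [e for e in edges if rng.random() >= p]

def degrees(edges, nodes):
    """Degree of every node in an undirected edge list."""
    deg = {n: 0 for n in nodes}
    for u, v in edges:
        deg[u] += 1
        deg[v] += 1
    return deg

def privacy_risk(original, released, nodes):
    """Assumed attack: the adversary knows a target's original degree and
    re-identifies it if exactly one released node has that degree and that
    node is the target. Returns the fraction of re-identified nodes."""
    deg_orig = degrees(original, nodes)
    deg_rel = degrees(released, nodes)
    counts = Counter(deg_rel.values())
    hits = sum(1 for n in nodes
               if deg_rel[n] == deg_orig[n] and counts[deg_orig[n]] == 1)
    return hits / len(nodes)

def utility_loss(original, released):
    """Assumed utility measure: relative loss in the number of edges."""
    return 1.0 - len(released) / len(original)

def select_p(edges, nodes, grid, lam=1.0, trials=50, seed=0):
    """Pick the p in grid that minimizes mean(risk + lam * loss) over trials."""
    rng = random.Random(seed)
    scores = {}
    for p in grid:
        total = 0.0
        for _ in range(trials):
            released = perturb(edges, p, rng)
            total += (privacy_risk(edges, released, nodes)
                      + lam * utility_loss(edges, released))
        scores[p] = total / trials
    best = min(scores, key=scores.get)
    return best, scores

# Toy graph (illustrative data only).
nodes = list(range(6))
edges = [(0, 1), (0, 2), (0, 3), (1, 2), (3, 4), (4, 5)]
grid = [0.0, 0.25, 0.5, 0.75, 1.0]
best_p, scores = select_p(edges, nodes, grid, lam=0.5)
```

Raising lam penalizes utility loss more heavily and pushes the selected p toward weaker perturbation; swapping in a different attack or utility measure only requires replacing privacy_risk or utility_loss.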
Defence : 04/08/2015
Jury members :
Fabrice Rossi, Professor, Université Panthéon-Sorbonne (Reviewer)
Benjamin Nguyen, Professor, INSA Val de Loire (Reviewer)
Patrick Gallinari, Professor, Université Pierre et Marie Curie
Ludovic Denoyer, Professor, Université Pierre et Marie Curie
Bernd Amann, Professor, Université Pierre et Marie Curie
Maryline Laurent, Professor, Télécom SudParis
Philippe Jacquet, Research Director, Alcatel-Lucent Bell Labs
Hakim Hacid, Associate Professor, Zayed University, United Arab Emirates
2014-2015 Publications
-
2015
- M. Maag : “Apprentissage automatique de fonctions d’anonymisation pour les graphes et les graphes dynamiques”, thesis, PhD defence 04/08/2015, supervision : Gallinari, Patrick, co-supervision : Denoyer, Ludovic (2015)
-
2014
- M. Maag, L. Denoyer, P. Gallinari : “Graph Anonymization using Machine Learning”, 2014 IEEE 28th International Conference on Advanced Information Networking and Applications, Victoria, Canada, pp. 1111-1118 (2014)