PUGET Raphaël
Supervision : Patrick GALLINARI
Co-supervision : BASKIOTIS Nicolas
Etude de la classification dans un très grand nombre de catégories
The increase in volume of the data nowadays is at the origin of new problematics for which machine learning does not possess adapted answers.
The usual classification task which requires to assign one or more classes to an example is extended to problems with thousands or even millions of different classes.
Those problems bring new research fields like the complexity reduction of the classification process.
That classification process has a complexity usually linear with the number of classes of the problem, which can be an issue if the number of classes is too large.
Various ways to deal with those new problems have emerged like the construction of a hierarchy of classifiers or the adaptation of ECOC ensemble methods.
The work presented here describes two new methods to answer this extreme classification task.
The first one consists in a new asymmetrical measure to help the partitioning and the hierarchisation of the classes in order to build a classes tree.
The second one proposes a sequential way to aggregate effectively the most interesting classifiers.
Defence : 07/04/2016
Jury members :
Massih-Reza Amini, Laboratoire d'Informatique de Grenoble [Rapporteur]
Marc Tommasi, INRIA Lille [Rapporteur]
Nicolas Baskiotis, Université Pierre et Marie Curie
Patrick Gallinari, Université Pierre et Marie Curie
Marie-Jeanne Lesot, Université Pierre et Marie Curie
Jérémie Mary, INRIA Lille
2014-2020 Publications
-
2020
- P. Cribier‑Delande, R. Puget, V. Guigue, L. Denoyer : “Time Series Prediction using Disentangled Latent Factors”, ESANN 2020 - 28th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, Bruges, Belgium (2020)
-
2016
- R. Puget : “Etude de la classification dans un très grand nombre de catégories”, thesis, phd defence 07/04/2016, supervision Gallinari, Patrick, co-supervision : Baskiotis, Nicolas (2016)
-
2015
- R. Puget, N. Baskiotis : “Hierarchical Label Partitioning for Large Scale Classification”, IEEE International Conference on Data Science and Advanced Analytics, DSAA'2015, Paris, France (2015)
- R. Puget, N. Baskiotis, P. Gallinari : “Sequential Dynamic Classification for Large Scale Multiclass Problems”, Extreme Classification Workshop at ICML, Lille, France (2015)
-
2014
- R. Puget, N. Baskiotis, P. Gallinari : “Scalable Learnability Measure for Hierarchical Learning in Large Scale Multi-Class Classification”, WSDM Workshop Web-Scale Classification: Classifying Big Data from the Web, New York, United States (2014)