LIP6 1997/033: THÈSE de DOCTORAT de l'UNIVERSITÉ PARIS 6
LIP6 /
LIP6 research
reports
137 pages - Juillet/July 1997 -
Document en anglais.
PostScript : 473 Ko /Kb
Contact : par mail / e-mail
Thème/Team: Apprentissage et Acquisition de Connaissances
Titre français : Apprentissage statistique et régularisation pour la régression
Titre anglais : Statistical learning and regularisation for regression
Abstract : This thesis deals with the use of statistical learning and regularisation on regression problems, with a focus on time series modelling and system identification. Both linear models and non-linear neural networks are considered as particular modelling techniques.
Linear and non-linear parametric regression are briefly introduced and their limit is shown using the bias-variance decomposition of the generalisation error. We then show that as such, those problems are ill-posed, and thus need to be regularised. Regularisation introduces a number of hyper-parameters, the setting of which is performed by estimating generalisation error. Several such methods are evoked in the course of this work.
The use of these theoretical aspects is targeted towards two particular problems. First an iterative method relying on generalisation error to extract the relevant delays from time series data is presented. Then a particular regularisation functional is studied, that provides pruning of unnecessary parameters as well as a regularising effect. This last part uses Bayesian estimators, and a brief presentation of those estimators is also given in the thesis.
Key-words : Statistical learning, regularisation, generalisation, neural networks, time series
Publications internes LIP6 1997 / LIP6 research reports 1997