RĂ©mi Monasson (ENS Paris)
Artificial neural networks, introduced decades ago, are now key tools for automatic learning from data. This series of six lectures will focus on a few neural network architectures used in the context of unsupervised learning, that is, learning from unlabeled data.
In particular, we will consider dimensional reduction, feature extraction, and representation building. We will see how statistical physics, in particular the techniques and concepts of random matrix theory and of disordered systems, can be used to understand the properties of these algorithms and the phase transitions taking place in their operation.
Special attention will be devoted to the so-called high-dimensional inference setting, where the number of data samples is comparable to the number of parameters defining the neural nets. The general principles will be illustrated with recent applications to data from neuroscience and genomics, highlighting the potential of unsupervised learning for biology.
Some issues:
- What is unsupervised learning?
- Hebbian learning for principal component analysis: retarded-learning phase transition and prior information (see the first sketch after this list).
- Bipartite neural nets and representations: auto-encoders, restricted Boltzmann machines, Boltzmann machines (second sketch).
- Recurrent neural nets: from point to finite-dimensional attractors, temporal sequences (third sketch).
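To make the Hebbian-PCA topic concrete, here is a minimal Python sketch, not taken from the lectures: the data model (a spiked-covariance ensemble), the parameter values, and all names are illustrative assumptions. A single unit learns with Oja's Hebbian rule, and the retarded-learning transition shows up as the overlap between the learned weights and the planted direction remaining at the random, O(1/sqrt(n)) level below the critical sample-to-dimension ratio alpha_c = 1/gamma^2 and becoming macroscopic above it.

```python
# Minimal sketch: Oja's Hebbian rule on spiked-covariance data.
# All names and parameter values are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)

def spiked_data(p, n, gamma):
    """p samples in n dimensions from x = sqrt(gamma)*s*u + noise, so the
    population covariance is the identity plus a rank-one spike gamma*u u^T
    along the planted direction u."""
    u = rng.standard_normal(n)
    u /= np.linalg.norm(u)
    s = rng.standard_normal(p)
    return np.sqrt(gamma) * np.outer(s, u) + rng.standard_normal((p, n)), u

def oja_pca(x, eta=0.01, epochs=50):
    """Online Hebbian learning with Oja's normalization,
    w <- w + eta*y*(x - y*w) with y = w.x, which converges to the top
    principal component of the data."""
    p, n = x.shape
    w = rng.standard_normal(n)
    w /= np.linalg.norm(w)
    for _ in range(epochs):
        for i in rng.permutation(p):
            y = w @ x[i]
            w += eta * y * (x[i] - y * w)
    return w / np.linalg.norm(w)

n, gamma = 400, 2.0  # retarded-learning threshold alpha_c = 1/gamma^2 = 0.25
for alpha in (0.1, 0.5, 2.0):
    x, u = spiked_data(int(alpha * n), n, gamma)
    w = oja_pca(x)
    print(f"alpha = {alpha:4.1f}   overlap |w.u| = {abs(w @ u):.3f}")
```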
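The bipartite architectures of the third topic can be illustrated with a restricted Boltzmann machine. The sketch below trains a binary RBM by one-step contrastive divergence (CD-1); the training scheme, layer sizes, and toy data are assumptions made for the example, not a description of the lectures' methods.

```python
# Minimal sketch: binary RBM trained with one-step contrastive divergence.
import numpy as np

rng = np.random.default_rng(1)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

class RBM:
    def __init__(self, n_vis, n_hid, lr=0.05):
        self.w = 0.01 * rng.standard_normal((n_vis, n_hid))
        self.a = np.zeros(n_vis)  # visible biases
        self.b = np.zeros(n_hid)  # hidden biases
        self.lr = lr

    def hidden_probs(self, v):
        return sigmoid(v @ self.w + self.b)

    def visible_probs(self, h):
        return sigmoid(h @ self.w.T + self.a)

    def cd1_step(self, v0):
        """One CD-1 update on a minibatch v0 of binary visible vectors:
        sample hiddens, reconstruct visibles, and move the weights along
        the difference of data and reconstruction correlations."""
        ph0 = self.hidden_probs(v0)
        h0 = (rng.random(ph0.shape) < ph0).astype(float)
        pv1 = self.visible_probs(h0)
        v1 = (rng.random(pv1.shape) < pv1).astype(float)
        ph1 = self.hidden_probs(v1)
        m = v0.shape[0]
        self.w += self.lr * (v0.T @ ph0 - v1.T @ ph1) / m
        self.a += self.lr * (v0 - v1).mean(axis=0)
        self.b += self.lr * (ph0 - ph1).mean(axis=0)

# Toy usage: the two hidden units learn to separate two binary prototypes.
data = np.array([[1, 1, 1, 0, 0, 0], [0, 0, 0, 1, 1, 1]] * 50, dtype=float)
rbm = RBM(n_vis=6, n_hid=2)
for _ in range(500):
    rbm.cd1_step(data)
print(np.round(rbm.hidden_probs(np.array([[1, 1, 1, 0, 0, 0.]])), 2))
```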
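For the recurrent-net topic, the Hopfield network is the textbook example of point attractors. The sketch below stores a few random patterns in a Hebbian coupling matrix and shows a corrupted pattern flowing back to the stored one under zero-temperature asynchronous dynamics; the network size and pattern load are illustrative, chosen below the retrieval capacity.

```python
# Minimal sketch: Hopfield network with Hebbian couplings and
# zero-temperature asynchronous retrieval dynamics.
import numpy as np

rng = np.random.default_rng(2)

n, n_patterns = 100, 5
patterns = rng.choice([-1.0, 1.0], size=(n_patterns, n))

# Hebbian couplings J_ij = (1/n) * sum_mu xi_i^mu xi_j^mu, no self-coupling.
J = patterns.T @ patterns / n
np.fill_diagonal(J, 0.0)

def recall(state, sweeps=10):
    """Asynchronously align each spin with its local field."""
    s = state.copy()
    for _ in range(sweeps):
        for i in rng.permutation(n):
            s[i] = 1.0 if J[i] @ s >= 0 else -1.0
    return s

# Corrupt a stored pattern, then let the dynamics flow to its attractor.
noisy = patterns[0] * np.where(rng.random(n) < 0.15, -1.0, 1.0)
print("overlap before:", noisy @ patterns[0] / n)
print("overlap after :", recall(noisy) @ patterns[0] / n)
```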