Optimal transport and applications to Data Science


Optimal transport (OT) provides rich representations of the discrepancy between probability measures supported on geometric spaces. Recently, thanks to the development of computational techniques, OT has been used to address problems involving massive datasets, as an alternative to usual KL-divergence based approaches. In this talk I will introduce the OT problem and comment on its elementary duality properties. Then, I will present the entropy regularized problem and its (fast) solution via Sinkhorn iterations. Finally, I will overview two applications to Data Science: first, dimensionality reduction via Wasserstein Barycenters and Wasserstein PCA. Second, parameter inference in generative models defined through complex nonlinear transformation of a noise distribution.

Date: Jan 17, 2018 at 16:30 h
Venue: Beauchef 851, Torre Norte, Piso 7, Sala de Seminaros John Von Neumann CMM.
Speaker: Gonzalo E. Mena
Affiliation: PhD student, Columbia University
Coordinator: Prof. Felipe Tobar

Posted on Jan 8, 2018 in Seminario Aprendizaje de Máquinas, Seminars