AlphaGo and the Computational Challenges of Machine Learning
Speaker: Chris Maddison, University of Oxford
Location: 60 Fifth Avenue 150
Date: April 5, 2019, 11 a.m.
Host: Subhash Khot
Many computational challenges in machine learning involve the three problems of optimization, integration, and fixed-point computation. These three can often be reduced to one another, so they may also provide distinct vantages on a single problem. In this talk, I present a small part of this picture through a discussion of my work on AlphaGo and two vignettes on my work on the interplay between optimization and Monte Carlo. AlphaGo is the first computer program to defeat a world-champion player, Lee Sedol, in the board game of Go. My work laid the groundwork for the neural net components of AlphaGo and culminated in our Nature publication describing AlphaGo's algorithm, at whose core lie these three problems. In the first vignette, I present the Hamiltonian descent methods we introduced for first-order optimization. These methods are inspired by the Monte Carlo literature and can achieve fast linear convergence without strong convexity by using a non-standard kinetic energy to condition the optimization. In the second vignette, I cover our A* Sampling method, which reduces the problem of Monte Carlo simulation to an optimization problem, and an application to gradient estimation in stochastic computation graphs.
Chris Maddison is a PhD candidate in the Statistical Machine Learning Group in the Department of Statistics at the University of Oxford. He is an Open Philanthropy AI Fellow and spends two days a week as a Research Scientist at DeepMind. His research is broadly focused on the development of numerical methods for deep learning and machine learning. He has worked on methods for variational inference, numerical optimization, and Monte Carlo estimation, with a specific focus on those that might work at scale with few assumptions. Chris received his MSc from the University of Toronto. He received a NeurIPS Best Paper Award in 2014, and was one of the founding members of the AlphaGo project.
Refreshments will be offered starting 15 minutes prior to the scheduled start of the talk.