Semester: Fall 2012. Time and Location: Tuesday 5:00-6:50pm, Room 1221, 715 Broadway. Instructor: Rob Fergus Office hours: Tuesday 6:50-8:50pm, Room 1226, 12th floor, 715 Broadway. |
|
Computer Vision aims to extract descriptions of the world from pictures or video. In recent years, much progress has been made on this challenging problem. The course will start by looking the established area of a geometric vision. It will then move onto mid-level problems such as tracking and segmentation. The final part of the course will focus on recognition, particularly on the problem of detecting object classes (e.g. bottles, shoes, cars) in images, currently a topic much reserach interest.
The course will be suitable for masters and PhD students. A reasonable knowledge of linear algebra will be required, along with some basic concepts in machine learning. The homeworks will require Matlab, so familiarity with it is desirable, although not essential.
Assessment will be through four graded homework assignments.
Date | Time | Topics | Relevant Book Chapters |
---|---|---|---|
|
|||
|
|
|
Szeliski, Ch. 1 and 2; F & P, Ch. 1 |
|
|||
|
|
|
Szeliski, Ch. 3 and 4; F & P, Ch. 6, 7 and 8 |
|
|||
|
|
|
Szeliski, Ch. 3 and 4; F and P, ch. 3 and 16; Lowe 2004 |
|
|
|
|
|
|||
|
|
|
Szeliski, Ch. 6; F & P sec. 3.1, ch. 15; Winder and Brown 2007 |
|
|||
|
|
|
Szeliski, Ch. 7; H and Z, ch. 9-12; F and P, ch. 10 and 11 |
|
|||
|
|
|
Szeliski, Ch. 7; F and P, ch. 12, 13; H and Z, ch. 18. |
|
|
|
|
|
|
|
|
|
|||
|
|||
|
|||
|
|
|
|
|
|||
|
|
|
Szeliski, Ch. 14. |
|
|||
|
|
|
Szeliski, Ch. 14. |
|
|
|
|
|
|
|
|
|
|||
|
|
|
|
|
|||
|
|
|
|
|
|||
|
|
|
Szeliski, Ch. 5 |
|
|||
|
|
|
Szeliski, Ch. 8 |
|
|
|
|
|
|
|
(Shi
and Malik, PAMI 2000)
|
|
|||
|
|
|
|
|
|||
|
|
|
|
The main text book that we will use is:
Szeliski, Richard, Computer Vision: Algorithms and Applications Springer, 2011. This book is available in electronic form at: Link
There are also a couple of other text books relevant to the course, although we won't be directly using them:
Forsyth, David A., and Ponce, J. Computer Vision: A Modern Approach, Prentice Hall, 2003.
Hartley, R. and Zisserman, A. Multiple View Geometry in Computer Vision, Academic Press, 2002.
Both these are available from the CIMS library.
For the object recognition part of the course, please see the Object Reconition Short Course. Link
Matlab tutorial by Hany Farid and Eero Simoncelli Link
A more comprehensive Matlab tutorial by David Griffiths Link
Further documentation on Matlab can be found here Link
Palmer, Stephen E. Vision Science: Photos to Phenomenology, MIT Press, 1999.
Strang, Gilbert. Linear Algebra and Its Applications 2/e, Academic Press, 1980.
Wandell, Brian A. Foundations of Vision, Sinauer, 1995.