Semester: Spring 2014. Time and Location: Tuesday and Thursday 11:0012:15pm, Room 1221, 719 Broadway. Instructor: Rob Fergus Office hours: Thursday 12:151:15pm, Room 1226, 12th floor, 719 Broadway.


Computer Vision aims to extract descriptions of the world from pictures or video. In recent years, much progress has been made on this challenging problem. The course will start by looking the established area of a geometric vision. It will then move onto midlevel problems such as tracking and segmentation. The final part of the course will focus on recognition, particularly on the problem of detecting object classes (e.g. bottles, shoes, cars) in images, currently a topic much reserach interest.
The course will be suitable for advanced undergraduates. A reasonable knowledge of linear algebra will be required, along with some basic concepts in machine learning. The homeworks will require Matlab, so familiarity with it is desirable, although not essential.
Assessment will be through four graded homework assignments.
The policy regarding late homework is as follows: (a) assignments that are late by less than 24hrs will suffer a 10% reduction; (b) those between 24 and 72 hrs late will suffer a 25%x reduction and (c) those more 72hrs late will suffer a 50% reduction. You are strongly encouraged to start the assignment early and don't be afraid to ask for help from either the TA or myself.
Date  Time  Topics  Relevant Book Chapters 






Szeliski, Ch. 1 and 2; F & P, Ch. 1 









Szeliski, Ch. 3 and 4; F & P, Ch. 6, 7 and 8 









Szeliski, Ch. 3 and 4; F and P, ch. 3 and 16; Lowe 2004 













Szeliski, Ch. 6; F & P sec. 3.1, ch. 15; Winder and Brown 2007 









Szeliski, Ch. 7; H and Z, ch. 912; F and P, ch. 10 and 11 



Szeliski, Ch. 7; H and Z, ch. 912; F and P, ch. 10 and 11 



























Szeliski, Ch. 14. 













Szeliski, Ch. 14. 

































Szeliski, Ch. 5 

















(Shi
and Malik, PAMI 2000)































The main text book that we will use is:
Szeliski, Richard, Computer Vision: Algorithms and Applications Springer, 2011. This book is available in electronic form at: Link
There are also a couple of other text books relevant to the course, although we won't be directly using them:
Forsyth, David A., and Ponce, J. Computer Vision: A Modern Approach, Prentice Hall, 2003.
Hartley, R. and Zisserman, A. Multiple View Geometry in Computer Vision, Academic Press, 2002.
Both these are available from the CIMS library.
For the object recognition part of the course, please see the Object Reconition Short Course. Link
Matlab tutorial by Hany Farid and Eero Simoncelli Link
A more comprehensive Matlab tutorial by David Griffiths Link
Further documentation on Matlab can be found here Link
Palmer, Stephen E. Vision Science: Photos to Phenomenology, MIT Press, 1999.
Strang, Gilbert. Linear Algebra and Its Applications 2/e, Academic Press, 1980.
Wandell, Brian A. Foundations of Vision, Sinauer, 1995.