Computer Vision
CS 554

Fall 2010
Department of Computer Engineering, Bilkent University

 


 

Instructor: Pinar Duygulu
Office :  EA 433
e-mail : duygulu[at]cs.bilkent.edu.tr
Phone :  (312) 290 31 43
Office hours: by appointment
Course web page: http://www.cs.bilkent.edu.tr/~duygulu/Courses/CS554/
Textbook: Computer Vision - A Modern Approach by David A. Forsyth & Jean Ponce, Prentice Hall, 1st ed., 2002
Other textbooks:
Computer Vision by Dana Ballard and Chris Brown (available online)
Digital Image Processing by Rafael Gonzalez and Richard Woods
Computer Vision by Linda Shapiro and George Stockman
Related Material: http://www.cs.bilkent.edu.tr/~duygulu/CVlinks.html
Other complementary articles will also be made available.
Time & Location: Mondays 8:40-10:30, Wednesdays 10:40-12:30, EA 502
Course Description
Basic concepts in computational vision. Relation to human visual perception. The analysis and understanding of image and video data. Mathematical foundations, image formation and representation, segmentation, feature extraction, contour and region analysis, camera geometry and calibration, stereo, motion, 3-D reconstruction, object and scene recognition, object and people tracking, human activity recognition and inference.
Prerequisites: Knowledge of linear algebra, calculus, probability, and statistics
Topics:
Introduction, Color and Light, Linear Filters, Texture, Edge Detection, Interest Points, Cameras, Multi-view Geometry, Stereopsis, Motion, Segmentation, Object Recognition, Face Recognition, Image and Video Databases

Grading:
Projects 60% (there will be 3 projects, done in groups of two students)
Midterm 20%
Paper Presentations 15%

Participation 5%

 


Announcements:

 



Lectures

  



Introduction
(slides)



Basics
(slides)





  • Topics
    • Image Representation, Review of Linear Algebra, Geometrical Transformations, Introduction to Matlab, Handling Images in Matlab
  • Readings:


Image Processing
(slides)




  • Topics
    • Image Formation, Point Processing, Blob Processing, Binary Image Analysis, Thresholding, Connected Component Analysis, Mathematical Morphology, Region Properties
  • Readings:
  • Links



Linear Filters
(slides1, slides2)




  • Topics
    • Linear filters, convolution, smoothing, derivatives, Fourier transform, sampling and aliasing, Gaussian pyramids
  • Readings


Edge Detection
(slides)




  • Topics
    • Derivatives, Edge detection, Hough Transform 
  • Readings
    • Chapter 8 from Forsyth&Ponce
    • A Computational Approach to Edge Detection, J. Canny, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 8, No. 6, Nov. 1986.
    • Chapter 4 from Olivier Faugeras' book: Three-Dimensional Computer Vision, MIT Press, 1993
  • Links

 

Texture
(slides)

  • Topics
    • Texture analysis and synthesis 


Interest Points
(slides)



  • Topics
    • Harris Detector, Local invariant points, SIFT descriptors


Radiometry
(slides)



  • Topics
    • Radiometry, measuring light
  • Readings:
    • Chapter 4 from Forsyth&Ponce

 

Color
(slides)



 

Cameras
(slides)



Camera Calibration
(slides)






Multi-view Geometry
(slides)



  • Topics
    • Epipolar geometry




Stereopsis
(slides)


  • Topics
    • Stereopsis, Matching, Reconstruction 
  • Readings
    • Chapter 11 from Forsyth&Ponce







Motion
(slides)


  • Topics
    • Optical flow, structure from motion, tracking
  • Readings


Mosaics
(slides)








Segmentation
(slides)




  • Topics
    • Segmentation, Grouping, Fitting





Object Recognition



  • Links

    • Recognizing and Learning Object Categories, by Li Fei-Fei, Rob Fergus, Antonio Torralba




Tracking and understanding human activities

Detection and Recognition of faces

 

Student Presentations


 


Assignments:



Policies

Important notes about evaluation:

    Assignments:

        There will be three programming assignments      

        You will work in groups of two or three
        Late homework is not accepted
        All programming assignments are due at midnight and must be submitted by e-mail
            Use the following format for the subject line of your e-mail:
            CS554 - Programming assignment #
            Your programming assignments should be sent as a tarball named in the following format:
                <name_surname_PA_#>.tar
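
A minimal sketch of how a submission could be packaged to match the naming rules above. It assumes that "#" stands for the assignment number and that "name_surname" is the submitting student's name; the course page does not say whose name to use for a group, so check with the instructor. The function names (archive_name, email_subject, make_tarball) are hypothetical helpers, not part of any course-provided tooling.

```python
import tarfile
from pathlib import Path


def archive_name(name: str, surname: str, assignment_no: int) -> str:
    """Build the file name following the <name_surname_PA_#>.tar pattern."""
    return f"{name}_{surname}_PA_{assignment_no}.tar"


def email_subject(assignment_no: int) -> str:
    """Build the subject line following 'CS554 - Programming assignment #'."""
    return f"CS554 - Programming assignment {assignment_no}"


def make_tarball(source_dir: str, name: str, surname: str,
                 assignment_no: int, out_dir: str = ".") -> Path:
    """Pack source_dir into an uncompressed tar archive and return its path."""
    source = Path(source_dir)
    target = Path(out_dir) / archive_name(name, surname, assignment_no)
    # Mode "w" writes a plain .tar without compression.
    with tarfile.open(str(target), "w") as tar:
        # arcname keeps only the folder name, not the full local path.
        tar.add(str(source), arcname=source.name)
    return target
```

For example, make_tarball("pa1", "ada", "lovelace", 1) would write ada_lovelace_PA_1.tar to the current directory, to be attached to an e-mail whose subject is email_subject(1).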
       
           Report guidelines:
            Follow IEEE two-column format as shown in the example and the format definition table and glossary.
            The page limit is 6 pages.
            The report should not have any page numbers, headers or footers.
            You can use IEEE's LaTeX template or Word template. (LaTeX users: Be sure to use the template's conference mode.)
            PDF submission is recommended.
       

    Presentations:
    Your presentations will be evaluated according to the following criteria. Please consider them when preparing your presentations:
        Understanding of the topic - how confident you are with the paper that you present
        Review of the related work - not just mentioning related papers, but reading some of them to understand how they relate to your paper
        Giving an overview of the paper -  the main contributions of the paper, and an overview of the approach
        Explaining the details - understanding and explaining the formulas and methods given in the paper
        Presentation - how well prepared you are, in general, to give the talk
        Use of visual material when available