Announcements

  1. (Feb 7) Course page is online.
  2. (Feb 7) Syllabus is available.
  3. (Feb 7) Slides for Introduction to Pattern Recognition are available.
  4. (Feb 13) Slides for Bayesian Decision Theory are available.
  5. (Feb 20) First part of the slides for Parametric Models is available.
  6. (Feb 20) Second part of the slides for Parametric Models is available.
  7. (Feb 20) Third part of the slides for Parametric Models is available.
  8. (Feb 28) Homework assignment 1 is available.
  9. (Mar 13) Slides for Non-parametric Methods are available.
  10. (Mar 20) First part of the slides for Probabilistic Graphical Models is available.
  11. (Mar 21) Homework assignment 2 is available.
  12. (Mar 27) Second part of the slides for Probabilistic Graphical Models is available.
  13. (Mar 30) Third part of the slides for Probabilistic Graphical Models is available.
  14. (Apr 2) Slides for Feature Reduction and Selection are available.
  15. (Apr 13) Homework assignment 3 is available.
  16. (Apr 13) First part of the slides for Non-Bayesian Classifiers is available.
  17. (Apr 13) Second part of the slides for Non-Bayesian Classifiers is available.
  18. (Apr 19) Slides for Structural and Syntactic Pattern Recognition are available.

Personnel

Instructor: Selim Aksoy
Office: EA 422
Email:

Course Information

Schedule: Tue 15:40-17:30, Fri 13:40-15:30 (EB 104)
Office hours: Fri 15:40-17:30 (EA 422)
Prerequisites: Probability theory, statistics, linear algebra

Texts

Lecture Schedule

Chapters

Contents

Introduction to Pattern Recognition

[ Slides ]

(Feb 7, 10)

Topics:
  • Pattern recognition systems
  • The design cycle
  • An example
References:

Bayesian Decision Theory

[ Slides ]

(Feb 14, 17, 21)

Topics:
  • Modeling using continuous and discrete features
  • Discriminant functions
  • The Gaussian density
  • Error estimation
References:

Parametric Models

[ Slides: Part 1 | Part 2 | Part 3 ]

(Feb 24, 28, Mar 2, 6, 9, 13, 16)

Topics:
  • Maximum-likelihood estimation
  • Bayesian estimation
  • Expectation-Maximization and mixture density estimation
  • Hidden Markov Models
References:

Non-parametric Methods

[ Slides ]

(Mar 20, 23)

Topics:
  • Density estimation
  • Histogram-based estimation
  • Parzen windows estimation
  • Nearest neighbor estimation

Probabilistic Graphical Models

[ Slides: Part 1 | Part 2 | Part 3 ]

(Mar 27, 30, Apr 3, 6)

Topics:
  • Directed graphical models - Bayesian networks
  • Undirected graphical models
  • Inference using graphical models
  • Learning graphical models
References:

Feature Reduction and Selection

[ Slides ]

(Apr 10, 13, 17)

Topics:
  • Problems of dimensionality
  • Component analysis
    • Principal components analysis (PCA)
    • Linear discriminant analysis (LDA)
  • Manifold learning
  • Feature selection
References:

Non-Bayesian Classifiers

[ Slides: Part 1 | Part 2 ]

(Apr 20)

Topics:
  • k-nearest neighbor classifier
  • Linear discriminant functions
  • Support vector machines
References:

Structural and Syntactic Pattern Recognition

[ Slides ]

(May 8, 11, 15)

Topics:
  • Graph-theoretic methods
  • Recognition with strings
  • Grammatical methods
Readings:
  • DHS Ch 8.5-8.6
References:

Exam

Assignments

  1. Homework assignment 1 (Due: March 19, 2012 as online submission)
  2. Homework assignment 2 (Due: April 10, 2012 as online submission)
  3. Homework assignment 3 (Due: May 6, 2012 as online submission)

Late submission policy: Unless you make prior arrangements with me (before the due date), no late homework will be accepted.

Project

The purpose of the project is to enable the students to get some hands-on experience in the design, implementation and evaluation of pattern recognition algorithms by applying them to real-world problems. The objective is to try multiple algorithms for different steps of the design cycle such as feature extraction and selection, model learning and estimation, classification and evaluation, to get an acceptable accuracy on the selected data sets. An acceptable accuracy will be problem dependent but needs to be measured using formal quantitative methods.

You can use your own data from your thesis research, select data sets from the list of data resources below, or contact the instructor. In any case, you should get prior approval before starting your project.

You are free to use any programming language but Matlab is strongly recommended because it is very convenient for prototyping and has many tools available for pattern recognition. You can write the codes yourself or use any code that is available in the public domain. In case you use somebody else's code, you are required to properly cite its source and know the details of the algorithms that the code implements.

You are required to work in groups of two, and submit a project proposal, an interim progress report, a final report written in a conference paper format, and make a presentation during the finals week. Tentative schedule of the project is as follows:

All reports and software can be submitted using the online form.

Final report guidelines:

Grading Policy

Midterm exam:25%
Homework:40%
Project:30%
Class participation:5%

Related Links