Bilkent University*
COMPUTER ENGINEERING DEPARTMENT
CS533:
Information Retrieval Systems (under construction)
Spring 2011
Monday 15:40-17:30, Thursday 13:40-14:30, EA502
INSTRUCTOR : Dr. FAZLI CAN
Office : EA505 (Muhendislik Fakultesi Binasi), e-mail: canf@cs.bilkent.edu.tr
Office Hours (Spring 2011): Monday 14:40-15:30, Thursday 14:40-15:30, or by appointment.
COURSE OBJECTIVES
The main objective of this course is to learn the important concepts, algorithms,
and data/file structures that are necessary to specify, design, and implement Information
Retrieval (IR) systems.
TENTATIVE COURSE SCHEDULE
IR Systems Overview; System Evaluation; Clustering and Cluster Validation; Automatic Indexing and Term
Weighting; Fundamental File Structures: Inverted File, Signature Files, Query Processing,
n-gram-based Files, PAT trees; New Event Detection and Tracking; Information Filtering;
Compression.
Assignments of Spring 2010
Assignment No. 6 : Stemming, PAT trees, TDV, Signature Files, Information Filtering, Due date: May 21 (Possible solutions sol 1, sol 2 and sol 3 by Sefa Kilic, Fatih Cakir, and Abdullah Bulbul; please be critical of the solutions)
Assignment No. 5 : Topic-based novelty detection annotation, Due date: March 19
Assignment No. 4 : Paper review (not published yet)
Assignment No. 3 : 5-minute presentation, the list of Recommended reading for IR research students, Due date: April 14 (please see the 4th page), Paper assignments, Presentations
Assignment No. 2 : IR evaluation, similarity calculation, inverted indexes, clustering, cluster maintenance, IR test collection creation, Due date: March 22, Monday, noon time (Possible solutions sol 1 and sol 2 by Emre Varol and Sefa Kilic; please be critical of the solutions)
Assignment No. 1 : Information Retrieval on a Timeline (essay), New due date: March 8, Monday, noon time
Assignments of Spring 2009 (General comment for all solutions/keys to hw assignments for all years: please be critical)
Assignment No. 5 : Summarization annotation
Assignment No. 4 : More annotation for topic-based novelty detection
Assignment No. 3 : C3M, Cluster validation, Yao's formula
Assignment No. 2 : Annotation for topic-based novelty detection
Assignment No. 1: Evaluation (TREC 6 Appendix A ), Inverted Files, Intro. to Clustering, Web Classification, New due date: March 17, 2009, 11:59 am (possible solutions, by Kaan Onarlioglu)
Single-link clustering is order independent - possible proofs: by Osman Berat Okutan, by Ramazan Yilmaz.
Assignments of Spring 2008
Assignment No. 5 : TDV, Stemming, PAT Trees, Signature Files, Information Filtering, Compression
Assignment No. 4 : Again Summarization Annotation
Assignment No. 3 : Summarization Annotation
Assignment No. 2 : Clustering (C3M, Graph theoretical, Yao's formula)
Assignment No. 1 : Evaluation, Inverted Files, Intro. to Clustering, Cranfield Methodology (key/solutions, by Hidayet Aksu)
Measuring translation consistency using clustering methods, by Damla Arifoglu, Sermetcan Baysal, Mehmet Can Kurt
News portal main page news selection, by Duygu Atilgan, Bahadir Ozdemir, Merve Saglam
News portal new event selection, by Sitar Kortik, Murat Kurtcephe
Prime number-based signature generation for information retrieval, by Cem Mengenci, Kaan Onarlioglu, Reha Oguz Selvitopi, Volkan Yazici
Story link detection, by Faruk Belet, Ilker Murat Karakas
Voices of the Orhan Pamuk Characters, by Mucahid Kutlu, Bahri Turel, Ramazan Yilmaz
Last day of Adding/Dropping Courses : February 7, Monday
Spring Break : April 11-15
Last Day to Withdraw from Courses : May 13, Friday
Last day of classes : May 13, Friday
Final exams : May 16-27
Thursday Class Schedule
Feb. 10: 2 hours *****
=================
Feb. 3 No class
Feb. 17: No class <== There will be a class to make up for the March 10 bad-weather class cancellation
Feb. 24: 2 hours ****
Mar. 3: No class
Mar. 10: 2 hours ****
Mar. 17: No class
Mar. 24: 2 hours ****
Mar. 31: No class
Apr. 7: 2 hours ****
Apr. 14: Spring Break
Apr. 21: No class
Apr. 28: 2 hours ****
May 5: No class
May 12: 2 hours ****
EXAM DATES (2011)
Midterm Exam : March 24, Thursday (rescheduled from March 14, Monday)
Final (comprehensive) : May 23, Monday (Place: EA502, time: 15:40-17:30)
GRADING POLICY
Midterm Exam : 25%
Final exam (comprehensive) : 35%
Project & Assignments : 40% (Term Project Timeline, For 2009)
------------------------- ------
Total 100 %
Letter grades will be determined according to the following table (if needed, grades will be curved).
90 - 100 %: A
80 - 89 %: B
70 - 79 %: C
60 - 69 %: D
0 - 59 %: F
GENERAL POLICIES
You are expected to do your homework assignments alone. Group work will be considered cheating. You may discuss your ideas and approaches, but do not cross the line. Group projects will be specified explicitly.
Your programs will be graded according to their correctness, algorithm design, readability, and neatness of presentation.
Your assignments must be turned in on the due dates. No late homework assignments will be accepted. No make-up or extension can be given for excuses with no proof and no prior notification.
Homework problems may be graded selectively (e.g., 1 or 2 problems out of 5; however, you have to solve all of them). The weights of individual assignments may vary.
If you need to supply written documentation with your assignments, provide a neat presentation using a word processor. This is a rule, and exceptions will be specified explicitly.
If an individual review is needed because of a question about a grade (including exams), it must be requested no later than one week after you receive your assignment or exam. This time limit is for consistency in grading.
Attendance is mandatory. If you miss a class, it is your responsibility to catch up on the course material and the announcements made in class. For each missed class, 2% of your grade may be deducted. You may miss two classes without a penalty.
ANNOUNCEMENTS
Final exam study guide and project update announcements have been published as a single document (May 21, 9:37 am)
Assignment No. 4 has been published (April 19, 1:32 pm)
Term project timeline has been published (March 21, 3:30 pm)
Because of the class cancellation due to bad weather, our schedule changes as follows; one listed item is unchanged and is included for confirmation (March 10, 12:26 pm)
As announced by the rector's office, no class today (March 10)
March 14: No midterm exam, because the class cancellation means we have not covered some essential material that I planned to include in the midterm, and there is no need to rush
March 17: Makeup class for the cancelled March 10 class (it was our class skip day)
March 21: New due date for Assignment No. 3 (by class time)
March 21: Five-minute presentations (no change in schedule for this)
March 24: New midterm date
Assignment No. 3 has been published (March 8, 1:50 am)
Preliminary information on the five-minute presentation groups has been posted (March 6, 8:46 pm).
Class pictures have been published (March 1, 2:42 am).
Assignment No. 1 and 2 have been published (Feb. 19, 3:54 pm).
Planned Thursday class schedule has been announced (Feb. 14, 11:52 am).
No class on February 3; we will make it up on Feb. 10 with a two-hour class.
Date of last update: May 21, 2011, 9:38 am.
Send comments to the author: .
* The announcements section may change every day
throughout the semester. Because of honest mistakes, there may be some errors on this page, and
I reserve the right to correct them without notice.