Bilkent University*
COMPUTER ENGINEERING DEPARTMENT
CS533:
Information Retrieval Systems
Fall 2019
Tuesday 10:40, 11:40; Friday 8:40 (Spare Hour), 9:40; EB502
INSTRUCTOR : Dr. FAZLI CAN
Office : EA511 (Muhendislik Fakultesi Binasi), e-mail: canf@cs.bilkent.edu.tr
Office Hours (Fall 2019): Tuesday 14:40-15:30, Friday 10:40-11:30, or by appointment.
COURSE OBJECTIVES
The main objective of this course is to learn the important concepts, algorithms,
and data/file structures that are necessary to design and implement Information
Retrieval (IR) systems.
TENTATIVE COURSE SCHEDULE
IR Systems Overview, System Evaluation, Clustering and Cluster Validation, Automatic Indexing and Term Weighting; Fundamental File Structures: Inverted File, Signature Files, Query Processing, Data Fusion, n-gram-based Files, PAT trees, Data Stream Processing, New Event Detection and Tracking, Maximal Marginal Relevance, Information Filtering, Efficiency and Scalability Issues and other topics based on student projects.
Assignments of Spring 2012 (please be critical of solutions)
Term Projects
Assignment No. 6: Signature Files, Information Filtering, Huffman Coding, Compression. Solutions (thanks to Emir Gülümser)
Assignment No. 5: TDV, PAT Trees, Data Fusion, Term Weighting, PageRank
Assignment No. 4: The Art of Doing Science and Engineering: Learning to Learn
Assignment No. 3 (updated on March 15): Clustering (new due date March 26). Solutions (thanks to Caner Mercan)
Term project timeline: Almost everything that you want to know about term project
5-minute presentations: Date of presentation is March 7
Assignment No. 1&2: IR evaluation, inverted indexes, IR introduction AND 5-minute presentations
Assignments of Spring 2010
Assignment No. 6: Stemming, PAT trees, TDV, Signature Files, Information Filtering. Due date: May 21 (possible solutions sol 1, sol 2 and sol 3 by Sefa Kilic, Fatih Cakir, and Abdullah Bulbul; please be critical of the solutions)
Assignment No. 5: Topic-based novelty detection annotation. Due date: March 19
Assignment No. 4: paper review (not published yet)
Assignment No. 3: 5-minute presentation, the list of recommended reading for IR research students. Due date: April 14 (please see the 4th page). Paper assignments, Presentations
Assignment No. 2: IR evaluation, similarity calculation, inverted indexes, clustering, cluster maintenance, IR test collection creation. Due date: March 22, Monday, noon (possible solutions sol 1 and sol 2 by Emre Varol and Sefa Kilic; please be critical of the solutions)
Assignment No. 1: Information Retrieval on a Timeline (essay). New due date: March 8, Monday, noon.
Assignments of Spring 2009 (General comment for all solutions/keys to hw assignments for all years: please be critical)
Assignment No. 5: Summarization annotation
Assignment No. 4: More annotation for topic-based novelty detection
Assignment No. 3: C3M, Cluster validation, Yao's formula
Assignment No. 2: Annotation for topic-based novelty detection
Assignment No. 1: Evaluation (TREC 6 Appendix A), Inverted Files, Intro. to Clustering, Web Classification. New due date: March 17, 2009, 11:59 am (possible solutions, by Kaan Onarlioglu)
Single-link clustering is order independent - possible proofs: by Osman Berat Okutan, by Ramazan Yilmaz.
Assignments of Spring 2008
Assignment No. 5: TDV, Stemming, PAT Trees, Signature Files, Information Filtering, Compression
Assignment No. 4: Again Summarization Annotation
Assignment No. 3: Summarization Annotation
Assignment No. 2: Clustering (C3M, Graph theoretical, Yao's formula)
Assignment No. 1: Evaluation, Inverted Files, Intro. to Clustering, Cranfield Methodology (key/solutions, by Hidayet Aksu)
Measuring translation consistency using clustering methods, by Damla Arifoglu, Sermetcan Baysal, Mehmet Can Kurt
News portal main page news selection, by Duygu Atilgan, Bahadir Ozdemir, Merve Saglam
News portal new event selection, by Sitar Kortik, Murat Kurtcephe
Prime number-based signature generation for information retrieval, by Cem Mengenci, Kaan Onarlioglu, Reha Oguz Selvitopi, Volkan Yazici
Story link detection, by Faruk Belet, Ilker Murat Karakas
Voices of the Orhan Pamuk Characters, by Mucahid Kutlu, Bahri Turel, Ramazan Yilmaz
Letter grades will be determined according to the following table (grades will be curved if needed).
90 - 100 %: A
80 - 89 %: B
70 - 79 %: C
60 - 69 %: D
0 - 59 %: F
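As a rough illustration of how the table above could be applied, here is a minimal Python sketch. The function name `letter_grade` is hypothetical, and the handling of fractional scores (for example 89.5) is an assumption, since the table lists only whole-number ranges. Any curve is not modeled here.

```python
def letter_grade(percent: float) -> str:
    """Map a final percentage to a letter grade using the table's lower bounds."""
    if not 0 <= percent <= 100:
        raise ValueError("percent must be between 0 and 100")
    # Assumption: a fractional score belongs to the band whose lower bound it reaches,
    # so 89.5 is treated as a B. The syllabus does not state this explicitly.
    for cutoff, letter in ((90, "A"), (80, "B"), (70, "C"), (60, "D")):
        if percent >= cutoff:
            return letter
    return "F"
```

Under this reading, `letter_grade(72)` gives `"C"` and `letter_grade(59)` gives `"F"`.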
GENERAL POLICIES
You are expected to do your homework assignments alone. Group work will be considered cheating. You may discuss your ideas and approaches, but do not cross the line. Group projects will be specified explicitly.
Your programs will be graded according to their correctness, algorithm design, readability, and neatness of presentation.
Your assignments must be turned in on the due dates. No late homework assignment will be accepted. No make-up or extension will be given for excuses with no proof and no prior notification.
Homework problems may be graded selectively (for example, 1 or 2 problems out of 5; however, you have to solve all of them). The weights of individual assignments may vary.
If you need to supply written documentation with your assignments, provide a neat presentation using a word processor. This is a rule and exceptions will be specified explicitly.
If individual review is needed due to a question on the grade (including exams), it must be requested no later than one week after receiving your assignment or exam. This time limit is for consistency in grading.
Attendance is mandatory. If you miss a class it is your responsibility to catch up in terms of course material and announcements made in the class. For each missed class 2% of your grade may be deducted. You may miss two classes without a penalty.
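The attendance rule can be read as a small piece of arithmetic. The sketch below is only one possible interpretation: it assumes "2% of your grade" means 2 percentage points per missed class beyond the two free absences, and it returns the largest deduction that could apply, since the policy says points "may be" deducted. The names `FREE_ABSENCES`, `PENALTY_PER_ABSENCE`, and `max_attendance_penalty` are hypothetical.

```python
FREE_ABSENCES = 2          # classes that may be missed without a penalty
PENALTY_PER_ABSENCE = 2.0  # assumption: percentage points per additional missed class

def max_attendance_penalty(missed_classes: int) -> float:
    """Return the largest deduction (in percentage points) the policy allows."""
    if missed_classes < 0:
        raise ValueError("missed_classes cannot be negative")
    # Only absences beyond the free allowance count toward the penalty.
    return max(0, missed_classes - FREE_ABSENCES) * PENALTY_PER_ABSENCE
```

For example, missing five classes leaves three penalized absences, so under these assumptions up to 6 percentage points could be deducted.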
ANNOUNCEMENTS
Due Date of Assignment No. 4
December 31, Tuesday
Due Dates of Class Activities: Assignment No. 3
December 3, Tuesday: Due date ==> New Due Date December 6, Friday
Due Dates of Class Activities: Assignment No. 2
Time of announcement: November 5, 2019, 10:25 pm
Time of announcement: October 25, 2019, 7:08 pm
November 8, Friday: Paper selection due date
November 15, Friday: Paper assignments are done by me
November 25, Monday: Presentation and handout due date
November 26, Tuesday: In class presentation
Due Dates of Class Activities: Assignment No. 1
Time of announcement: October 23, 2019, 1:51 pm.
November 1, Friday: Due date for HW1
Due Dates of Class Activities: Term Project Related Events
Time of announcement: October 23, 2019, 1:51 pm.
October 25, Friday: Declare your project partner: See Project Timeline document.
November 5, Tuesday: Term project proposal document submission due date
November 19, Tuesday: Term project proposal presentation
December 3, Tuesday: Term project interim report due date ==> New Due Date December 10, Tuesday
December 24, Tuesday: Term Project Presentation
Welcome to the course
Time of announcement: October 17, 2019, 5:51 pm.
Date of last update: December 30, 2019, 4:50 pm.
Send comments to the author:
* The announcements section may change every day
throughout the semester. Due to honest mistakes there can be some errors on this page, and
I reserve the right to make corrections to it without notice.