Bilkent University*
COMPUTER ENGINEERING DEPARTMENT

CS351: DATA ORGANIZATION and MANAGEMENT

Fall 2010
Section 1:        Mon.  10:40, 11:40, Thur.  08:40, 09:40; EB104  
Section 2:        Mon.  15:40, 16:40, Thur.  13:40, 14:40; EB101
Section 3:        Tues.  15:40, 16:40, Fri.    13:40, 14:40; EB204

Hours not underlined are reserved for possible makeups.


INSTRUCTOR

Dr. FAZLI CAN
Office: EA505, Phone: 290-2613
Office Hours: Mon. 14:40-15:30, Thu. 10:40-11:30 and by appointment
E-mail: canf at cs dot bilkent dot edu dot tr


ASSISTANTS

HAYRETTİN ERDEM
Office:
EA126, Phone: 290-3019
Office Hours: Wed. 9:40-11:30, and by appointment
E-mail: hayrettin at cs dot bilkent dot edu

ÇAĞRI TORAMAN
Office:
EA126, Phone: 290-3019
Office Hours: Wed. 13:40-15:30, and by appointment
E-mail: ctoraman at cs dot bilkent dot edu

EMRE VAROL
Office:
EA531, Phone: 290-1451
Office Hours: Thu. 13:40-15:30, and by appointment
E-mail: evarol at cs dot bilkent dot edu


COURSE OBJECTIVES

Structure, organization and processing of files. Physical characteristics of storage media. Sequential file creation, access, and update. Sort/merge algorithms.
Direct file processing techniques. Dynamic hashing techniques: Extendible, Linear and Dynamic hashing. Indexed and Relative files: creation, processing and update.
B trees and  B+-trees as index structures and their maintenance.  File inversion, secondary key retrieval techniques. Introduction to database management systems. Three level system architecture. Entity-Relationship Modeling. Relational Model of Data.  Relational Algebra and Relational Calculus.

Students will practice and apply data organization and processing techniques through programming projects.

Prerequisite: CS 202-Fundamental Structures of Computer Science II.


COURSE SCHEDULE

The following program is subject to change throughout the semester. (Syllabus)

Week No.: Day/Month Topic
1: 16/9

Course overview.

2: 20/9

Secondary storage media and their physical characteristics.  Sequential files.

3: 27/9

Sequential file organization: creation, access and update. Performance of sequential files operations.

4: 4/10

External sorting/merging algorithms.

5: 11/10

External sorting/merging algorithms. Static hashing, address calculation and collision handling.

6: 18/10

Static hashing, address calculation and collision handling. Dynamic hashing techniques.

7: 25/10

Dynamic hashing techniques.

8: 1/11

Dynamic hashing techniques.  Indexed sequential access method (ISAM).  Midterm EXAM: Nov. 6 Saturday (tentative).

9: 8/11

B-trees and B+ trees: structure, organization, use and performance.

10: 15/11

Kurban Bayramı.

11: 22/11

B+ trees.  Secondary key retrievals.

12: 29/11

Introduction to database management systems. Architecture, components and facilities of a DBMS.

13: 6/12

Data model principles.

14: 13/12

Entity relationship data model.

15: 20/12

Relational model of data. Relational algebra and calculus.

16: 27/12

Relational algebra and calculus.

PREREQUISITE

CS 202 - Fundamental Structures of Computer Science II.


TEXTBOOK AND REFERENCES

Ramakrishnan, R., Gehrke, J. Database Management Systems, 3 rd ed. McGrawHill, Boston, MA, 2003. (Current Textbook).
Tharp, A. L. File Organization and Processing, John Willey & Sons, New York, NY, 1988. (QA76.9.F5 T48 )
Salzberg, B. File Structures: An Analytical Approach, Prentice Hall, Endlewood Cliffs, NJ, 1988. (QA76.9.F5 S25)
Korth, H. F. Sllberschatz, A. Database System Concepts, 3rd ed. McGraw Hill, 1997.


ASSIGNMENTS & OTHER COURSE MATERIAL

Fall 2010
Fall 2010: Homeworks (Please be critical of the published solutions.)
HW4: ER Model and Relational Algebra (Due Date: January 5, 2011) solutions
HW3: ISAM, B and B+ trees, Searching Sorted Sequential Files (Due Date: December 15, 2010)
solutions
HW2: (minor revision on Oct. 24) External sorting/merging (Due Date: Oct. 27, 2010) solutions
HW1: (minor revision on Oct. 11) Disk storage characteristics and Unsorted Sequential Files (Due Date: Oct. 18, 2010) solutions, solution to Q6.

Programming Projects (How to submit)
Project3, FAQ: DNS server simulation using linear hashing file structure (Due Date: January 3 9, 2011; 11:59 pm) The two highest grades of three projects will be used to determine your total project grade.
Project2 (Dec. 16 version) , FAQ: Linear hashing file structure (Due Date: December17 19, 2010; 11:59 pm) (Resubmission Due Date: January 8, 2011; 11:59 pm)
Project1 (updated) , FAQ, Test Case: Intersection of two sequential files (Due Date: November 10 12, 2010; 11:59 pm)

Fall 2010:
Quizzes
Quiz 6 : Relational Algebra (Dec. 27 & 28, 2010)
Quiz 4
: B+ trees (Dec. 2 & 3, 2010)
Quiz 3 : Dynamic hashing (linear hashing, extendible hashing) (Oct. 25 - Nov. 1, 2010)
Quiz 2 : External sorting (heap sort, replacement selection sort, p-way merge) (Oct. 18 - Oct. 19, 2010)
Quiz 1
: Sequentail file processing (bucket vs. block concept), Disk concets (cylinder track, etc.) (Sep. 28 - Oct. 1, 2010)

Midterm (Nov. 6, 2010) questions/solutions

Fall 2010
Final (Jan. 7, 2011) Question 8 solution
Final exam study guide
Midterm study guide

Fall 2009
Final exam study guide

Fall 2009: Homeworks (Solutions are by your classmates, please be critical of their solutions, typos and mistakes -if any- may have been left for your correction.)
HW5: ER (E/R) Model, Relational Model, Relational Algebra (Due Date: Dec. 29, 2009), solutions
HW4: B-trees and B+ trees (Due Date: Dec. 4, 2009), solutions
HW3
: Sorting, merging, static hashing, linear hashing, extendible hashing (Due Date: Nov. 5, 2009)
solutions, alternative solutions
HW2: Sequential file processing, sorting (Due Date: Oct. 16, 2009)solutions, alternative solutions
HW1
: Disk tape storage characteristics (Due Date: Oct. 6, 2009)
solutions

Programming Projects (How to submit)
Project3 (pdf, docx, Records.txt): Replacement selection sort (Due Date: Dec. 24, 2009; 23:59)
Project2(pdf, docx): Static hashing file transaction processing (Due Date: Nov. 16, 2009; 23:59, Nov. 23, 2009; 23:59, Dec. 2, 2009; 23:59), Deletion example
Project2 Files (all txt): Transactions, Students, Overflow, HashFile
Project1
: Static hashing file creation, FAQ
(Frequently Asked Questions) , sample input file (Due Date: Oct. 26, 2009; 23:59) <== new due date

Fall 2009:
Quizzes
Quiz 1: Sec.1-3 : Sequential file processing: Tf, Ty, Tcopy etc.(Sep. 28-Oct. 2, 2009)

Midterm (Nov. 7, 2009) questions/solutions


Course Material From Previous Years   


EXAM DATES

Midterm Exam : November 6, 2010 (between 13:00 - 15:00 12:00-14:00) <== Please note the time change.Student Exam Room Assignment: from last name - to last name
EB 101: Acun - Bayraktar
EB 102: Baysal - Dışpınar
EB 103: Dikmen - Karsavuran
EB 104: Karslı - Özcan
EB 201: Özçelik - Takmaz
EB 202: Tanrıseven - Yükseker

Final exam (comprehensive): January 7, 2011
Assignment: from last name - to last name (Proctor)
EB 101: Acun - Bayraktar (Emre Varol)
EB 102: Baysal - Dışpınar (Çağrı Toraman)
EB 103: Dikmen - Karsavuran (Hayrettin Erdem)
EB 104: Karslı - Özcan (Serkan Sarıtaş)
EB 201: Özçelik - Takmaz (Gülden Olgun)
EB 202: Tanrıseven – Yükseker (Hande Adıgüzel)


GRADING POLICY

Midterm exam  25%
Final exam (comprehensive) 35%
Homeworks & Projects 1525%
Quizzes  15%
---------------------------------
Total  100%

GENERAL POLICIES

  1. Your work (homeworks and project) should be turned in on due date, no late work will be accepted. 
  2. If individual review is needed due to a question on the grade this should be no later than one week after receiving your work or exam.  This time limit is for consistency in grading.
  3. Plagiarism is defined as the action of using or copying someone else's idea or work and pretending that you thought of it, or created it. Bilkent University requires that you be aware of the concept and dangers of plagiarism. In order to conform to international academic standards, you must respect the individual thoughts, ideas, and expressions of other authors in sources.

    In the homeworks and projects in this course, occurrences of plagiarism will be seriously dealt with, leading to a zero grade for the work concerned and upon repetition to a failure in the course, even to punishment through disciplinary procedures which call for a term or two terms of dispelling from the university. (Ogrenci Disiplin Ilke ve Kurallari, Madde 8).

    You may discuss and exchange ideas related to homework problems and the various aspects of the term project among yourselves, you may consult to relevant books and other forms of written material, but the final work must be your own, with references to the sources utilized.

OFFICE HOUR POLICIES

In case I am not able to be in my office during any office hour period, I will announce alternate hours to make up for them. Please do not assume that I am available to answer your questions any time you may barge into my office. During office hours, you are welcome to stop by and discuss your questions with me. If you cannot make it during my hours, you are further welcome to contact me and make an appointment for a more suitable time.


ANNOUNCEMENTS

  • HW3 and HW4 solutions have been published. (Announcement Time: January 6, 2011; 5:30 am.)
  • Final exam study guide has been published. Project 2: resubmission deadline: January 8, Saturday, 23:59; Project 3: new submission deadline: January 9, Sunday, 23:59. (Announcement Time: January 3, 2011; 1:56 am.)
  • HW4 has been published. A test case for Project 2 has been published. My decision about project 3 stands: the two highest grades of three projects will be used to determine your total project grade (no change in that policy).
  • Project 3 has been published. (Announcement Time: December 22, 2011; 5:10 pm.) The two highest grades of three projects will be used to determine your total project grade.
  • A new version of project 2 has been published. The new due date is Dec. 19, 2010: 23:59 pm. (Announcement Time: December 17, 2010; 11:47 pm.)
  • Solutions to quiz no. 4 have been published. (Announcement Time: December 8, 2010; 9:28 pm.)
  • Project 2 FAQ page has been published. (Announcement Time: December 8, 2010; 2:14 am.)
  • HW3 has been published. (Announcement Time: December 7, 2010; 9:31 am.)
  • Project 2 has been published. (Announcement Time: December 3, 2010; 4:38 pm.)
  • Midterm solution has been published. (Announcement Time: November 16, 2010; 9:07 am.)
  • Project 1 description has been updated (Dated November 4, one sentence is added to part B to clarify a possible ambiguity, and it is specified that a large sample will also be used.) (Announcement Time: November 7, 2010; 2:50 pm.)
  • ProjectProject 1 due date has been changed as November 12, 2010; Friday 11:59 pm.
  • Midterm study quide and solutions to quiz no. 3 have been published. (Announcement Time: November 3, 2010; 4:19 am.)
  • HW2 solution is published. (Announcement Time: October 30, 2010; 4:02 pm.)
  • Midterm time change announcement, HW1-Q6 updated solution is published, FAQ for Project 1 is announced. (Announcement Time: October 30, 2010; 8:48 am.)
  • HW1 has been published. (Announcement Time: October 7, 2010; 16:10 pm.)
  • The class pictures have been published. (Announcement Time: October 3, 2010; 12:45 pm.)
  • Welcome to the course. (Announcement Time: September 19, 2010; 11:25 pm.)
  • September 19, 2010: boring class video

Date/time of last update: January 17, 2011; 7:35 pm

Send comments to the author: canf@cs.bilkent.edu.tr

* The announcements section may change every day throughout the semester. There can be some errors on this page and I keep the right of making corrections without any notice.