Information Retrieval and Applications, Spring 2010

This course offers an introduction to the principles and concepts in information retrieval, which is fundamental to modern Web search engines. In addition to Web search, other applications of information retrieval systems will also be described.

Course Information

Latest News

(Tentative) Schedule

The slides were slightly modified from the Stanford CS276 class.
Note: IIR - Introduction to Information Retrieval, MIR - Modern Information Retrieval, Salton - Automatic Text Processing
| Week | Date | Content | Reading | Note |
| --- | --- | --- | --- | --- |
| 1 | Feb. 22, 2010 | (Term Not Started Yet) | | |
| 2 | Mar. 1, 2010 | Course Overview; Chap. 1, Boolean retrieval | IIR Ch.1, MIR Ch.1, MIR 8.1-8.2, Salton 8.1-8.3 | |
| 3 | Mar. 8, 2010 | Chap. 2, The term vocabulary and postings lists | IIR Ch.2, MIR 8.2, 7.1-7.2, Salton 8.6 | |
| 4 | Mar. 15, 2010 | Chap. 3, Dictionaries and tolerant retrieval | IIR Ch.3, MIR 4.2, Salton Ch.9 | |
| 5 | Mar. 22, 2010 | Chap. 4, Index construction | IIR Ch.4, MIR Ch.8 | HW#1 |
| 6 | Mar. 29, 2010 | Sec. 5.1 Statistical properties of terms in information retrieval; Chap. 6, Scoring, term weighting, and the vector space model | IIR 5.1, MIR 6.1-6.3; IIR Ch.6, MIR 2.5 | |
| 7 | Apr. 5, 2010 | (Tomb Sweeping Day or Ching-Ming Holidays) | | |
| 8 | Apr. 12, 2010 | Chap. 7, Computing scores in a complete search system | IIR Ch.7, MIR 2.5 | HW#1 Due, HW#2 |
| 9 | Apr. 19, 2010 | Chap. 8, Evaluation in information retrieval | IIR Ch.8, MIR Ch.3 | |
| 10 | Apr. 26, 2010 | (Midterm Exam) | | |
| 11 | May 3, 2010 | Chap. 9, Relevance feedback and query expansion | IIR Ch.9, MIR Ch.5 | HW#2 Due |
| 12 | May 10, 2010 | Chap. 19, Web search basics | IIR Ch.19, MIR Ch.13 | Project Proposal Due |
| 13 | May 17, 2010 | Chap. 20, Web crawling and indexes; (Selected topics: Chap. 21, Link analysis) | IIR Ch.20, MIR Ch.13; IIR Ch.21, MIR 2.7 | |
| 14 | May 24, 2010 | Chap. 13, Text classification and Naive Bayes; Chap. 14, Vector space classification | IIR Ch.13; IIR 14.1-14.3 | |
| 15 | May 31, 2010 | Sec. 15.1 Support vector machines; Chap. 16, Flat clustering & Chap. 17, Hierarchical clustering | IIR 15.1; IIR Ch.16-17, MIR 5.3 | Only Sec.15.1 and parts of Ch.16 & 17 will be introduced |
| 16 | Jun. 7, 2010 | Final Presentation: Week 1 | | 10 teams completed |
| 17 | Jun. 14, 2010 | Final Presentation: Week 2 | | 10 teams completed |
| 18 | Jun. 21, 2010 | Final Presentation: Week 3 | | 15 teams completed |

Useful Links

Here are some useful links to information retrieval resources and further reading.

Programming Assignments and Projects

Please hand in your assignments before the deadline, following the instructions below.

Submission Instructions

NOTE: Programs or projects in electronic files must be submitted directly to the TA online at: http://140.124.183.16/ir/.

If you cannot successfully submit your work, please contact the TA or the instructor.

Homeworks

  1. HW#1: Indexing
    Due: Extended to Apr. 12, 2010
  2. HW#2: Query processing and searching
    Due: Extended to May 3, 2010

Projects

  1. Term Project: paper presentation or system demonstration
    | Item | Description | Time |
    | --- | --- | --- |
    | Proposal | You are required to submit a proposal for the final project after the midterm exam (extended to May 10, 2010). Note: Please submit your proposal *as soon as possible*, since it will be included in the final grading of the project. | (May 10, 2010) |
    | Topics | The current list of topics for the term project (as of May 17, 2010) | |
    | Schedule | Term project presentations were scheduled in the last three weeks of this semester (final schedule updated on Jun. 21, 2010). | |
    | Report | The final report should contain at least the following: (1) presentation slides (for all teams), and (2) source code, installation/execution instructions, team members and task responsibilities (for system projects) | (Jun. 25, 2010) |

Exams

  1. Midterm Exam: Apr. 19-23, 2010
  2. Final Exam: Jun. 21-25, 2010

Scores


E-mail: jhwang AT csie . ntut . edu . tw
Created: Feb. 22, 2010.
Last Updated: Jun. 29, 2010.