Information Retrieval and Applications, Spring 2012

This course offers an introduction to the principles and concepts in information retrieval, which is fundamental to modern Web search engines. In addition to Web search, other applications of information retrieval systems will also be described.

Course Information

Latest News

(Tentative) Schedule

The slides were slightly modified from the Stanford CS276 class.
Note: IIR - Introduction to Information Retrieval, MIR - Modern Information Retrieval, Salton - Automatic Text Processing
Week | Date | Content | Reading | Note
--- | --- | --- | --- | ---
1 | Feb. 23, 2012 | Course Overview | |
2 | Mar. 1, 2012 | Chap. 1, Boolean retrieval | IIR Ch.1, MIR Ch.1, MIR 8.1-8.2, Salton 8.1-8.3 |
3 | Mar. 8, 2012 | Chap. 2, The term vocabulary and postings lists; Chap. 3, Dictionaries and tolerant retrieval | IIR Ch.2, MIR 8.2, 7.1-7.2, Salton 8.6; IIR Ch.3, MIR 4.2, Salton Ch.9 |
4 | Mar. 15, 2012 | Chap. 4, Index construction | IIR Ch.4, MIR Ch.8 | HW#1
5 | Mar. 22, 2012 | Sec. 5.1, Statistical properties of terms in information retrieval; Chap. 6, Scoring, term weighting, and the vector space model | IIR 5.1, MIR 6.1-6.3; IIR Ch.6, MIR 2.5 |
6 | Mar. 29, 2012 | (Team registration / description of term project proposal) | |
7 | Apr. 5, 2012 | (Compensation Leave for Sports Day) | | Due: HW#1
8 | Apr. 12, 2012 | Chap. 7, Computing scores in a complete search system | IIR Ch.7, MIR 2.5 | HW#2
9 | Apr. 19, 2012 | Chap. 8, Evaluation in information retrieval | IIR Ch.8, MIR Ch.3 | Due: Proposal
10 | Apr. 26, 2012 | (Midterm Exam) | | Due: HW#2
11 | May 3, 2012 | Chap. 9, Relevance feedback and query expansion; Chap. 13, Text classification and Naive Bayes | IIR Ch.9, MIR Ch.5; IIR Ch.13 | Note: Only selected topics in Ch.13 will be covered.
12 | May 10, 2012 | Chap. 14, Vector space classification; Sec. 15.1, Support vector machines | IIR 14.1-14.3; IIR Sec.15.1 | HW#3. Note: Only selected topics in Ch.14 & Sec.15.1 will be covered.
13 | May 17, 2012 | Chap. 16, Flat clustering; Chap. 17, Hierarchical clustering | IIR Ch.16-17, MIR 5.3 | Note: Only selected topics in Ch.16 & 17 will be covered.
14 | May 24, 2012 | Chap. 19, Web search basics | IIR Ch.19, MIR Ch.13 | Due: HW#3; Due: Presentation time slot booking
15 | May 31, 2012 | Chap. 20, Web crawling and indexes; Chap. 21, Link analysis; (Advanced topics and applications of IR: CLIR, Multimedia IR, and Semantic Search) | IIR Ch.20, MIR Ch.13; IIR Ch.21, MIR 2.7 | Note: Only selected parts of Ch.21 will be introduced.
16 | Jun. 7, 2012 | Final Presentation: Week 1 (9 teams completed.) | |
17 | Jun. 14, 2012 | Final Presentation: Week 2 (11 teams completed.) | |
18 | Jun. 21, 2012 | Final Presentation: Week 3 (13 teams completed.) | |

Useful Links

Here are some useful links to information retrieval resources and further reading.

Programming Assignments and Projects

Please hand in your assignments before the deadline, following the instructions below.

Submission Instructions

NOTE: Programs or projects in electronic files must be submitted directly to the TA online at: http://140.124.183.39/IR/.

If you cannot successfully submit your work, please contact the TA or the instructor.

Homework

  1. HW#1: Indexing
    Due: Apr. 5, 2012
  2. HW#2: Query processing & searching
    Due: Apr. 26, 2012
  3. HW#3: Text Classification
    Due: May 24, 2012

Projects

  1. Term Project: paper presentation or system demonstration
    Proposal (Time: Apr. 19, 2012): You are required to submit a term project proposal before the midterm exam (Apr. 19, 2012).
    Topics: Topics for term projects include system demonstration and paper presentation.
    In paper presentations, the paper quality will *greatly* affect your term project score. Please *carefully* select good papers to read.
    Schedule (Time: Jun. 7, 14, 21, 2012): The final schedule of term project presentations has been arranged, with 33 teams in total (12 system demos and 21 paper presentations), as of Jun. 21, 2012.
    Due to our time limits, we will start the term project presentations on Jun. 7, 2012.
    Each team is allocated up to 20 minutes for its presentation (and system demo).
    * [NOTE] All presentations must be finished within the scheduled time slots, which will be the last *three* weeks of this semester. No other time slots will be available.
    If you have preferred time slots, please book before May 24, 2012.
    Report (Time: Jun. 25, 2012, Monday): Please upload your final report before Jun. 25, 2012.
    The final report should contain at least the following:
    1. presentation slides (for all teams), and
    2. source code, installation/execution instructions, team members and task responsibility (for system projects)

Exams

  1. Midterm Exam: Apr. 16-20, 2012
  2. Final Exam: Jun. 18-22, 2012

Scores

Please check the homework submission site for more details.
E-mail: jhwang AT csie . ntut . edu . tw
Created: Feb. 16, 2012.
Last Updated: Jun. 26, 2012.