Informaiton Retrieval and Applications, Spring 2014

This course offers an introduction to the principles and concepts in information retrieval (IR), which is fundamental to modern Web search engines.
In addition to Web search, other applications of information retrieval systems will also be described.
This year, the course is offered at graduate-level as well as the International Graduate Program in College of Electrical Engineering and Computer Science (EECS). It's taught in English.

Course Information

Latest News

(Tentative) Schedule

The slides were slightly modified from the Stanford CS276 class.
Note: IIR - Introduction to Information Retrieval, MIR - Modern Information Retrieval, Salton - Automatic Text Processing
WeekDateContentReadingNote
1Feb. 21, 2014Course Overview
2Feb. 28, 2014 (228 Memorial Day)
3Mar. 7, 2014 Chap. 1, Boolean retrieval
Chap. 2, The term vocabulary and postings lists
IIR Ch.1, MIR Ch.1, MIR 8.1-8.2, Salton 8.1-8.3
IIR Ch.2, MIR 8.2, 7.1.-7.2, Salton 8.6
4Mar. 14, 2014 Chap. 3, Dictionaries and tolerant retrieval IIR Ch.3, MIR 4.2, Salton Ch.9
5Mar. 21, 2014 Chap. 4, Index construction IIR Ch.4, MIR Ch.8 HW#1
term project proposal
6Mar. 28, 2014 Sec. 5.1 Statistical properties of terms in information retrieval
Chap. 6, Scoring, term weighting, and the vector space model
IIR 5.1, MIR 6.1-6.3
IIR Ch.6, MIR 2.5
Due: Team Member Registration
7Apr. 4, 2014(Children's Day)
8Apr. 11, 2014 Chap. 7, Computing scores in a complete search system IIR Ch,7, MIR 2.5
9Apr. 18, 2014 Chap. 8, Evaluation in information retreival IIR Ch.8, MIR Ch.3 Due: HW#1
10Apr. 25, 2014 (Midterm Exam)
11May 2, 2014 Chap. 9, Relevance feedback and query expansion IIR Ch.9, MIR Ch.5 Due: Proposal
12May 9, 2014 Chap. 13, Text classification and Naive Bayes
Chap. 14, Vector space classification
IIR Ch.13
IIR 14.1-14.3
HW#2
Note: Only selected topics in Ch.13 & Ch.14 will be covered.
13May 16, 2014 (Leave for PAKDD 2014)
14May 23, 2014 Sec. 15.1 Support vector machines
Chap. 16, Flat clustering & Chap. 17, Hierarchical clustering
IIR Sec.15.1
IIR Ch.16-17, MIR 5.3
Note: Only selected topics in 15-1, 16, & 17 will be covered.
15May 30, 2014 Chap. 19, Web search basics
Chap. 20, Web crawling and indexes
Chap. 21, Link analysis
Advanced topics and applications of IR: CLIR, Multimedia IR, and Semantic Search
IIR Ch.19, MIR Ch.13
IIR Ch.20, MIR Ch.13
IIR Ch.21, MIR 2.7
Due: HW#2
Note: Only selected parts of Ch.21 will be introduced
16Jun. 6, 2014 Final Presentation: Week 1 - 10 teams completed.
17Jun. 13, 2014 Final Presentation: Week 2
18Jun. 20, 2014 Final Presentation: Week 3

Useful Links

Here're some useful links to information retrieval related resources or further readings.

Programming Assignments and Projects

Please hand in your assignment before deadline according to the following instructions.

Submission Instructions

NOTE: Programs or projects in electronic files must be submitted directly to the TA online.

If you cannot successfully submit your work, please contact with the TA or the instructor.

Homeworks

There will be about 2 programming homeworks that target at different IR tasks.
  1. HW#1: Vector Space Retrieval (Indexing and Search)
    Due: Apr. 18, 2014
  2. HW#2: Text Classification
    Due: extended to May 30, 2014

Projects

  1. Term Project: paper presentation or system demonstration
    ItemDescriptionTime
    Proposal You are required to submit a proposal for term project before midterm exam.
    As of May 28, 2014, 63 students in 39 teams submitted their proposal for term project presentation. There are 8 students who haven't submitted their proposal. Please so it as soon as possible.
    extended to May 2, 2014 (Fri.)
    Topics The list of topics in term project proposals has been posted. There are 17 papers, 16 demos, and 6 surveys in term project presentation.
    For paper presentations, the paper quality will *greatly* affect your score in term project. Please *carefully* select good papers to read.
    Schedule The presentation schedule for term project has been arranged. Please check if there's any problem.
    Due to our time limits, we might probably start the term project presentation on May 30, 2014 (Fri.).
    Each team is allocated up to 8 12-15 minutes for your presentation (and system demo).
    * [NOTE] All presentations *must* be finished within the scheduled time slots, which will be the last *three* weeks of this semester. No other time slots will be avbailable.
    If you have preferred time slots, please book at your earliest convenience in your proposal.
    Jun. 6, 13, 20, 2014
    ReportEach team is *required* to upload the final report after finishing your presentation.
    The final report should contain at least the following:
    1. presentation slides (for all teams), and
    2. source code, installation/execution instructions, team members and task responsibility (for system projects)
    Jun. 23, 2014 (Mon.)

Exams

  1. Midterm Exam: Apr. 14-18, 2014
  2. Final Exam: Jun. 16-20, 2014

Scores

Please check the homework submission site for more details.
E-mail: jhwang AT csie . ntut . edu . tw
Created: Feb. 20, 2014.
Last Updated: Jun. 27, 2014.