Information Retrieval and Applications, Spring 2015

This course offers an introduction to the principles and concepts in information retrieval (IR), which is fundamental to modern Web search engines.
In addition to Web search, other applications of information retrieval systems will also be described.
This year, the course is offered at the graduate level and is also open to the International Graduate Program in the College of Electrical Engineering and Computer Science (EECS). It is taught in English.

Course Information

Latest News

(Tentative) Schedule

The slides were slightly modified from the Stanford CS276 class.
Note: IIR - Introduction to Information Retrieval, MIR - Modern Information Retrieval, Salton - Automatic Text Processing
| Week | Date | Content | Reading | Note |
| --- | --- | --- | --- | --- |
| 1 | Mar. 2, 2015 | Course Overview | | |
| 2 | Mar. 9, 2015 | Chap. 1, Boolean retrieval; Chap. 2, The term vocabulary and postings lists | IIR Ch.1, MIR Ch.1, MIR 8.1-8.2, Salton 8.1-8.3; IIR Ch.2, MIR 8.2, 7.1-7.2, Salton 8.6 | |
| 3 | Mar. 16, 2015 | Chap. 3, Dictionaries and tolerant retrieval | IIR Ch.3, MIR 4.2, Salton Ch.9 | HW#1 |
| 4 | Mar. 23, 2015 | Chap. 4, Index construction | IIR Ch.4, MIR Ch.8 | Due: Team Member Registration |
| 5 | Mar. 30, 2015 | Sec. 5.1, Statistical properties of terms in information retrieval; Chap. 6, Scoring, term weighting, and the vector space model | IIR 5.1, MIR 6.1-6.3; IIR Ch.6, MIR 2.5 | Term project proposal |
| 6 | Apr. 6, 2015 | (Compensation Leave for Tomb Sweeping Day) | | Due: HW#1 |
| 7 | Apr. 13, 2015 | Chap. 7, Computing scores in a complete search system; Chap. 8, Evaluation in information retrieval | IIR Ch.7, MIR 2.5; IIR Ch.8, MIR Ch.3 | HW#2 |
| 8 | Apr. 20, 2015 | Chap. 9, Relevance feedback and query expansion | IIR Ch.9, MIR Ch.5 | |
| 9 | Apr. 27, 2015 | (Midterm Exam) | | Due: HW#2 |
| 10 | May 4, 2015 | Chap. 13, Text classification and Naive Bayes | IIR Ch.13 | Due: Proposal |
| 11 | May 11, 2015 | Chap. 14, Vector space classification; Sec. 15.1, Support vector machines | IIR 14.1-14.3; IIR Sec.15.1 | HW#3. Note: Only selected topics in Ch.13, Ch.14, and Sec. 15.1 will be covered. |
| 12 | May 18, 2015 | (TA explanation on HW#3, datasets, and grading on previous homeworks) | IIR Ch.16-17, MIR 5.3 | Note: Only selected topics in Chap. 16 and 17 will be covered. |
| 13 | May 25, 2015 | Chap. 16, Flat clustering; Chap. 17, Hierarchical clustering | IIR Ch.16-17, MIR 5.3 | |
| 14 | Jun. 1, 2015 | Chap. 19, Web search basics; Chap. 20, Web crawling and indexes; Chap. 21, Link analysis; Advanced topics and applications of IR: CLIR, Multimedia IR, and Semantic Search; Social search | IIR Ch.19, MIR Ch.13; IIR Ch.20, MIR Ch.13; IIR Ch.21, MIR 2.7 | Due: HW#3. Note: Only selected parts of Ch.21 will be introduced. |
| 15 | Jun. 8, 2015 | Final Presentation, Week 1: 9 teams completed. | | |
| 16 | Jun. 15, 2015 | Final Presentation, Week 2: 9 teams completed. | | |
| 17 | Jun. 22, 2015 | Final Presentation, Week 3: 11 teams completed. | | |
| 18 | Jun. 29, 2015 | Final Presentation, Week 4 | | |

Useful Links

Here are some useful links to information retrieval resources and further reading.

Programming Assignments and Projects

Please hand in your assignments before the deadline, following the instructions below.

Submission Instructions

NOTE: Programs or projects in electronic files must be submitted directly to the TA online at: https://140.124.183.13/.

If you cannot successfully submit your work, please contact the TA or the instructor.

Homeworks

There will be about 2-3 programming homeworks targeting different IR tasks such as indexing, searching, and data analysis.
  1. HW#1: Vector Space Retrieval -- Indexing
    Due: extended to Apr. 6, 2015
  2. HW#2: Vector Space Retrieval -- Searching
    Due: Apr. 27, 2015
  3. HW#3: Text Classification
    Due: Jun. 1, 2015

Projects

  1. Term Project: paper presentation or system demonstration
    | Item | Description | Time |
    | --- | --- | --- |
    | Proposal | You are required to submit a term project proposal one week after the midterm exam. | May 4, 2015 (Mon.) |
    | Topics | For paper presentations, the paper quality will *greatly* affect your term project score. Please *carefully* select good papers to read. | |
    | Schedule | Because of time limits, term project presentations start on Jun. 8, 2015 (Mon.). [NOTE] All presentations *must* be finished within the scheduled time slots, which are the last *four* weeks of this semester. No other time slots will be available. If you have preferred time slots, please book them as early as possible in your proposal. The current schedule of term project presentations has been announced. (As of Jun. 29, 2015, there are 36 teams in total.) | Jun. 8, 15, 22, 29, 2015 |
    | Report | Each team is *required* to upload the final report after finishing its presentation. The final report should contain at least the following: (1) presentation slides (for all teams), and (2) source code, installation/execution instructions, team members and task responsibilities (for system projects). | Jun. 30, 2015 (Tue.) |

Exams

  1. Midterm Exam: Apr. 22-28, 2015
  2. Final Exam: Jun. 24-30, 2015

Scores

Please check the homework submission site for more details.
E-mail: jhwang AT csie . ntut . edu . tw
Created: Feb. 25, 2015.
Last Updated: Jul. 2, 2015.