Week | Date | Content | Reading | Note |
---|---|---|---|---|
1 | Sep. 20 & 22, 2021 |
(9/20: Compensation Leave for Mid-Autumn Festival)
Course Overview | ||
2 | Sep. 27 & 29, 2021 |
Introduction to Big Data Analytics
Ch.2, Getting to Know Your Data |
DM3, Ch.1
DM3, Ch.2 | |
3 | Oct. 4 & 6, 2021 | Ch.3, Data Preprocessing | DM3, Ch.3 (selected) | |
4 | Oct. 11 & 13, 2021 |
(Compensation Leave for National Day)
Ch.3 | HW#1 | |
5 | Oct. 18 & 20, 2021 | Ch.6, Frequent Pattern Mining | DM3, Ch.6 | Term Project Proposal |
6 | Oct. 25 & 27, 2021 | Ch.8, Classification: Basic Concepts | DM3, Ch.8 |
Due: HW#1
Team Registration |
7 | Nov. 1 & 3, 2021 | Ch.8 |
HW#2
Due: Team Registration | |
8 | Nov. 8 & 10, 2021 | Ch.9, Classification: Advanced Methods | DM3, Ch.9 (selected sections) | |
9 | Nov. 15 & 17, 2021 | Ch.10, Cluster Analysis: Basic Concepts and Methods | DM3, Ch.10 |
HW#3
Due: HW#2 |
10 | Nov. 22 & 24, 2021 | (11/22: Midterm Exam) | ||
11 | Nov. 29 & Dec. 1, 2021 |
Distribtued Platforms: Hadoop, Spark
Ref: Notes on installation, configuration, and management of Hadoop & Spark clusters |
Due: HW#3
Due: Proposal | |
12 | Dec. 6 & 8, 2021 | Parallel Programming Paradigms & Concepts | HW#4 | |
13 | Dec. 13 & 15, 2021 |
MapReduce Programming
(Lab: Spark cluster demo) | ||
14 | Dec. 20 & 22, 2021 |
Spark Programming
(Lab: classification using Spark) | ||
15 | Dec. 27 & 29, 2021 | Term Project Presentation (Week 1) | Due: HW#4 | |
16 | Jan. 3 & 5, 2022 | Term Project Presentation (Week 2) | ||
17 | Jan. 10 & 12, 2022 | Term Project Presentation (Week 3) | ||
18 | Jan. 17 & 19, 2022 | Term Project Presentation (Week 4) |
If you cannot successfully submit your work, please contact with the TA or the instructor.
[NOTE] For the programming projects in HW#2,
the DBLP dataset can be downloaded in XML format at: https://dblp.org/xml/release/
However, since DBLP dataset is very large, it might not be easy to analyze.
You can try to download the partial datasets collected by different sources.
The details can be checked in the Notes for HW#2, for example:
Item | Description | Time |
---|---|---|
Proposal | You are required to submit a
proposal for term project one week after midterm exam.
One option is to join the competitions. You can check the details on recent competitions as potential topics for term project. | Nov. 22, 2021 (Mon.) |
Topics | For paper presentations, the paper quality will *greatly* affect your score in term project. Please *carefully* select good papers to read. | |
Schedule |
Due to our time limits, we have to start the term
project presentation as early as Dec. 27, 2021 (Mon.). * [NOTE] All presentations *must* be finished within the scheduled time slots, which will be the last 4 weeks in this semester. No other time slots will be avbailable. |
Dec. 27, 29, 2020 & Jan. 3, 5, 10, 12, 17, 19, 2022 |
Report | Each team is *required* to upload the final report after finishing your presentation.
The final report should contain at least the following:
|
Jan. 22, 2022 (Sat.) |