Educational Data Mining and Applications, Fall 2024

This course is one of the advanced courses in the Micro Courses on Educational Big Data. It introduces the basic concepts and principles of data mining, including data preprocessing, frequent pattern mining, classification, and clustering. Students will also use tools such as Weka and Python for educational big data analytics. Through the programming exercises and the term project, students will learn the fundamental ideas of educational data analysis and apply them to real applications with practical tools.
The course is offered at the undergraduate level.

Notes on online recording of course sessions

All students enrolled in this course have been added to the team in Microsoft Teams for the corresponding course number.
Recordings of the course sessions are available in Microsoft Teams in the following channel: Team created for Educational Data Mining and Applications Course [Course Number: 337849]

(Tentative) Schedule

* All slides can be downloaded from the iSchool+ platform at NTUT.
Week | Date | Content | Reading | Note
1 | Sep. 9 & 11, 2024 | Course Overview; Ch.1, Introduction | |
2 | Sep. 16 & 18, 2024 | Ch.2, Data, Measurements, and Data Preprocessing | DM4, Ch.2 |
3 | Sep. 23 & 25, 2024 | (Case Study: Preprocessing Educational Open Data) | DM4, Ch.2 |
4 | Sep. 30 & Oct. 2, 2024 | Ch.4, Pattern Mining: Basic Concepts and Methods | DM4, Ch.4 | HW#1
5 | Oct. 7 & 9, 2024 | Ch.4 | DM4, Ch.4 | Term Project Proposal
6 | Oct. 14 & 16, 2024 | Ch.5, Pattern Mining: Advanced Methods; (Case Study: Frequent Pattern Mining on Educational Data) | DM4, Ch.5 (selected) | Due: HW#1; 10/16: HW#2; Team Registration
7 | Oct. 21 & 23, 2024 | Ch.6, Classification: Basic Concepts | DM4, Ch.6 | Due: Team Registration
8 | Oct. 28 & 30, 2024 | 10/28: Invited Talk; (Case Study: Classifying Educational Data) | DM4, Ch.6 | 10/30 Due: HW#2; 10/30: HW#3
9 | Nov. 4 & 6, 2024 | (11/4: Midterm Exam) | |
10 | Nov. 11 & 13, 2024 | Ch.7, Classification: Advanced Methods | DM4, Ch.7 (selected sections) | 11/13 Due: HW#3
11 | Nov. 18 & 20, 2024 | Ch.8, Cluster Analysis: Basic Concepts and Methods | DM4, Ch.8 | Due: Proposal
12 | Nov. 25 & 27, 2024 | (Case Study: Clustering Educational Data); Ch.9, Cluster Analysis: Advanced Methods | DM4, Ch.9 (selected sections) |
13 | Dec. 2 & 4, 2024 | Distributed Platforms: Hadoop, Spark; MapReduce Programming; (Lab: Spark cluster demo); Spark Programming; (Lab: classification using Spark); Result Visualization and Interpretation | |
14 | Dec. 9 & 11, 2024 | Term Project Presentation (Week 1) | |
15 | Dec. 16 & 18, 2024 | (Leave for IEEE BigData 2024); (TA: Reviewing midterm exam papers) | |
16 | Dec. 23 & 25, 2024 | (Leave for IEEE BigData 2024) | |
17 | Dec. 30, 2024 & Jan. 1, 2025 | Term Project Presentation (Week 2); (1/1: Leave for New Year's Day) | |
18 | Jan. 6 & 8, 2025 | Term Project Presentation (Week 3) | |

Programming Assignments and Projects

Please submit your assignments before the deadline, following the instructions below.

Submission Instructions

NOTE: Programs or projects in electronic files must be submitted directly to i-School+.

If you cannot submit your work successfully, please contact the TA and the instructor.

Homework

There will be several written assignments and programming exercises that target different data analysis tasks.
  1. HW#1 : Ch.2 Data, Measurements, and Data Preprocessing [DM4]
    Due: Oct. 14, 2024
  2. HW#2 : Ch.4 Pattern Mining: Basic Concepts and Methods [DM4]
    Due: Oct. 30, 2024
  3. HW#3 : Ch.6-7 Classification [DM4]
    Due: Nov. 13, 2024
  4. (HW#4)

Projects

  1. Term Project
    Item | Description | Time
    Proposal | You are required to submit a proposal for the term project one week after the midterm exam. | Nov. 11, 2024 (Mon.)
    Topics | Two options: (1) a project for data analysis or related system development; (2) joining a competition as your term project. You can check the details of recent competitions as potential topics for the term project. |
    Schedule | Due to our time limits, we have to start the term project presentation as early as Dec. 16, 2024 (Mon.). | Dec. 9, 11, 30, 2024 & Jan. 6, 8, 2025

    * [NOTE] All presentations *must* be finished within the scheduled time slots, which will be the last 4 weeks in this semester. No other time slots will be available.

    Report | Each team is *required* to upload the final report after finishing its presentation. The final report should contain at least the following: (1) presentation slides; (2) source code, and documents containing installation/execution instructions, team members, and task responsibilities. | Jan. 10, 2025 (Fri.)

Exams

  1. Midterm Exam: Nov. 4-8, 2024
  2. Final Exam: Jan. 6-10, 2025

Scores

Please check the homework submission site for more details.
E-mail: jhwang AT ntut . edu . tw
Created: Sep. 2, 2024.
Last Updated: Nov. 10, 2024.