COMPX523-20A (HAM)

Data Stream Mining

15 Points

Edit Header Content
Division of Health Engineering Computing & Science
School of Computing and Mathematical Sciences
Department of Computer Science

Staff

Edit Staff Content

Convenor(s)

Lecturer(s)

Administrator(s)

Placement/WIL Coordinator(s)

Tutor(s)

Student Representative(s)

Lab Technician(s)

Librarian(s)

: debby.dada@waikato.ac.nz

You can contact staff by:

  • Calling +64 7 838 4466 select option 1, then enter the extension.
  • Extensions starting with 4, 5, 9 or 3 can also be direct dialled:
    • For extensions starting with 4: dial +64 7 838 extension.
    • For extensions starting with 5: dial +64 7 858 extension.
    • For extensions starting with 9: dial +64 7 837 extension.
    • For extensions starting with 3: dial +64 7 2620 + the last 3 digits of the extension e.g. 3123 = +64 7 262 0123.
Edit Staff Content

Paper Description

Edit Paper Description Content
Data streams are everywhere, from F1 racing over electricity networks to news feeds. Data stream mining relies on and develops new incremental algorithms that process streams under strict resource limitations. This paper focuses on, as well as extends the methods implemented in MOA (Java) and scikit-multiflow (Python), two open source stream mining softwares currently being developed by the Machine Learning group.
Edit Paper Description Content

Paper Structure

Edit Paper Structure Content
Class attendance is expected. The course notes provided are not comprehensive, additional material will be covered in class. You are responsible for all material covered in class.
Edit Paper Structure Content

Learning Outcomes

Edit Learning Outcomes Content

Students who successfully complete the course should be able to:

  • Students will be able to select and apply appropriate algorithms for data stream mining problems.
    Linked to the following assessments:
  • Students will be able to design and implement new algorithms in a data stream mining framework like MOA, or similar.
    Linked to the following assessments:
  • Students will be able to compare and evaluate different algorithms/solutions for a problem and summarize their findings in a report.
    Linked to the following assessments:
Edit Learning Outcomes Content
Edit Learning Outcomes Content

Assessment

Edit Assessments Content

Assessment Components

Edit Assessments Content

The internal assessment/exam ratio (as stated in the University Calendar) is 100:0. There is no final exam. The final exam makes up 0% of the overall mark.

The internal assessment/exam ratio (as stated in the University Calendar) is 100:0 or 0:0, whichever is more favourable for the student. The final exam makes up either 0% or 0% of the overall mark.

Component DescriptionDue Date TimePercentage of overall markSubmission MethodCompulsory
1. Assignment 1
15
  • Online: Submit through Moodle
2. Assignment 2
35
  • Online: Submit through Moodle
3. Test 1
30 Apr 2020
No set time
25
4. Test 2
4 Jun 2020
No set time
25
Assessment Total:     100    
Failing to complete a compulsory assessment component of a paper will result in an IC grade
Edit Assessments Content

Required and Recommended Readings

Edit Required Readings Content

Required Readings

Edit Required Readings Content

https://mitpress.mit.edu/books/machine-learning-data-streams

There is an official online version available at https://moa.cms.waikato.ac.nz/book/

Edit Required Readings Content

Recommended Readings

Edit Recommended Readings Content
Knowledge Discovery from Data Streams, by Joao Gama
Edit Recommended Readings Content

Online Support

Edit Online Support Content
Moodle is used for this paper.
Edit Online Support Content

Workload

Edit Workload Content
About 150 to 180 hours in total, including lecture, reading time, assignment, and presentation and report preparation.
Edit Workload Content

Linkages to Other Papers

Edit Linkages Content
Three 300 level Computer Science papers, including COMP321 Practical Data Mining or COMP316 Artificial Intelligence Techniques and Applications Corresponding Papers COMP523 Data Stream Mining
Edit Linkages Content

Prerequisite(s)

Prerequisite papers: COMPX305 or COMPX310 or COMP316 or COMP321 and a further 30 points at 300 level in Computer Science

Corequisite(s)

Equivalent(s)

Restriction(s)

Restricted papers: COMP423, COMP523

Edit Linkages Content