
COMPX523-23A (HAM)
Data Stream Mining
15 Points
Staff
Convenor(s)
Albert Bifet Figuerol
albert.bifet@waikato.ac.nz
|
|
Guilherme Weigert Cassales
guilherme.weigertcassales@waikato.ac.nz
|
|
Administrator(s)
Librarian(s)
You can contact staff by:
- Calling +64 7 838 4466 select option 1, then enter the extension.
-
Extensions starting with 4, 5, 9 or 3 can also be direct dialled:
- For extensions starting with 4: dial +64 7 838 extension.
- For extensions starting with 5: dial +64 7 858 extension.
- For extensions starting with 9: dial +64 7 837 extension.
- For extensions starting with 3: dial +64 7 2620 + the last 3 digits of the extension e.g. 3123 = +64 7 262 0123.
What this paper is about
Machine learning algorithms leverage data to mimic intelligent behaviour for specific tasks, varying from detecting anomalies in industrial processes to movie recommendations. Recent technological advances enabled efficient transfer, storage, and process of data. These advances also impacted machine learning algorithms by allowing an ever-growing increase in complexity, which sometimes means increased accuracy.
However, to effectively learn from fast data, it is also essential to account for changes that may have catastrophic effects on the machine learning algorithms. For example, an algorithm may signal legit credit card usage as a fraud because the users' behaviour has changed.
The reason behind these changes may not be accessible to the algorithm, making it unable to react to them immediately. For example, a different online credit card usage pattern may be because customers cannot leave their homes.
This paper combines theoretical and practical aspects of machine learning for data streams. We present examples using the methods implemented in the Massive Online Analysis (MOA) framework and river (Python) framework.
The learning outcomes for this paper are linked to Washington Accord graduate attributes WA1-WA11. Explanation of the graduate attributes can be found at: https://www.ieagreements.org/
How this paper will be taught
Required Readings
https://mitpress.mit.edu/books/machine-learning-data-streams
There is an official online version available at https://moa.cms.waikato.ac.nz/book/
Learning Outcomes
Students who successfully complete the course should be able to:
Assessments
How you will be assessed
"Samples of your work may be required as part of the Engineering New Zealand accreditation process for BE(Hons) degrees. Any samples taken will have the student name and ID redacted. If you do not want samples of your work collected then please email the engineering administrator, Natalie Shaw (natalie.shaw@waikato.ac.nz), to opt out."
The internal assessment/exam ratio (as stated in the University Calendar) is 100:0. There is no final exam.