Available in 2024
Course code

COMP3340

Units

10 units

Level

3000 level

Course handbook

Description

The course introduces students to identifying patterns in data that can be used to derive knowledge for prediction and classification. It exposes learners to a variety of established techniques and methodologies for data analysis. The course is motivated by selected data-analytic problems arising in business and consumer analytics, data science, and data engineering.


Availability

2024 Course Timetables

Callaghan

  • Semester 2 - 2024

Learning outcomes

On successful completion of the course students will be able to:

1. Evaluate the processes and techniques for data analysis.

2. Apply well-established approaches and develop new systems for data analytics.

3. Discuss the practical, computational and scientific issues in data mining.


Content

1)    Introduction to the Knowledge Discovery from Databases process: Representation issues and Feature Engineering.  

2)    Preprocessing of data: aggregation, sampling, discretization, attribute selection, identification of outliers, continuous and discrete measurements, missing values and imputation.  Decision trees, rule-based classifiers.

3)    Evaluating the performance of a classifier: precision, recall, TPR, FPR, TNR, FNR, sensitivity, specificity. Taking into account misclassification costs. The class imbalance problem. Confusion matrices. The Matthews Correlation Coefficient.

4)    Evaluating the performance of a model (cont.): cross-validation; bootstrap. Comparing models.

5)    Association rules (intro). The Apriori algorithm.

6)    Unsupervised methods: Basic concepts of clustering, K-means, the role of similarity measures. Clustering validation. Inter-rater reliability methods (Cohen’s and Fleiss’ kappa).
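
As an illustration of the classifier evaluation measures named in item 3, the following is a minimal sketch and not part of the official course materials. It assumes a binary classifier summarised by a 2x2 confusion matrix; the function name `binary_metrics` and the example counts are hypothetical. Undefined ratios (zero denominators) are reported as 0.0, which is one common convention rather than the only one.

```python
import math


def binary_metrics(tp, fp, tn, fn):
    """Derive common evaluation measures from 2x2 confusion-matrix counts.

    tp, fp, tn, fn are the counts of true positives, false positives,
    true negatives and false negatives. Ratios with a zero denominator
    are returned as 0.0 (an assumption made for this sketch).
    """
    def ratio(num, den):
        return num / den if den else 0.0

    precision = ratio(tp, tp + fp)
    recall = ratio(tp, tp + fn)        # also called TPR or sensitivity
    specificity = ratio(tn, tn + fp)   # also called TNR
    fpr = ratio(fp, fp + tn)           # equals 1 - specificity when defined
    fnr = ratio(fn, fn + tp)           # equals 1 - recall when defined

    # Matthews Correlation Coefficient: uses all four cells, so it stays
    # informative when the classes are imbalanced.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = ratio(tp * tn - fp * fn, denom)

    return {
        "precision": precision,
        "recall": recall,
        "specificity": specificity,
        "fpr": fpr,
        "fnr": fnr,
        "mcc": mcc,
    }


if __name__ == "__main__":
    # Hypothetical counts, chosen only to exercise the function.
    for name, value in binary_metrics(tp=40, fp=10, tn=45, fn=5).items():
        print(f"{name}: {value:.3f}")
```

Returning all measures from one function makes it easy to see that TPR/FNR and TNR/FPR are complementary pairs computed from the same matrix.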


Assumed knowledge

MATH1510 Discrete Mathematics, SENG1110 Object Oriented Programming


Assessment items

Written Assignment: Programming Assignment

Online Open Book Formal Examination: Final Exam
Compulsory Requirement: Students must obtain at least 40% in this assessment item to pass the course.


Contact hours

Semester 2 - 2024 - Callaghan

Lecture-1
  • Face to Face On Campus, 2 hours per week for 13 weeks, starting in week 1
Workshop-1
  • Face to Face On Campus, 2 hours per week for 13 weeks, starting in week 1

Course outline

Course outline not yet available.