ENGG3300
10 units
3000 level
Course handbook
Description
This introductory course on statistical machine learning is intended to offer foundational concepts in both the theory and practice of this exciting area. It is intended for students with some background in physics, calculus and linear algebra and focuses on supervised learning, i.e., classification and regression, and introduces nonparametric Bayesian methods. A major theme of this course is to introduce flexible model structures, and incorporate fundamental limits (such as physical conservation laws) - where appropriate - so that predictions obey these limits. The course will cover a range of methods used in statistical machine learning and will review relevant areas of statistics and probability theory as required. These methods will be studied and applied to real data from various applications throughout the course. The course also covers important practical considerations such as cross-validation, model selection and the bias-variance trade-off. The course includes theory (e.g., derivations and proofs) as well as practice (labs and assignments) and is delivered using a problem-based format. The practical aspects are implemented using the industry standard programming language, Python, which will be introduced in context throughout the course.
Availability2024 Course Timetables
Singapore PSB
- Trimester 3 - 2024 (Singapore)
Callaghan
- Semester 2 - 2024
Learning outcomes
On successful completion of the course students will be able to:
1. Propose appropriate flexible models for different problem domains
2. Quantify the uncertainty of predictions and estimates
3. Implement solutions in Python
4. Select a model using cross-validation and bias-variance trade-offs
5. Design and train deep neural-networks for classification
6. Construct Gaussian Process models for nonparametric modelling
7. Apply machine learning methods to large-data problems
8. Design solutions that adhere to fundamental limits, where appropriate
Content
- Review of relevant topics in statistics and probability
- Linear regression (including ridge regression and the Lasso)
- Classification via logistic regression and k-nearest neighbours
- Linear and quadratic discriminant analysis
- Regression & classification trees (including bagging and random forests)
- Boosting
- Neural networks and deep learning
- Bayesian non-parametric methods (including the Gaussian process)
- Incorporating fundamental limits into solutions
- Introduction to Python
Assumed knowledge
MATH1110 Mathematics for Engineering, Science and Technology 1, MATH1120 Mathematics for Engineering, Science and Technology 2, MATH2310 Calculus of Science and Engineering, ENGG1003 Introduction to Procedural Programming.
Assessment items
Tutorial / Laboratory Exercises: Laboratory Exercises (x10)
Written Assignment: Assignments (x2)
Contact hours
Trimester 3 - 2024 (Singapore) - Singapore PSB
Laboratory-1
- Face to Face On Campus 2 hour(s) per week(s) for 13 week(s) starting in week 1
Laboratory-2
- Face to Face On Campus 2 hour(s) per week(s) for 13 week(s) starting in week 1
Lectorial-1
- Face to Face On Campus 2 hour(s) per week(s) for 13 week(s) starting in week 1
Semester 2 - 2024 - Callaghan
Laboratory-1
- Face to Face On Campus 2 hour(s) per week(s) for 13 week(s) starting in week 1
Laboratory-2
- Face to Face On Campus 2 hour(s) per week(s) for 13 week(s) starting in week 1
Lectorial-1
- Face to Face On Campus 2 hour(s) per week(s) for 13 week(s) starting in week 1
Course outline
Course outline not yet available.
The University of Newcastle acknowledges the traditional custodians of the lands within our footprint areas: Awabakal, Darkinjung, Biripai, Worimi, Wonnarua, and Eora Nations. We also pay respect to the wisdom of our Elders past and present.