
Dr Mahakim Newton
Lecturer - Data Science
School of Information and Physical Sciences (Data Science and Statistics)
- Email:mahakim.newton@newcastle.edu.au
- Phone:0249216850
Career Summary
Biography
Qualifications
- Doctor of Philosophy in Computer and Information Sciences, University of Strathclyde
- Graduate Certificate in Higher Education, Griffith University
Keywords
- Artificial Intelligence
- Big Data Analytics
- Bioinformatics
- Computational Biology
- Computational Chemistry
- Computational Drug Discovery
- Computer Science Education
- Constraints
- Data Science
- Decision Trees
- Deep Learning
- Environmental Informatics
- Explainable Algorithms
- Genomics and Proteomics
- Health Informatics
- Internet of Things
- Life Science
- Machine Learning
- Planning
- Satisfiability
- Scheduling
- Search and Optimisation
- Supply Chain
- Time Series
Languages
- English (Fluent)
- Bengali (Mother)
Fields of Research
| Code | Description | Percentage |
|---|---|---|
| 460299 | Artificial intelligence not elsewhere classified | 40 |
| 460501 | Data engineering and data science | 30 |
| 310299 | Bioinformatics and computational biology not elsewhere classified | 30 |
Professional Experience
UON Appointment
| Title | Organisation / Department |
|---|---|
| Lecturer - Data Science | University of Newcastle School of Information and Physical Sciences Australia |
Academic appointment
| Dates | Title | Organisation / Department |
|---|---|---|
| 1/1/2013 - 25/3/2022 | Lecturer/Research Fellow | Griffith University School of Information and Communication Technology Australia |
| 19/1/2002 - 19/7/2009 | Assistance Professor (Computer Science and Engineering) | Bangladesh University of Engineering and Technology Bangladesh |
| 19/1/2000 - 18/1/2003 | Lecturer (Computer Science and Engineering) | Bangladesh University of Engineering and Technology Bangladesh |
Professional appointment
| Dates | Title | Organisation / Department |
|---|---|---|
| 21/7/2009 - 31/12/2012 | Research Engineer (Search and Optimisation) | National ICT Australia Australia |
Teaching
| Code | Course | Role | Duration |
|---|---|---|---|
| COMP6900 |
Computing Project School of Information and Physical Sciences, The University of Newcastle, Australia |
Course Coordinator | 1/7/2023 - 31/12/2023 |
| COMP6900 |
Computing Project School of Information and Physical Sciences, The University of Newcastle, Australia |
Course Coordinator | 1/7/2022 - 31/12/2022 |
| COMP6230 |
Algorithms School of Information and Physical Sciences, The University of Newcastle, Australia |
Course Coordinator | 1/7/2023 - 31/12/2023 |
| COMP6230 |
Algorithms School of Information and Physical Sciences, The University of Newcastle, Australia |
Course Coordinator | 1/7/2022 - 31/12/2022 |
| STAT3800 |
Deterministic and Stochastic Optimisation School of Information and Physical Sciences, The University of Newcastle, Australia |
Course Teacher | 1/1/2023 - 30/6/2023 |
| COMP2230 |
Algorithms School of Information and Physical Sciences, The University of Newcastle, Australia |
Course Coordinator | 1/7/2022 - 31/12/2022 |
| STAT3800 |
Deterministic and Stochastic Optimisation School of Information and Physical Sciences, The University of Newcastle, Australia |
Course Coordinator | 1/1/2024 - 30/6/2024 |
| COMP2230 |
Algorithms School of Information and Physical Sciences, The University of Newcastle, Australia |
Course Coordinator | 1/7/2023 - 31/12/2023 |
Publications
For publications that are currently unpublished or in-press, details are shown in italics.
Conference (34 outputs)
| Year | Citation | Altmetrics | Link | |||||
|---|---|---|---|---|---|---|---|---|
| 2022 |
Zaman R, Newton MAH, Mataeimoghadam F, Sattar A, 'Tailoring Contact Based Scoring Functions for Protein Structure Prediction', AI 2021: ADVANCES IN ARTIFICIAL INTELLIGENCE, 13151, 155-168 (2022) [E1]
|
|||||||
| 2021 |
Mataeimoghadam F, Newton MAH, Zaman R, Sattar A, 'Improving Protein Backbone Angle Prediction Using Hidden Markov Models in Deep Learning', Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 13031 LNAI, 239-251 (2021) [E1]
|
|||||||
| 2020 |
Namazi M, Sanderson C, Newton MAH, Sattar A, 'Surrogate assisted optimisation for travelling thief problems', Proceedings of the 13th International Symposium on Combinatorial Search, SoCS 2020, 111-115 (2020) [E1]
|
|||||||
| 2019 |
Riahi V, Newton MAH, Sattar A, 'Exploiting Setup Time Constraints in Local Search for Flowshop Scheduling', Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 11671 LNAI, 379-392 (2019) [E1]
|
|||||||
| 2019 |
Namazi M, Newton MAH, Sattar A, Sanderson C, 'A profit guided coordination heuristic for travelling thief problems', Proceedings of the 12th International Symposium on Combinatorial Search, SoCS 2019, 140-144 (2019) [E1]
|
|||||||
| 2019 |
Karim A, Singh J, Mishra A, Dehzangi A, Newton MAH, Sattar A, 'Toxicity Prediction by Multimodal Deep Learning', Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 11669 LNAI, 142-152 (2019) [E1]
|
|||||||
| 2018 |
Namazi M, Sanderson C, Newton MAH, Polash MMA, Sattar A, 'Diversified Late Acceptance Search', AI 2018: ADVANCES IN ARTIFICIAL INTELLIGENCE, 11320, 299-311 (2018) [E1]
|
|||||||
| 2018 |
Riahi V, Newton MAH, Sattar A, 'Constraint-Guided Local Search for Single Mixed-Operation Runway', AI 2018: ADVANCES IN ARTIFICIAL INTELLIGENCE, 11320, 329-341 (2018) [E1]
|
|||||||
| 2018 |
Riahi V, Polash MMA, Newton MAH, Sattar A, 'Mixed Neighbourhood Local Search for Customer Order Scheduling Problem', PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 11012, 296-309 (2018) [E1]
|
|||||||
| 2018 |
Riahi V, Newton MAH, Su K, Sattar A, 'Local Search for Flowshops with Setup Times and Blocking Constraints', TWENTY-EIGHTH INTERNATIONAL CONFERENCE ON AUTOMATED PLANNING AND SCHEDULING (ICAPS 2018), 199-207 (2018) [E1]
|
|||||||
| 2016 |
Namazi M, Ghooshchi NG, Hakim Newton MA, Sattar A, 'Assignment precipitation in fail first search', Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 9992 LNAI, 281-287 (2016) [E1]
|
|||||||
| Show 31 more conferences | ||||||||
Journal article (42 outputs)
| Year | Citation | Altmetrics | Link | |||||
|---|---|---|---|---|---|---|---|---|
| 2025 |
Azad T, Newton MAH, Trevathan J, Sattar A, 'Syntactic and Semantic Edge Interoperability', Journal of Internet Services and Applications, 16, 235-252 (2025) [C1]
|
|||||||
| 2025 |
Sarker P, Banshal S, Newton M, Anika S, Sumsee Sristy M, Chakrobortty A, 'Perceived key factors affecting online university classrooms', Education and Information Technologies (2025)
|
|||||||
| 2025 |
Azad T, Newton MAH, Trevathan J, Sattar A, 'IoT edge network interoperability', Computer Communications, 236 (2025) [C1]
Network interoperability is crucial for achieving seamless communication across Internet of Things (IoT) environments. IoT comprises heterogeneous devices and systems s... [more] Network interoperability is crucial for achieving seamless communication across Internet of Things (IoT) environments. IoT comprises heterogeneous devices and systems supporting diverse technologies, protocols, and manufacturers. Enabling devices to communicate and exchange data effectively, regardless of underlying protocols, is key to building cohesive and integrated IoT networks. IoT has transformed multiple sectors ranging from home automation to healthcare¿by harnessing a vast array of sensors and actuators that communicate through cloud, fog, and edge layers. However, the variety in device manufacturing and communication standards demands interoperable interfaces, and most current solutions depend on cloud-based centralised architectures. These architectures introduce latency and scalability challenges, particularly for resource-constrained IoT devices that often struggle to communicate with the cloud due to limited resources. This paper addresses network interoperability at the IoT edge level, focusing on resource-efficient communication by integrating Wi-Fi and Bluetooth, two commonly used protocols in IoT ecosystems. We have implemented a network edge interoperability solution that supports effective data exchange between devices operating on these distinct protocols, enhancing the overall efficiency, flexibility, and scalability of IoT systems. Our approach allows devices interoperate by addressing network latency and bandwidth limitations, incorporating an integrated controller to facilitate broader applications and enhance performance across IoT networks. Our findings illustrate how bridging protocol differences can foster more resilient and adaptable IoT solutions, advancing the deployment of IoT applications across various domains and use cases.
|
|||||||
| 2024 |
Cader JMA, Newton MAH, Rahman J, Cader AJMA, Sattar A, 'Ensembling methods for protein-ligand binding affinity prediction', SCIENTIFIC REPORTS, 14 (2024) [C1]
Protein-ligand binding affinity prediction is a key element of computer-aided drug discovery. Most of the existing deep learning methods for protein-ligand binding affi... [more] Protein-ligand binding affinity prediction is a key element of computer-aided drug discovery. Most of the existing deep learning methods for protein-ligand binding affinity prediction utilize single models and suffer from low accuracy and generalization capability. In this paper, we train 13 deep learning models from combinations of 5 input features. Then, we explore all possible ensembles of the trained models to find the best ensembles. Our deep learning models use cross-attention and self-attention layers to extract short and long-range interactions. Our method is named Ensemble Binding Affinity (EBA). EBA extracts information from various models using different combinations of input features, such as simple 1D sequential and structural features of the protein-ligand complexes rather than 3D complex features. EBA is implemented to accurately predict the binding affinity of a protein-ligand complex. One of our ensembles achieves the highest Pearson correlation coefficient (R) value of 0.914 and the lowest root mean square error (RMSE) value of 0.957 on the well-known benchmark test set CASF2016. Our ensembles show significant improvements of more than 15% in R-value and 19% in RMSE on both well-known benchmark CSAR-HiQ test sets over the second-best predictor named CAPLA. Furthermore, the superior performance of the ensembles across all metrics compared to existing state-of-the-art protein-ligand binding affinity prediction methods on all five benchmark test datasets demonstrates the effectiveness and robustness of our approach. Therefore, our approach to improving binding affinity prediction between proteins and ligands can contribute to improving the success rate of potential drugs and accelerate the drug development process.
|
|||||||
| 2024 |
Rahman J, Newton MAH, Ali ME, Sattar A, 'Distance plus attention for binding affinity prediction', JOURNAL OF CHEMINFORMATICS, 16 (2024) [C1]
Protein-ligand binding affinity plays a pivotal role in drug development, particularly in identifying potential ligands for target disease-related proteins. Accurate af... [more] Protein-ligand binding affinity plays a pivotal role in drug development, particularly in identifying potential ligands for target disease-related proteins. Accurate affinity predictions can significantly reduce both the time and cost involved in drug development. However, highly precise affinity prediction remains a research challenge. A key to improve affinity prediction is to capture interactions between proteins and ligands effectively. Existing deep-learning-based computational approaches use 3D grids, 4D tensors, molecular graphs, or proximity-based adjacency matrices, which are either resource-intensive or do not directly represent potential interactions. In this paper, we propose atomic-level distance features and attention mechanisms to capture better specific protein-ligand interactions based on donor-acceptor relations, hydrophobicity, and p-stacking atoms. We argue that distances encompass both short-range direct and long-range indirect interaction effects while attention mechanisms capture levels of interaction effects. On the very well-known CASF-2016 dataset, our proposed method, named Distance plus Attention for Affinity Prediction (DAAP), significantly outperforms existing methods by achieving Correlation Coefficient (R) 0.909, Root Mean Squared Error (RMSE) 0.987, Mean Absolute Error (MAE) 0.745, Standard Deviation (SD) 0.988, and Concordance Index (CI) 0.876. The proposed method also shows substantial improvement, around 2% to 37%, on five other benchmark datasets. The program and data are publicly available on the website https://gitlab.com/mahnewton/daap. Scientific Contribution Statement This study innovatively introducesdistance-based features to predict protein-ligand binding affinity, capitalizing onunique molecular interactions. Furthermore, the incorporation of protein sequencefeatures of specific residues enhances the model's proficiency in capturing intricatebinding patterns. The predictive capabilities are further strengthened through theuse of a deep learning architecture with attention mechanisms, and an ensembleapproach, averaging the outputs of five models, is implemented to ensure robustand reliable predictions.
|
Open Research Newcastle | ||||||
| 2024 |
Ben Islam MK, Newton MAH, Trevathan J, Sattar A, 'Lite approaches for long-range multi-step water quality prediction', STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 38, 3755-3770 (2024) [C1]
Forecasting accurate water quality is very important in aquaculture, environment monitoring, and many other applications. Many internal and external factors influence w... [more] Forecasting accurate water quality is very important in aquaculture, environment monitoring, and many other applications. Many internal and external factors influence water quality. Therefore, water quality parameters exhibit complex time series characteristics. Consequently, long-range accurate prediction of water quality parameters suffers from poor propagation of information from past timepoints to further future timepoints. Moreover, to synchronise the prediction model with the changes in the time series characteristics, periodic retraining of the prediction model is required and such retraining is to be done on resource-restricted computation devices. In this work, we present a low-cost training approach to improve long-range multi-step water quality prediction. We train a short-range predictor to save training effort. Then, we strive to achieve and/or improve long-range prediction using multi-step iterative ensembling during inference. Experimental results on 9 water quality datasets demonstrate that the proposed method achieves significantly lower error than the existing state-of-the-art approaches. Our approach significantly outperforms the existing approaches in several standard metrics, even in the case of future timepoints at long distances.
|
Open Research Newcastle | ||||||
| 2024 |
Nikafshan Rad H, Su Z, Trinh A, Hakim Newton MA, Shamsani J, NYGC ALS Consortium , Karim A, Sattar A, 'Amyotrophic lateral sclerosis diagnosis using machine learning and multi-omic data integration', Heliyon, 10 (2024) [C1]
Amyotrophic Lateral Sclerosis (ALS) is a complex and rare neurodegenerative disorder characterized by significant genetic, molecular, and clinical heterogeneity. Despit... [more] Amyotrophic Lateral Sclerosis (ALS) is a complex and rare neurodegenerative disorder characterized by significant genetic, molecular, and clinical heterogeneity. Despite numerous endeavors to discover the genetic factors underlying ALS, a significant number of these factors remain unknown. This knowledge gap highlights the necessity for personalized medicine approaches that can provide more comprehensive information for the purposes of diagnosis, prognosis, and treatment of ALS. This work utilizes an innovative approach by employing a machine learning-facilitated, multi-omic model to develop a more comprehensive knowledge of ALS. Through unsupervised clustering on gene expression profiles, 9,847 genes associated with ALS pathways are isolated and integrated with 7,699 genes containing rare, presumed pathogenic genomic variants, leading to a comprehensive amalgamation of 17,546 genes. Subsequently, a Variational Autoencoder is applied to distil complex biomedical information from these genes, culminating in the creation of the proposed Multi-Omics for ALS (MOALS) model, which has been designed to expose intricate genotype-phenotype interconnections within the dataset. Our meticulous investigation elucidates several pivotal ALS signaling pathways and demonstrates that MOALS is a superior model, outclassing other machine learning models based on single omic approaches such as SNV and RNA expression, enhancing accuracy by 1.7 percent and 6.2 percent, respectively. The findings of this study suggest that analyzing the relationships within biological systems can provide heuristic insights into the biological mechanisms that help to make highly accurate ALS diagnosis tools and achieve more interpretable results.
|
Open Research Newcastle | ||||||
| 2024 |
Azad T, Newton MAH, Trevathan J, Sattar A, 'Hierarchical Decentralized Edge Interoperability', IEEE INTERNET OF THINGS JOURNAL, 11, 13948-13960 (2024) [C1]
The Internet of Things (IoT) has many important applications in multiple domains that include home automation, smart cities, healthcare, agriculture, and environment. I... [more] The Internet of Things (IoT) has many important applications in multiple domains that include home automation, smart cities, healthcare, agriculture, and environment. IoT comprises a wide range of sensors and actuators that communicate with each other over cloud, fog, and edge level networks. Moreover, these devices use various communication protocols and are made by different manufactures. To deal with these diversities, IoT essentially needs interoperable communication interfaces among devices. Unfortunately, existing interoperability solutions are centralized and use fog or cloud level computing resources, making IoT communications latency-prone and poorly scalable. These issues could be handled effectively, if edge level devices could be made interoperable within the edge level and without needing fog or cloud level access. This article proposes a decentralized interoperability solution that stays fully within the edge level. The solution relies on controller devices that work on the interface boundaries of the edge devices. Unlike existing solutions, the proposed solution adopts a hierarchical interoperability model to handle interoperability at network, syntactical, semantic, and organizational levels. Our solution is nonproprietary, generic over vendors and platforms, and easily extendable to new devices. We compare our proposed solution with existing interoperability solutions for edge devices and show its mobility, efficiency, and flexibility.
|
Open Research Newcastle | ||||||
| 2023 |
Rahman J, Newton MAH, Hasan MAM, Sattar A, 'Real-to-bin conversion for protein residue distances', COMPUTATIONAL BIOLOGY AND CHEMISTRY, 104 (2023) [C1]
Protein Structure Prediction (PSP) has achieved significant progress lately. Prediction of inter-residue distances by machine learning and their exploitation during the... [more] Protein Structure Prediction (PSP) has achieved significant progress lately. Prediction of inter-residue distances by machine learning and their exploitation during the conformational search is largely among the critical factors behind the progress. Real values than bin probabilities could more naturally represent inter-residue distances, while the latter, via spline curves more naturally helps obtain differentiable objective functions than the former. Consequently, PSP methods that exploit predicted binned distances perform better than those that exploit predicted real-valued distances. To leverage the advantage of bin probabilities in getting differentiable objective functions, in this work, we propose techniques to convert real-valued distances into distance bin probabilities. Using standard benchmark proteins, we then show that our real-to-bin converted distances help PSP methods obtain three-dimensional structures with 4%¿16% better root mean squared deviation (RMSD), template modeling score (TM-Score), and global distance test (GDT) values than existing similar PSP methods. Our proposed PSP method is named real to bin (R2B) inter-residue distance predictor, and its code is available from https://gitlab.com/mahnewton/r2b.
|
Open Research Newcastle | ||||||
| 2023 |
Mufassirin MMM, Newton MAH, Rahman J, Sattar A, 'Multi-S3P: Protein Secondary Structure Prediction With Specialized Multi-Network and Self-Attention-Based Deep Learning Model', IEEE ACCESS, 11, 57083-57096 (2023) [C1]
Protein structure prediction (PSP) is a vital challenge in bioinformatics, structural biology and drug discovery. Protein secondary structure (SS) prediction is critica... [more] Protein structure prediction (PSP) is a vital challenge in bioinformatics, structural biology and drug discovery. Protein secondary structure (SS) prediction is critical since three-dimensional (3D) structures are primarily made up of secondary structures. With the advancement of deep learning approaches, SS classification accuracy has been significantly improved. Many existing methods use an ensemble of complex neural networks to improve SS prediction. Because of the high dimensionality of the hyperparameter space, deep neural networks with complex architectures are typically challenging to train effectively. Also, predicting secondary structures in the boundary regions between different types of SS is challenging. This study presents Multi-S3P, which employs bidirectional Long-Short-Term-Memory (BILSTM) and Convolutional Neural Networks (CNN) with a self-attention mechanism to improve the secondary structure prediction using an effective training strategy to capture the unique characteristics of each type of secondary structure and combine them more effectively. The ensemble of CNN and BILSTM can learn both contextual information and long-range interactions between the residues. In addition, using a self-attention mechanism allows the model to focus on the most important features for improving performance. We used the SPOT-1D dataset for the training and validation of our model using a set of four input features derived from amino acid sequences. Further, the model was tested on four popular independent test datasets and compared with various state-of-the-art predictors. The presented results show that Multi-S3P outperformed the other methods in terms of Q3, Q8 accuracy and other performance metrics, achieving the highest Q3 accuracy of 87.57% and a Q8 accuracy of 77.56% on the TEST2016 test set. More importantly, Multi-S3P demonstrates high performance in SS boundary regions. Our experiment also demonstrates that the combination of different input features and a multi-network-based training strategy significantly improved the performance.
|
Open Research Newcastle | ||||||
| 2023 |
Namazi M, Newton MAH, Sanderson C, Sattar A, 'Solving travelling thief problems using coordination based methods', JOURNAL OF HEURISTICS [C1]
|
Open Research Newcastle | ||||||
| 2023 |
Mufassirin MMM, Newton MAH, Sattar A, 'Artificial intelligence for template-free protein structure prediction: a comprehensive review', ARTIFICIAL INTELLIGENCE REVIEW, 56, 7665-7732 (2023) [C1]
|
Open Research Newcastle | ||||||
| 2022 |
Saha S, Sarker PS, Al Saud A, Shatabda S, Newton MAH, 'Cluster-oriented instance selection for classification problems', INFORMATION SCIENCES, 602, 143-158 (2022) [C1]
More training instances could lead to better classification accuracy. However, accuracy could also degrade if more training instances mean further noises and outliers. ... [more] More training instances could lead to better classification accuracy. However, accuracy could also degrade if more training instances mean further noises and outliers. Additional training instances arguably need additional computational resources in future data mining operations. Instance selection algorithms identify subsets of training instances that could desirably increase accuracy or at least do not decrease accuracy significantly. There exist many instance selection algorithms, but no single algorithm, in general, dominates the others. Moreover, existing instance selection algorithms do not allow direct controlling of the instance selection rate. In this paper, we present a simple and generic cluster-oriented instance selection algorithm for classification problems. Our proposed algorithm runs an unsupervised K Means Clustering algorithm on the training instances and with a given selection rate, selects instances from the centers and the borders of the clusters. On 24 benchmark classification problems, when very similar percentages of instances are selected by various instance selection algorithms, K Nearest Neighbours classifiers achieve more than 2%¿3% better accuracy when using instances selected by our proposed method than when using those selected by other state-of-the-art generic instance selection algorithms.
|
Open Research Newcastle | ||||||
| 2022 |
Rahman J, Newton MAH, Hasan MAM, Sattar A, 'A stacked meta-ensemble for protein inter-residue distance prediction', COMPUTERS IN BIOLOGY AND MEDICINE, 148 (2022) [C1]
Predicted inter-residue distances are a key behind recent success in high quality protein structure prediction (PSP). However, prediction of both short and long distanc... [more] Predicted inter-residue distances are a key behind recent success in high quality protein structure prediction (PSP). However, prediction of both short and long distance values together is challenging. Consequently, predicted short distances are mostly used by existing PSP methods. In this paper, we use a stacked meta-ensemble method to combine deep learning models trained for different ranges of real-valued distances. On five benchmark sets of proteins, our proposed inter-residue distance prediction method improves mean Local Distance Different Test (LDDT) scores at least by 5% over existing such methods. Moreover, using a real-valued distance based conformational search algorithm, we also show that predicted long distances help obtain significantly better protein conformations than when only predicted short distances are used. Our method is named meta-ensemble for distance prediction (MDP) and its program is available from https://gitlab.com/mahnewton/mdp.
|
Open Research Newcastle | ||||||
| 2022 |
Ben Islam MK, Newton MAH, Rahman J, Trevathan J, Sattar A, 'Long range multi-step water quality forecasting using iterative ensembling', ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 114 (2022) [C1]
Real-life water quality monitoring applications such as aquaculture domains and water resource management need long range multi-step prediction for disaster control. Ho... [more] Real-life water quality monitoring applications such as aquaculture domains and water resource management need long range multi-step prediction for disaster control. However, prediction accuracy usually degrades gradually as the prediction target timepoint is further away from the current timepoint. To address this, recent water quality forecasting methods mostly rely on complex deep learning models. In this paper, we propose a simple time-variant iterative ensembling method that strives to significantly improve the performance of a given arbitrary long range multi-step time series predictor for water quality data with minimal increase in computational cost. With the given predictor, our proposed method iteratively uses ensembles of predicted values for preceding steps to improve the prediction accuracy for the succeeding steps. The iterative ensembling operation is performed on the trained model and only at the inference stage, and so does not need any further computing-intensive training for the performance improvement. We experimentally show that the proposed method is effective with 7 predictors and 9 water quality datasets of various types, and it outperforms the state-of-the-art results in those datasets by around 2%¿29% in mean absolute error (MAE), root mean squared error (RMSE) and mean absolute percentage error (MAPE) metrics. Similar improvement has also been found in two other metrics such as normalized Nash¿Sutcliffe model efficiency coefficient (NNSE) metric and Taylor diagram plot. Overall, the proposed iterative ensembling is a promising approach for multi-step long range water quality prediction for high-frequency water quality monitoring systems.
|
Open Research Newcastle | ||||||
| 2022 |
Newton MAH, Zaman R, Mataeimoghadam F, Rahman J, Sattar A, 'Constraint Guided Beta-Sheet Refinement for Protein Structure Prediction', COMPUTATIONAL BIOLOGY AND CHEMISTRY, 101 (2022) [C1]
|
Open Research Newcastle | ||||||
| 2022 |
Newton MAH, Mataeimoghadam F, Zaman R, Sattar A, 'Secondary structure specific simpler prediction models for protein backbone angles', BMC BIOINFORMATICS, 23 (2022) [C1]
|
|||||||
| 2022 |
Rahman J, Newton MAH, Ben Islam MK, Sattar A, 'Enhancing protein inter-residue real distance prediction by scrutinising deep learning models', SCIENTIFIC REPORTS, 12 (2022) [C1]
|
|||||||
| 2022 |
Newton MAH, Rahman J, Zaman R, Sattar A, 'Enhancing protein contact map prediction accuracy via ensembles of inter-residue distance predictors', COMPUTATIONAL BIOLOGY AND CHEMISTRY, 99 (2022) [C1]
Protein contact maps capture coevolutionary interactions between amino acid residue pairs that are spatially within certain proximity threshold. Predicted contact maps ... [more] Protein contact maps capture coevolutionary interactions between amino acid residue pairs that are spatially within certain proximity threshold. Predicted contact maps are used in many protein related problems that include drug design, protein design, protein function prediction, and protein structure prediction. Contact map prediction has achieved significant progress lately but still further challenges remain with prediction of contacts between residues that are separated in the amino acid residue sequence by large numbers of other residues. In this paper, with experimental results on 5 standard benchmark datasets that include membrane proteins, we show that contact map prediction could be significantly enhanced by using ensembles of various state-of-the-art short distance predictors and then by converting predicted distances into contact probabilities. Our program along with its data is available from https://gitlab.com/mahnewton/ecp.
|
Open Research Newcastle | ||||||
| 2022 |
Zaman R, Newton MAH, Mataeimoghadam F, Sattar A, 'Constraint Guided Neighbor Generation for Protein Structure Prediction', IEEE ACCESS, 10, 54991-55001 (2022) [C1]
|
Open Research Newcastle | ||||||
| 2021 |
Karim A, Riahi V, Mishra A, Newton MAH, Dehzangi A, Balle T, Sattar A, 'Quantitative Toxicity Prediction via Meta Ensembling of Multitask Deep Learning Models', ACS OMEGA, 6, 12306-12317 (2021) [C1]
Toxicity prediction using quantitative structure-activity relationship has achieved significant progress in recent years. However, most existing machine learning method... [more] Toxicity prediction using quantitative structure-activity relationship has achieved significant progress in recent years. However, most existing machine learning methods in toxicity prediction utilize only one type of feature representation and one type of neural network, which essentially restricts their performance. Moreover, methods that use more than one type of feature representation struggle with the aggregation of information captured within the features since they use predetermined aggregation formulas. In this paper, we propose a deep learning framework for quantitative toxicity prediction using five individual base deep learning models and their own base feature representations. We then propose to adopt a meta ensemble approach using another separate deep learning model to perform aggregation of the outputs of the individual base deep learning models. We train our deep learning models in a weighted multitask fashion combining four quantitative toxicity data sets of LD50, IGC50, LC50, and LC50-DM and minimizing the root-mean-square errors. Compared to the current state-of-the-art toxicity prediction method TopTox on LD50, IGC50, and LC50-DM, that is, three out of four data sets, our method, respectively, obtains 5.46, 16.67, and 6.34% better root-mean-square errors, 6.41, 11.80, and 12.16% better mean absolute errors, and 5.21, 7.36, and 2.54% better coefficients of determination. We named our method QuantitativeTox, and our implementation is available from the GitHub repository https://github.com/Abdulk084/QuantitativeTox.
|
|||||||
| 2021 |
Newton MAH, Polash MMA, Pham DN, Thornton J, Su K, Sattar A, 'Evaluating logic gate constraints in local search for structured satisfiability problems', ARTIFICIAL INTELLIGENCE REVIEW, 54, 5347-5411 (2021) [C1]
|
|||||||
| 2021 |
Riahi V, Newton MAH, Sattar A, 'A scatter search algorithm for time-dependent prize-collecting arc routing problems', COMPUTERS & OPERATIONS RESEARCH, 134 (2021) [C1]
|
|||||||
| 2021 |
Riahi V, Newton MAH, Sattar A, 'Constraint based local search for flowshops with sequence-dependent setup times', ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 102 (2021) [C1]
|
Open Research Newcastle | ||||||
| 2020 |
Mataeimoghadam F, Newton MAH, Dehzangi A, Karim A, Jayaram B, Ranganathan S, Sattar A, 'Enhancing protein backbone angle prediction by using simpler models of deep neural networks', SCIENTIFIC REPORTS, 10 (2020) [C1]
|
|||||||
| 2019 |
Riahi V, Newton MAH, Su K, Sattar A, 'Constraint guided accelerated search for mixed blocking permutation flowshop scheduling', Computers and Operations Research, 102 102-120 (2019) [C1]
Mixed Blocking Permutation Flowshop Scheduling Problem (MBPFSP) with the objective of makespan minimisation is NP-Hard. It has important industrial applications that in... [more] Mixed Blocking Permutation Flowshop Scheduling Problem (MBPFSP) with the objective of makespan minimisation is NP-Hard. It has important industrial applications that include the cider production industry. MBPFSP has made some progress in recent years. However, within practical time limits, existing incomplete algorithms still either find low quality solutions or struggle with large problems. One key reason behind this is the typical way of using generic heuristics or metaheuristics that usually lack problem specific structural knowledge. In MBPFSP, a machine could be blocked with the currently finished job until the subsequent machine is available to process the same job. These blocking constraints affect the makespan. So MBPFSP search should naturally take explicit steps to take the blocking constraints into account. Unfortunately, existing research on MBPFSP just uses only the makespan to compare generated solutions, but the search otherwise is not aware of the blocking constraints. Moreover, existing such methods use either an exhaustive or a random neighbourhood generation strategy. In this work, we aim to advance MBPFSP search by better exploiting the problem specific structural knowledge. We use the constraint and the objective functions to obtain such problem specific knowledge and we exploit such knowledge both in a constructive search method and in a local search method. In this paper, we also present an acceleration method to efficiently evaluate insertion-based neighbourhoods of MBPFSP. Our experimental results on three standard testbeds demonstrate that our proposed algorithms significantly improve over existing best-performing algorithms.
|
|||||||
| 2019 |
Riahi V, Newton MAH, Polash MMA, Su K, Sattar A, 'Constraint guided search for aircraft sequencing', EXPERT SYSTEMS WITH APPLICATIONS, 118, 440-458 (2019) [C1]
Aircraft sequencing problem (ASP) is an NP-Hard problem. It involves allocation of aircraft to runways for landing and takeoff, minimising total tardiness. ASP has made... [more] Aircraft sequencing problem (ASP) is an NP-Hard problem. It involves allocation of aircraft to runways for landing and takeoff, minimising total tardiness. ASP has made significant progress in recent years. However, within practical time limits, existing incomplete algorithms still either find low quality solutions or struggle with large problems. One key reason behind this is the typical way of using generic heuristics or metaheuristics that usually lack problem specific structural knowledge. As a result, existing such methods use either an exhaustive or a random neighbourhood generation strategy. So their search guidance comes only from the evaluation function that is used mainly after the neighbourhood generation. In this work, we aim to advance ASP search by better exploiting the problem specific structural knowledge. We use the constraint and the objective functions to obtain such problem specific knowledge and we exploit such knowledge both in a constructive search method and in a local search method. Our motivation comes from the constraint optimisation paradigm in artificial intelligence, where instead of random decisions, constraint-guided more informed optimisation decisions are of particular interest. We run our experiments on a range of standard benchmark problem instances that include instances from real airports and instances crafted using real airport parameters, and contain scenarios involving multiple runways and both landing and takeoff operations. We show that our proposed algorithms significantly outperform existing state-of-the-art aircraft sequencing algorithms.
|
|||||||
| 2019 |
Karim A, Mishra A, Newton MAH, Sattar A, 'Efficient Toxicity Prediction via Simple Features Using Shallow Neural Networks and Decision Trees', ACS OMEGA, 4, 1874-1888 (2019) [C1]
Toxicity prediction of chemical compounds is a grand challenge. Lately, it achieved significant progress in accuracy but using a huge set of features, implementing a co... [more] Toxicity prediction of chemical compounds is a grand challenge. Lately, it achieved significant progress in accuracy but using a huge set of features, implementing a complex blackbox technique such as a deep neural network, and exploiting enormous computational resources. In this paper, we strongly argue for the models and methods that are simple in machine learning characteristics, efficient in computing resource usage, and powerful to achieve very high accuracy levels. To demonstrate this, we develop a single task-based chemical toxicity prediction framework using only 2D features that are less compute intensive. We effectively use a decision tree to obtain an optimum number of features from a collection of thousands of them. We use a shallow neural network and jointly optimize it with decision tree taking both network parameters and input features into account. Our model needs only a minute on a single CPU for its training while existing methods using deep neural networks need about 10 min on NVidia Tesla K40 GPU. However, we obtain similar or better performance on several toxicity benchmark tasks. We also develop a cumulative feature ranking method which enables us to identify features that can help chemists perform prescreening of toxic compounds effectively.
|
|||||||
| 2019 |
Riahi V, Newton MAH, Polash MMA, Sattar A, 'Tailoring customer order scheduling search algorithms', COMPUTERS & OPERATIONS RESEARCH, 108, 155-165 (2019) [C1]
|
|||||||
| 2019 |
Newton MAH, Riahi V, Su K, Sattar A, 'Scheduling blocking flowshops with setup times via constraint guided and accelerated local search', COMPUTERS & OPERATIONS RESEARCH, 109, 64-76 (2019) [C1]
|
|||||||
| 2019 |
Newton MAH, Riahi V, Sattar A, 'Makespan preserving flowshop reengineering via blocking constraints', COMPUTERS & OPERATIONS RESEARCH, 112 (2019) [C1]
|
|||||||
| 2017 |
Riahi V, Khorramizadeh M, Newton MAH, Sattar A, 'Scatter search for mixed blocking flowshop scheduling', EXPERT SYSTEMS WITH APPLICATIONS, 79, 20-32 (2017) [C1]
Empty or limited storage capacities between machines introduce various types of blocking constraint in the industries with flowshop environment. While large application... [more] Empty or limited storage capacities between machines introduce various types of blocking constraint in the industries with flowshop environment. While large applications demand flowshop scheduling with a mix of different types of blocking, research in this area mainly focuses on using only one kind of blocking in a given problem instance. In this paper, using makespan as a criterion, we study permutation flowshops with zero capacity buffers operating under mixed blocking conditions. We present a very effective scatter search (SS) algorithm for this. At the initialisation phase of SS, we use a modified version of the well-known Nawaz, Enscore and Ham (NEH) heuristic. For the improvement method in SS, we use an Iterated Local Search (ILS) algorithm that adopts a greedy job selection and a powerful NEH-based perturbation procedure. Moreover, in the reference set update phase of SS, with small probabilities, we accept worse solutions so as to increase the search diversity. On standard benchmark problems of varying sizes, our algorithm very significantly outperforms well-known existing algorithms in terms of both the solution quality and the computing time. Moreover, our algorithm has found new upper bounds for 314 out of 360 benchmark problem instances.
|
|||||||
| 2017 |
Polash MMA, Newton MAH, Sattar A, 'Constraint-directed search for all-interval series', Constraints, 22 403-431 (2017) [C1]
All-interval series is a standard benchmark problem for constraint satisfaction search. An all-interval series of size n is a permutation of integers [0, n) such that t... [more] All-interval series is a standard benchmark problem for constraint satisfaction search. An all-interval series of size n is a permutation of integers [0, n) such that the differences between adjacent integers are a permutation of [1, n). Generating each such all-interval series of size n is an interesting challenge for constraint community. The problem is very difficult in terms of the size of the search space. Different approaches have been used to date to generate all the solutions of AIS but the search space that must be explored still remains huge. In this paper, we present a constraint-directed backtracking-based tree search algorithm that performs efficient lazy checking rather than immediate constraint propagation. Moreover, we prove several key properties of all-interval series that help prune the search space significantly. The reduced search space essentially results into fewer backtracking. We also present scalable parallel versions of our algorithm that can exploit the advantage of having multi-core processors and even multiple computer systems. Our new algorithm generates all the solutions of size up to 27 while a satisfiability-based state-of-the-art approach generates all solutions up to size 24.
|
|||||||
| 2017 |
Ghooshchi NG, Namazi M, Newton MAH, Sattar A, 'Encoding domain transitions for constraint-based Planning', Journal of Artificial Intelligence Research, 58 905-966 (2017) [C1]
We describe a constraint-based automated planner named Transition Constraints for Parallel Planning (TCPP). TCPP constructs its constraint model from a redefined versio... [more] We describe a constraint-based automated planner named Transition Constraints for Parallel Planning (TCPP). TCPP constructs its constraint model from a redefined version of the domain transition graphs (DTG) of a given planning problem. TCPP encodes state transitions in the redefined DTGs by using table constraints with cells containing don't cares or wild cards. TCPP uses Minion the constraint solver to solve the constraint model and returns a parallel plan. We empirically compare TCPP with the other state-of-theart constraint-based parallel planner PaP2. PaP2 encodes action successions in the finite state automata (FSA) as table constraints with cells containing sets of values. PaP2 uses SICStus Prolog as its constraint solver. We also improve PaP2 by using don't cares and mutex constraints. Our experiments on a number of standard classical planning benchmark domains demonstrate TCPP's efficiency over the original PaP2 running on SICStus Prolog and our reconstructed and enhanced versions of PaP2 running on Minion.
|
|||||||
| 2017 |
Polash MMA, Newton MAH, Sattar A, 'Constraint-based search for optimal Golomb rulers', JOURNAL OF HEURISTICS, 23, 501-532 (2017) [C1]
|
|||||||
| Show 39 more journal articles | ||||||||
Grants and Funding
Summary
| Number of grants | 7 |
|---|---|
| Total funding | $82,430 |
Click on a grant title below to expand the full details for that specific grant.
20251 grants / $5,000
Global Experience Support Funding$5,000
Funding body: University of Newcastle Global Engagement and Partnerships (UON Global)
| Funding body | University of Newcastle Global Engagement and Partnerships (UON Global) |
|---|---|
| Project Team | M A Hakim Newton, Marcella Papini, Kyle Harrison, Saiful Islam, Nasimul Noman, Rukshan Athauda, Alex Mendes, Karen Blackmore |
| Scheme | Global Experience Support Fund |
| Role | Lead |
| Funding Start | 2025 |
| Funding Finish | 2025 |
| GNo | |
| Type Of Funding | Internal |
| Category | INTE |
| UON | N |
20243 grants / $63,011
Next-Level Predictions: Transforming Tabular Data for High- Dimensional Precision$52,500
Funding body: Honeysuckle Health Pty Limited
| Funding body | Honeysuckle Health Pty Limited |
|---|---|
| Project Team | Doctor Saiful Islam, Dr Ryan Gallagher, Mrs Venkata Kadiyala, Doctor Mahakim Newton |
| Scheme | PhD Scholarship |
| Role | Investigator |
| Funding Start | 2024 |
| Funding Finish | 2027 |
| GNo | G2401394 |
| Type Of Funding | C3100 – Aust For Profit |
| Category | 3100 |
| UON | Y |
Global Experience Support Funding$9,000
Funding body: University of Newcastle Global Engagement and Partnerships (UON Global)
| Funding body | University of Newcastle Global Engagement and Partnerships (UON Global) |
|---|---|
| Project Team | M A Hakim Newton, Marcella Papini, Weijia Zhang, Saiful Islam, Alexandre Mendes, Karen Blackmore |
| Scheme | Global Experience Support Fund |
| Role | Lead |
| Funding Start | 2024 |
| Funding Finish | 2024 |
| GNo | |
| Type Of Funding | Internal |
| Category | INTE |
| UON | N |
Course Development Funding$1,511
Funding body: College of Engineering, Science and Environment (CESE), University of Newcastle
| Funding body | College of Engineering, Science and Environment (CESE), University of Newcastle |
|---|---|
| Project Team | M A Hakim Newton |
| Scheme | College of Engineering, Science, & Environment (CESE) Course Development Funding |
| Role | Lead |
| Funding Start | 2024 |
| Funding Finish | 2024 |
| GNo | |
| Type Of Funding | Internal |
| Category | INTE |
| UON | N |
20223 grants / $14,420
Start-up grant$10,000
Funding body: College of Engineering, Science & Environment (CESE) Start-up Funding
| Funding body | College of Engineering, Science & Environment (CESE) Start-up Funding |
|---|---|
| Project Team | M A Hakim Newton |
| Scheme | College of Engineering, Science & Environment (CESE) Start-up Funding |
| Role | Lead |
| Funding Start | 2022 |
| Funding Finish | 2022 |
| GNo | |
| Type Of Funding | Internal |
| Category | INTE |
| UON | N |
SIPS Funding$3,000
Funding body: School of Information and Physical Sciences (SIPS) Funding
| Funding body | School of Information and Physical Sciences (SIPS) Funding |
|---|---|
| Project Team | M A Hakim Newton |
| Scheme | School of Information and Physical Sciences (SIPS) Funding |
| Role | Lead |
| Funding Start | 2022 |
| Funding Finish | 2022 |
| GNo | |
| Type Of Funding | Internal |
| Category | INTE |
| UON | N |
Course Development Funding$1,420
Funding body: College of Engineering, Science and Environment (CESE), University of Newcastle
| Funding body | College of Engineering, Science and Environment (CESE), University of Newcastle |
|---|---|
| Project Team | M A Hakim Newton |
| Scheme | College of Engineering, Science, & Environment (CESE) Course Development Funding |
| Role | Lead |
| Funding Start | 2022 |
| Funding Finish | 2022 |
| GNo | |
| Type Of Funding | Internal |
| Category | INTE |
| UON | N |
Research Supervision
Number of supervisions
Current Supervision
| Commenced | Level of Study | Research Title | Program | Supervisor Type |
|---|---|---|---|---|
| 2025 | PhD | AI-guided phenomic and genomic prediction for yield performance in macadamia | Computer Science, Griffith University | Co-Supervisor |
| 2025 | PhD | HSC Course Prediction using AI Methods | Education, School of Education, The University of Newcastle | Co-Supervisor |
| 2025 | PhD | HSC Course Prediction Model | PhD (Education), College of Human and Social Futures, The University of Newcastle | Co-Supervisor |
| 2024 | PhD | Ethical AI for Enhanced Health Insurance Risk Assessment and Decision-Making | PhD (Computer Science), College of Engineering, Science and Environment, The University of Newcastle | Co-Supervisor |
| 2024 | PhD | Explainable Machine Learning for Cyber Security | Computer Science, Griffith University | Co-Supervisor |
| 2024 | PhD | Deep Learning for Water Quality Monitoring | Computer Science, Griffith University | Co-Supervisor |
| 2024 | PhD | Next-Level Predictions: Transforming Tabular Data for High-Dimensional Precision | PhD (Computer Science), College of Engineering, Science and Environment, The University of Newcastle | Co-Supervisor |
| 2023 | PhD | Deep Transformation of Tabular Categorical Data for Enhanced Predictive Analytics in Health Informatics | PhD (Computer Science), College of Engineering, Science and Environment, The University of Newcastle | Co-Supervisor |
| 2022 | PhD | Deep Learning for Protein-Ligand Affinity Prediction | Computer Science, Griffith University | Co-Supervisor |
| 2021 | PhD | Deep Learning Approaches for Motor Neuron Diseases | Computer Science, Griffith University | Consultant Supervisor |
Past Supervision
| Year | Level of Study | Research Title | Program | Supervisor Type |
|---|---|---|---|---|
| 2025 | PhD | Deep Learning Methods for Secondary Protein Structure Prediction | Computer Science, Griffith University | Principal Supervisor |
| 2024 | PhD | Simplified Deep Learning Models for Protein Backbone Angle Prediction | Computer Science, Griffith University | Co-Supervisor |
| 2024 | PhD | Distance Maps for Prediction of Protein Structures and Protein-ligand Affinity | Computer Science, Griffith University | Principal Supervisor |
| 2023 | PhD | Constraint Guided Local Search for Protein Structure Prediction | Computer Science, Griffith University | Co-Supervisor |
| 2022 | PhD | Learning in Combinatorial Constraint Optimisation | Computer Science, Griffith University | Co-Supervisor |
| 2020 | PhD | Constraint-based Automated Planning and Business Process Modelling | Computer Science, Griffith University | Principal Supervisor |
| 2020 | PhD | Molecular toxicity prediction using deep learning | Computer Science, Griffith University | Co-Supervisor |
| 2019 | PhD | Constraint Directed Scheduling | Computer Science, Griffith University | Co-Supervisor |
| 2017 | PhD | Exploiting Structures in Combinatorial Search | Computer Science, Griffith University | Co-Supervisor |
| 2014 | PhD | Local Search Heuristics for Protein Structure Prediction | Computer Science, Griffith University | Co-Supervisor |
Research Opportunities
Intelligent Search and Optimisation
Developing intelligent algorithms for search and optimisation: planning, scheduling, constraints, satisfiability problems
PHD
School of Information and Physical Sciences
1/1/2024 - 31/12/2030
Contact
Doctor Mahakim Newton
University of Newcastle
School of Information and Physical Sciences
mahakim.newton@newcastle.edu.au
Constraint-Based Explainable Machine Learning
Using constraints in achieving explainability in machine learning.
PHD
School of Information and Physical Sciences
1/1/2024 - 31/12/2030
Contact
Doctor Mahakim Newton
University of Newcastle
School of Information and Physical Sciences
mahakim.newton@newcastle.edu.au
Artificial Intelligence for Environment
Using internet of things, machine learning and search and optimisation for the environment, agriculture, and disaster management
PHD
School of Information and Physical Sciences
1/1/2024 - 31/12/2030
Contact
Doctor Mahakim Newton
University of Newcastle
School of Information and Physical Sciences
mahakim.newton@newcastle.edu.au
Computational Drug Discovery
Using machine learning and search and optimisation in various stages of computational drug discovery
PHD
School of Information and Physical Sciences
1/1/2024 - 31/12/2030
Contact
Doctor Mahakim Newton
University of Newcastle
School of Information and Physical Sciences
mahakim.newton@newcastle.edu.au
Artificial Intelligence for Health
Using big data, machine learning and search and optimisation for predictive and prescriptive health informatics
PHD
School of Information and Physical Sciences
1/1/2024 - 31/12/2030
Contact
Doctor Mahakim Newton
University of Newcastle
School of Information and Physical Sciences
mahakim.newton@newcastle.edu.au
Dr Mahakim Newton
Position
Lecturer - Data Science
School of Information and Physical Sciences
College of Engineering, Science and Environment
Focus area
Data Science and Statistics
Contact Details
| mahakim.newton@newcastle.edu.au | |
| Phone | 0249216850 |
Office
| Room | EF124 |
|---|---|
| Building | Engineering F |
| Location | Callaghan Campus University Drive Callaghan, NSW 2308 Australia |
