Dr Mahakim Newton

Dr Mahakim Newton

Lecturer - Data Science

School of Information and Physical Sciences (Data Science and Statistics)

Career Summary

Biography

M A Hakim Newton is a Lecturer in Data Science in the School of Information and Physical Sciences at the University of Newcastle, Australia. He is also an Adjunct Senior Research Fellow with the Institute for Integrated and Intelligent Systems (IIIS), Griffith University, Australia. Newton received his B.Sc.Engg. and M.Sc.Engg. degrees from the Bangladesh University of Engineering and Technology (BUET) and his Ph.D. degree from Strathclyde University, U.K. He was a Research Engineer at the National ICT Australia (NICTA).  His research interests include artificial intelligence, intelligent search, machine learning, data science, bioinformatics, and computer education research.

Qualifications

  • Doctor of Philosophy in Computer and Information Sciences, University of Strathclyde
  • Graduate Certificate in Higher Education, Griffith University

Keywords

  • Artificial Intelligence
  • Big Data Analytics
  • Bioinformatics
  • Computational Biology
  • Computational Chemistry
  • Computational Drug Discovery
  • Computer Science Education
  • Constraints
  • Data Science
  • Decision Trees
  • Deep Learning
  • Environmental Informatics
  • Explainable Algorithms
  • Genomics and Proteomics
  • Health Informatics
  • Internet of Things
  • Life Science
  • Machine Learning
  • Planning
  • Satisfiability
  • Scheduling
  • Search and Optimisation
  • Supply Chain
  • Time Series

Languages

  • English (Fluent)
  • Bengali (Mother)

Fields of Research

Code Description Percentage
460299 Artificial intelligence not elsewhere classified 40
460501 Data engineering and data science 30
310299 Bioinformatics and computational biology not elsewhere classified 30

Professional Experience

UON Appointment

Title Organisation / Department
Lecturer - Data Science University of Newcastle
School of Information and Physical Sciences
Australia

Academic appointment

Dates Title Organisation / Department
1/1/2013 - 25/3/2022 Lecturer/Research Fellow Griffith University
School of Information and Communication Technology
Australia
19/1/2002 - 19/7/2009 Assistance Professor (Computer Science and Engineering) Bangladesh University of Engineering and Technology
Bangladesh
19/1/2000 - 18/1/2003 Lecturer (Computer Science and Engineering) Bangladesh University of Engineering and Technology
Bangladesh

Professional appointment

Dates Title Organisation / Department
21/7/2009 - 31/12/2012 Research Engineer (Search and Optimisation) National ICT Australia
Australia

Teaching

Code Course Role Duration
COMP6900 Computing Project
School of Information and Physical Sciences, The University of Newcastle, Australia
Course Coordinator 1/7/2023 - 31/12/2023
COMP6900 Computing Project
School of Information and Physical Sciences, The University of Newcastle, Australia
Course Coordinator 1/7/2022 - 31/12/2022
COMP6230 Algorithms
School of Information and Physical Sciences, The University of Newcastle, Australia
Course Coordinator 1/7/2023 - 31/12/2023
COMP6230 Algorithms
School of Information and Physical Sciences, The University of Newcastle, Australia
Course Coordinator 1/7/2022 - 31/12/2022
STAT3800 Deterministic and Stochastic Optimisation
School of Information and Physical Sciences, The University of Newcastle, Australia
Course Teacher 1/1/2023 - 30/6/2023
COMP2230 Algorithms
School of Information and Physical Sciences, The University of Newcastle, Australia
Course Coordinator 1/7/2022 - 31/12/2022
STAT3800 Deterministic and Stochastic Optimisation
School of Information and Physical Sciences, The University of Newcastle, Australia
Course Coordinator 1/1/2024 - 30/6/2024
COMP2230 Algorithms
School of Information and Physical Sciences, The University of Newcastle, Australia
Course Coordinator 1/7/2023 - 31/12/2023
Edit

Publications

For publications that are currently unpublished or in-press, details are shown in italics.


Conference (34 outputs)

Year Citation Altmetrics Link
2022 Zaman R, Newton MAH, Mataeimoghadam F, Sattar A, 'Tailoring Contact Based Scoring Functions for Protein Structure Prediction', AI 2021: ADVANCES IN ARTIFICIAL INTELLIGENCE, 13151, 155-168 (2022) [E1]
DOI 10.1007/978-3-030-97546-3_13
2021 Mataeimoghadam F, Newton MAH, Zaman R, Sattar A, 'Improving Protein Backbone Angle Prediction Using Hidden Markov Models in Deep Learning', Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 13031 LNAI, 239-251 (2021) [E1]
DOI 10.1007/978-3-030-89188-6_18
2020 Namazi M, Sanderson C, Newton MAH, Sattar A, 'Surrogate assisted optimisation for travelling thief problems', Proceedings of the 13th International Symposium on Combinatorial Search, SoCS 2020, 111-115 (2020) [E1]
Citations Scopus - 8
2019 Riahi V, Newton MAH, Sattar A, 'Exploiting Setup Time Constraints in Local Search for Flowshop Scheduling', Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 11671 LNAI, 379-392 (2019) [E1]
DOI 10.1007/978-3-030-29911-8_29
Citations Scopus - 1
2019 Namazi M, Newton MAH, Sattar A, Sanderson C, 'A profit guided coordination heuristic for travelling thief problems', Proceedings of the 12th International Symposium on Combinatorial Search, SoCS 2019, 140-144 (2019) [E1]
Citations Scopus - 9
2019 Karim A, Singh J, Mishra A, Dehzangi A, Newton MAH, Sattar A, 'Toxicity Prediction by Multimodal Deep Learning', Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 11669 LNAI, 142-152 (2019) [E1]
DOI 10.1007/978-3-030-30639-7_12
Citations Scopus - 1
2018 Riahi V, Newton M, Polash MMA, Sattar A, 'Customer Order Scheduling by Scattered Wolf Packs', Marakech, Morocco (2018)
2018 Namazi M, Sanderson C, Newton MAH, Polash MMA, Sattar A, 'Diversified Late Acceptance Search', AI 2018: ADVANCES IN ARTIFICIAL INTELLIGENCE, 11320, 299-311 (2018) [E1]
DOI 10.1007/978-3-030-03991-2_29
Citations Scopus - 9Web of Science - 7
2018 Riahi V, Newton MAH, Sattar A, 'Constraint-Guided Local Search for Single Mixed-Operation Runway', AI 2018: ADVANCES IN ARTIFICIAL INTELLIGENCE, 11320, 329-341 (2018) [E1]
DOI 10.1007/978-3-030-03991-2_32
Citations Scopus - 3Web of Science - 2
2018 Riahi V, Polash MMA, Newton MAH, Sattar A, 'Mixed Neighbourhood Local Search for Customer Order Scheduling Problem', PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 11012, 296-309 (2018) [E1]
DOI 10.1007/978-3-319-97304-3_23
Citations Scopus - 2Web of Science - 2
2018 Riahi V, Newton MAH, Su K, Sattar A, 'Local Search for Flowshops with Setup Times and Blocking Constraints', TWENTY-EIGHTH INTERNATIONAL CONFERENCE ON AUTOMATED PLANNING AND SCHEDULING (ICAPS 2018), 199-207 (2018) [E1]
Citations Scopus - 9Web of Science - 7
2016 Namazi M, Ghooshchi NG, Hakim Newton MA, Sattar A, 'Assignment precipitation in fail first search', Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 9992 LNAI, 281-287 (2016) [E1]
DOI 10.1007/978-3-319-50127-7_23
2015 Polash MMA, Newton MAH, Sattar A, 'Constraint-Based Local Search for Golomb Rulers', INTEGRATION OF AI AND OR TECHNIQUES IN CONSTRAINT PROGRAMMING, 9075, 322-331 (2015)
DOI 10.1007/978-3-319-18008-3_22
Citations Scopus - 7Web of Science - 4
2015 Ghooshchi NG, Namazi M, Newton MAH, Sattar A, 'Transition Constraints for Parallel Planning', PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, Austin, TX (2015)
Citations Scopus - 4Web of Science - 2
2014 Shatabda S, Newton MAH, Sattar A, 'Constraint-Based Evolutionary Local Search for Protein Structures with Secondary Motifs', PRICAI 2014: TRENDS IN ARTIFICIAL INTELLIGENCE, Gold Coast, AUSTRALIA (2014)
Citations Web of Science - 1
2014 Ahmed K, Newton MAH, Wen L, Sattar A, 'Formalisation of the integration of behavior trees', Ase 2014 Proceedings of the 29th ACM IEEE International Conference on Automated Software Engineering, 779-784 (2014)

In this paper, we present a formal definition of the integration of the requirements modeling language Behavior Trees (BTs). We first provide the semantic integration o... [more]

In this paper, we present a formal definition of the integration of the requirements modeling language Behavior Trees (BTs). We first provide the semantic integration of two interrelated BTs using an extended version of Communicating Sequential Processes. We then use a Semantic Network Model to capture a set of interrelated BTs, and develop algorithm to integrate them all into one BT. This formalisation facilitates developing (semi-)automated tools for modeling the requirements of large-scale software intensive systems.

DOI 10.1145/2642937.2642945
Citations Scopus - 4
2013 Duong TT, Pham DN, Sattar A, Hakim Newton MA, 'Weight-enhanced diversification in stochastic local search for satisfiability', Ijcai International Joint Conference on Artificial Intelligence, 524-530 (2013)

Intensification and diversification are the key factors that control the performance of stochastic local search in satisfiability (SAT). Recently, Novelty Walk has beco... [more]

Intensification and diversification are the key factors that control the performance of stochastic local search in satisfiability (SAT). Recently, Novelty Walk has become a popular method for improving diversification of the search and so has been integrated in many well-known SAT solvers such as TNM and gNovelty+. In this paper, we introduce new heuristics to improve the effectiveness of NoveltyWalk in terms of reducing search stagnation. In particular, we use weights (based on statistical information collected during the search) to focus the diversification phase onto specific areas of interest. With a given probability, we select the most frequently unsatisfied clause instead of a totally random one as Novelty Walk does. Amongst all the variables appearing in the selected clause, we then select the least flipped variable for the next move. Our experimental results show that the new weight-enhanced diversification method significantly improves the performance of gNovelty + and thus outperforms other local search SAT solvers on a wide range of structured and random satisfiability benchmarks.

Citations Scopus - 8
2013 Rashid MA, Newton MAH, Hoque MT, Sattar A, 'A Local Search Embedded Genetic Algorithm for Simplified Protein Structure Prediction', 2013 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 1091-1098 (2013)
Citations Scopus - 1Web of Science - 7
2013 Shatabda S, Newton MAH, Rashid MA, Sattar A, 'An Efficient Encoding for Simplified Protein Structure Prediction Using Genetic Algorithms', 2013 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 1217-1224 (2013)
Citations Scopus - 1Web of Science - 12
2013 Newton MAH, Pham DN, Tan WL, Portmann M, Sattar A, 'Stochastic Local Search Based Channel Assignment in Wireless Mesh Networks', PRINCIPLES AND PRACTICE OF CONSTRAINT PROGRAMMING, CP 2013, 8124, 832-847 (2013)
Citations Scopus - 4Web of Science - 3
2013 Shatabda S, Newton MAH, Sattar A, 'Mixed heuristic local search for protein structure prediction', Proceedings of the 27th Aaai Conference on Artificial Intelligence Aaai 2013, 876-882 (2013)

Protein structure prediction is an unsolved problem in computational biology. One great difficulty is due to the unknown factors in the actual energy function. Moreover... [more]

Protein structure prediction is an unsolved problem in computational biology. One great difficulty is due to the unknown factors in the actual energy function. Moreover, the energy models available are often not very informative particularly when spatially similar structures are compared during search. We introduce several novel heuristics to augment the energy model and present a new local search algorithm that exploits these heuristics in a mixed fashion. Although the heuristics individually are weaker in performance than the energy function, their combination interestingly produces stronger results. For standard benchmark proteins on the face centered cubic lattice and a realistic 20 × 20 energy model, we obtain structures with significantly lower energy than those obtained by the state-of-the-art algorithms. We also report results for these proteins using the same energy model on the cubic lattice. Copyright © 2013, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

Citations Scopus - 15
2013 Shatabda S, Newton MAH, Sattar A, 'Simplified lattice models for protein structure prediction: How good are they?', Proceedings of the 27th Aaai Conference on Artificial Intelligence Aaai 2013, 1639-1640 (2013)

In this paper, we present a local search framework for lattice fit problem of proteins. Our algorithm significantly improves state-of-the-art results and justifies the ... [more]

In this paper, we present a local search framework for lattice fit problem of proteins. Our algorithm significantly improves state-of-the-art results and justifies the significance of the lattice models. In addition to these, our analysis reveals the weakness of several energy functions used Copyright © 2013, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

Citations Scopus - 4
2013 Shatabda S, Newton MAH, Sattar A, 'Neighborhood selection in constraint-based local search for protein structure prediction', Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, 8272 LNAI, 44-55 (2013)

Protein structure prediction (PSP) is a very challenging constraint optimization problem. Constraint-based local search approaches have obtained promising results in so... [more]

Protein structure prediction (PSP) is a very challenging constraint optimization problem. Constraint-based local search approaches have obtained promising results in solving constraint models for PSP. However, the neighborhood exploration policies adopted in these approaches either remain exhaustive or are based on random decisions. In this paper, we propose heuristics to intelligently explore only the promising areas of the search neighborhood. On face centered cubic lattice using a realistic 20 × 20 energy model and standard benchmark proteins, we obtain structures with significantly lower energy and RMSD values than those obtained by the state-of-the-art algorithms. © Springer International Publishing 2013.

DOI 10.1007/978-3-319-03680-9_5
Citations Scopus - 1
2013 Rashid MA, Newton MAH, Hoque MT, Shatabda S, Pham DN, Sattar A, 'Spiral search: a hydrophobic-core directed local search for simplified PSP on 3D FCC lattice', BMC BIOINFORMATICS, 14 (2013)
DOI 10.1186/1471-2105-14-S2-S16
Citations Scopus - 1Web of Science - 14
2013 Shatabda S, Newton MAH, Rashid MA, Pham DN, Sattar A, 'The road not taken: retreat and diverge in local search for simplified protein structure prediction', BMC BIOINFORMATICS, 14 (2013)
DOI 10.1186/1471-2105-14-S2-S19
Citations Web of Science - 9
2013 Rashid MA, Newton MAH, Hoque MT, Sattar A, 'Collaborative Parallel Local Search for Simplified Protein Structure Prediction', 2013 12TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2013), 966-973 (2013)
DOI 10.1109/TrustCom.2013.118
Citations Scopus - 2Web of Science - 3
2013 Shatabda S, Newton MAH, Duc NP, Sattar A, 'A Hybrid Local Search for Simplified Protein Structure Prediction', BIOINFORMATICS 2013: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOINFORMATICS MODELS, METHODS AND ALGORITHMS, 158-163 (2013)
Citations Scopus - 3Web of Science - 1
2012 Rashid MA, Shatabda S, Newton MAH, Hoque MT, Pham DN, Sattar A, 'Random-walk: A stagnation recovery technique for simplified protein structure prediction', 2012 ACM Conference on Bioinformatics Computational Biology and Biomedicine Bcb 2012, 620-622 (2012)

Protein structure prediction is a challenging optimisation problem to the computer scientists. A large number of existing (meta-)heuristic search algorithms attempt to ... [more]

Protein structure prediction is a challenging optimisation problem to the computer scientists. A large number of existing (meta-)heuristic search algorithms attempt to solve the problem by exploring possible structures and finding the one with minimum free energy. However, these algorithms often get stuck in local minima and thus perform poorly on large sized proteins. In this paper, we present a random-walk based stagnation recovery approach. We tested our approach on tabu-based local search as well as population based genetic algorithms. The experimental results show that, random-walk is very effective for escaping from local minima for protein structure prediction on facecentred- cubic lattice and hydrophobic-polar energy model.

DOI 10.1145/2382936.2383043
Citations Scopus - 13
2012 Shatabda S, Newton MAH, Pham DN, Sattar A, 'Memory-based local search for simplified protein structure prediction', 2012 ACM Conference on Bioinformatics Computational Biology and Biomedicine Bcb 2012, 345-352 (2012)

Protein structure prediction is one of the most challenging problems in computational biology. Given a protein's amino acid sequence, a simplified version of the p... [more]

Protein structure prediction is one of the most challenging problems in computational biology. Given a protein's amino acid sequence, a simplified version of the problem is to find an on-lattice self-avoiding walk that minimizes the interaction energy among the amino acids. In this paper, we present a memory-based local search method for the simplified problem using Hydrophobic-Polar energy model and Face Centered Cubic lattice. By memorizing local minima and then avoiding their neighbohood, our approach significantly improves the state-of-the-art local search method for protein structure prediction on a set of standard benchmark proteins. Copyright © 2012 ACM.

DOI 10.1145/2382936.2382980
Citations Scopus - 17
2012 Rashid MA, Hoque MT, Newton MAH, Pham DN, Sattar A, 'A new genetic algorithm for simplified protein structure prediction', Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, 7691 LNAI, 107-119 (2012)

In this paper, we present a new genetic algorithm for protein structure prediction problem using face-centred cubic lattice and hydrophobic-polar energy model. Our algo... [more]

In this paper, we present a new genetic algorithm for protein structure prediction problem using face-centred cubic lattice and hydrophobic-polar energy model. Our algorithm uses i) an exhaustive generation approach to diversify the search; ii) a novel hydrophobic core-directed macro move to intensify the search; and iii) a random-walk strategy to recover from stagnation. On a set of standard benchmark proteins, our algorithm significantly outperforms the state-of-the-art algorithms for the same models. © 2012 Springer-Verlag.

DOI 10.1007/978-3-642-35101-3_10
Citations Scopus - 20
2011 Newton MAH, Pham DN, Sattar A, Maher M, 'Kangaroo: An efficient constraint-based local search system using lazy propagation', Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, 6876 LNCS, 645-659 (2011)

In this paper, we introduce Kangaroo, a constraint-based local search system. While existing systems such as Comet maintain invariants after every move, Kangaroo adopts... [more]

In this paper, we introduce Kangaroo, a constraint-based local search system. While existing systems such as Comet maintain invariants after every move, Kangaroo adopts a lazy strategy, updating invariants only when they are needed. Our empirical evaluation shows that Kangaroo consistently has a smaller memory footprint than Comet, and is usually significantly faster. © 2011 Springer-Verlag.

DOI 10.1007/978-3-642-23786-7_49
Citations Scopus - 32
2010 Newton MAH, Levine J, 'Implicit Learning of Compiled Macro-Actions for Planning', ECAI 2010 - 19TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PORTUGAL, Univ Lisbon, Fac Sci, Lisbon (2010)
DOI 10.3233/978-1-60750-606-5-323
Citations Scopus - 2Web of Science - 2
2007 Newton MAH, Levine J, Fox M, Long D, 'Learning macro-actions for arbitrary planners and domains', Icaps 2007 17th International Conference on Automated Planning and Scheduling, 256-263 (2007)

Many complex domains and even larger problems in simple domains remain challenging in spite of the recent progress in planning. Besides developing and improving plannin... [more]

Many complex domains and even larger problems in simple domains remain challenging in spite of the recent progress in planning. Besides developing and improving planning technologies, re-engineering a domain by utilising acquired knowledge opens up a potential avenue for further research. Moreover, macro-actions, when added to the domain as additional actions, provide a promising means by which to convey such knowledge. A macro-action, or macro in short, is a group of actions selected for application as a single choice. Most existing work on macros exploits properties explicitly specific to the planners or the domains. However, such properties are not likely to be common with arbitrary planners or domains. Therefore, a macro learning method that does not exploit any structural knowledge about planners or domains explicitly is of immense interest. This paper presents an offline macro learning method that works with arbitrarily chosen planners and domains. Given a planner, a domain, and a number of example problems, the learning method generates macros from plans of some of the given problems under the guidance of a genetic algorithm. It represents macros like regular actions, evaluates them individually by solving the remaining given problems, and suggests individual macros that are to be added to the domain permanently. Genetic algorithms are automatic learning methods that can capture inherent features of a system using no explicit knowledge about it. Our method thus does not strive to discover or utilise any structural properties specific to a planner or a domain. Copyright © 2007, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

Citations Scopus - 62
2005 Hosain MS, Newton MAH, 'Multi-key index for distributed database system', INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, Singapore, SINGAPORE (2005)
DOI 10.1142/S0218194005002075
Citations Scopus - 1
Show 31 more conferences

Journal article (42 outputs)

Year Citation Altmetrics Link
2025 Azad T, Newton MAH, Trevathan J, Sattar A, 'Syntactic and Semantic Edge Interoperability', Journal of Internet Services and Applications, 16, 235-252 (2025) [C1]
DOI 10.5753/jisa.2025.4996
2025 Sarker P, Banshal S, Newton M, Anika S, Sumsee Sristy M, Chakrobortty A, 'Perceived key factors affecting online university classrooms', Education and Information Technologies (2025)
DOI 10.1007/s10639-025-13744-w
2025 Azad T, Newton MAH, Trevathan J, Sattar A, 'IoT edge network interoperability', Computer Communications, 236 (2025) [C1]

Network interoperability is crucial for achieving seamless communication across Internet of Things (IoT) environments. IoT comprises heterogeneous devices and systems s... [more]

Network interoperability is crucial for achieving seamless communication across Internet of Things (IoT) environments. IoT comprises heterogeneous devices and systems supporting diverse technologies, protocols, and manufacturers. Enabling devices to communicate and exchange data effectively, regardless of underlying protocols, is key to building cohesive and integrated IoT networks. IoT has transformed multiple sectors ranging from home automation to healthcare¿by harnessing a vast array of sensors and actuators that communicate through cloud, fog, and edge layers. However, the variety in device manufacturing and communication standards demands interoperable interfaces, and most current solutions depend on cloud-based centralised architectures. These architectures introduce latency and scalability challenges, particularly for resource-constrained IoT devices that often struggle to communicate with the cloud due to limited resources. This paper addresses network interoperability at the IoT edge level, focusing on resource-efficient communication by integrating Wi-Fi and Bluetooth, two commonly used protocols in IoT ecosystems. We have implemented a network edge interoperability solution that supports effective data exchange between devices operating on these distinct protocols, enhancing the overall efficiency, flexibility, and scalability of IoT systems. Our approach allows devices interoperate by addressing network latency and bandwidth limitations, incorporating an integrated controller to facilitate broader applications and enhance performance across IoT networks. Our findings illustrate how bridging protocol differences can foster more resilient and adaptable IoT solutions, advancing the deployment of IoT applications across various domains and use cases.

DOI 10.1016/j.comcom.2025.108125
Citations Scopus - 1
2024 Cader JMA, Newton MAH, Rahman J, Cader AJMA, Sattar A, 'Ensembling methods for protein-ligand binding affinity prediction', SCIENTIFIC REPORTS, 14 (2024) [C1]

Protein-ligand binding affinity prediction is a key element of computer-aided drug discovery. Most of the existing deep learning methods for protein-ligand binding affi... [more]

Protein-ligand binding affinity prediction is a key element of computer-aided drug discovery. Most of the existing deep learning methods for protein-ligand binding affinity prediction utilize single models and suffer from low accuracy and generalization capability. In this paper, we train 13 deep learning models from combinations of 5 input features. Then, we explore all possible ensembles of the trained models to find the best ensembles. Our deep learning models use cross-attention and self-attention layers to extract short and long-range interactions. Our method is named Ensemble Binding Affinity (EBA). EBA extracts information from various models using different combinations of input features, such as simple 1D sequential and structural features of the protein-ligand complexes rather than 3D complex features. EBA is implemented to accurately predict the binding affinity of a protein-ligand complex. One of our ensembles achieves the highest Pearson correlation coefficient (R) value of 0.914 and the lowest root mean square error (RMSE) value of 0.957 on the well-known benchmark test set CASF2016. Our ensembles show significant improvements of more than 15% in R-value and 19% in RMSE on both well-known benchmark CSAR-HiQ test sets over the second-best predictor named CAPLA. Furthermore, the superior performance of the ensembles across all metrics compared to existing state-of-the-art protein-ligand binding affinity prediction methods on all five benchmark test datasets demonstrates the effectiveness and robustness of our approach. Therefore, our approach to improving binding affinity prediction between proteins and ligands can contribute to improving the success rate of potential drugs and accelerate the drug development process.

DOI 10.1038/s41598-024-72784-3
Citations Scopus - 1
2024 Rahman J, Newton MAH, Ali ME, Sattar A, 'Distance plus attention for binding affinity prediction', JOURNAL OF CHEMINFORMATICS, 16 (2024) [C1]

Protein-ligand binding affinity plays a pivotal role in drug development, particularly in identifying potential ligands for target disease-related proteins. Accurate af... [more]

Protein-ligand binding affinity plays a pivotal role in drug development, particularly in identifying potential ligands for target disease-related proteins. Accurate affinity predictions can significantly reduce both the time and cost involved in drug development. However, highly precise affinity prediction remains a research challenge. A key to improve affinity prediction is to capture interactions between proteins and ligands effectively. Existing deep-learning-based computational approaches use 3D grids, 4D tensors, molecular graphs, or proximity-based adjacency matrices, which are either resource-intensive or do not directly represent potential interactions. In this paper, we propose atomic-level distance features and attention mechanisms to capture better specific protein-ligand interactions based on donor-acceptor relations, hydrophobicity, and p-stacking atoms. We argue that distances encompass both short-range direct and long-range indirect interaction effects while attention mechanisms capture levels of interaction effects. On the very well-known CASF-2016 dataset, our proposed method, named Distance plus Attention for Affinity Prediction (DAAP), significantly outperforms existing methods by achieving Correlation Coefficient (R) 0.909, Root Mean Squared Error (RMSE) 0.987, Mean Absolute Error (MAE) 0.745, Standard Deviation (SD) 0.988, and Concordance Index (CI) 0.876. The proposed method also shows substantial improvement, around 2% to 37%, on five other benchmark datasets. The program and data are publicly available on the website https://gitlab.com/mahnewton/daap. Scientific Contribution Statement This study innovatively introducesdistance-based features to predict protein-ligand binding affinity, capitalizing onunique molecular interactions. Furthermore, the incorporation of protein sequencefeatures of specific residues enhances the model's proficiency in capturing intricatebinding patterns. The predictive capabilities are further strengthened through theuse of a deep learning architecture with attention mechanisms, and an ensembleapproach, averaging the outputs of five models, is implemented to ensure robustand reliable predictions.

DOI 10.1186/s13321-024-00844-x
Citations Scopus - 4Web of Science - 1
2024 Ben Islam MK, Newton MAH, Trevathan J, Sattar A, 'Lite approaches for long-range multi-step water quality prediction', STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 38, 3755-3770 (2024) [C1]

Forecasting accurate water quality is very important in aquaculture, environment monitoring, and many other applications. Many internal and external factors influence w... [more]

Forecasting accurate water quality is very important in aquaculture, environment monitoring, and many other applications. Many internal and external factors influence water quality. Therefore, water quality parameters exhibit complex time series characteristics. Consequently, long-range accurate prediction of water quality parameters suffers from poor propagation of information from past timepoints to further future timepoints. Moreover, to synchronise the prediction model with the changes in the time series characteristics, periodic retraining of the prediction model is required and such retraining is to be done on resource-restricted computation devices. In this work, we present a low-cost training approach to improve long-range multi-step water quality prediction. We train a short-range predictor to save training effort. Then, we strive to achieve and/or improve long-range prediction using multi-step iterative ensembling during inference. Experimental results on 9 water quality datasets demonstrate that the proposed method achieves significantly lower error than the existing state-of-the-art approaches. Our approach significantly outperforms the existing approaches in several standard metrics, even in the case of future timepoints at long distances.

DOI 10.1007/s00477-024-02770-8
Citations Scopus - 1
2024 Nikafshan Rad H, Su Z, Trinh A, Hakim Newton MA, Shamsani J, NYGC ALS Consortium , Karim A, Sattar A, 'Amyotrophic lateral sclerosis diagnosis using machine learning and multi-omic data integration', Heliyon, 10 (2024) [C1]

Amyotrophic Lateral Sclerosis (ALS) is a complex and rare neurodegenerative disorder characterized by significant genetic, molecular, and clinical heterogeneity. Despit... [more]

Amyotrophic Lateral Sclerosis (ALS) is a complex and rare neurodegenerative disorder characterized by significant genetic, molecular, and clinical heterogeneity. Despite numerous endeavors to discover the genetic factors underlying ALS, a significant number of these factors remain unknown. This knowledge gap highlights the necessity for personalized medicine approaches that can provide more comprehensive information for the purposes of diagnosis, prognosis, and treatment of ALS. This work utilizes an innovative approach by employing a machine learning-facilitated, multi-omic model to develop a more comprehensive knowledge of ALS. Through unsupervised clustering on gene expression profiles, 9,847 genes associated with ALS pathways are isolated and integrated with 7,699 genes containing rare, presumed pathogenic genomic variants, leading to a comprehensive amalgamation of 17,546 genes. Subsequently, a Variational Autoencoder is applied to distil complex biomedical information from these genes, culminating in the creation of the proposed Multi-Omics for ALS (MOALS) model, which has been designed to expose intricate genotype-phenotype interconnections within the dataset. Our meticulous investigation elucidates several pivotal ALS signaling pathways and demonstrates that MOALS is a superior model, outclassing other machine learning models based on single omic approaches such as SNV and RNA expression, enhancing accuracy by 1.7 percent and 6.2 percent, respectively. The findings of this study suggest that analyzing the relationships within biological systems can provide heuristic insights into the biological mechanisms that help to make highly accurate ALS diagnosis tools and achieve more interpretable results.

DOI 10.1016/j.heliyon.2024.e38583
2024 Azad T, Newton MAH, Trevathan J, Sattar A, 'Hierarchical Decentralized Edge Interoperability', IEEE INTERNET OF THINGS JOURNAL, 11, 13948-13960 (2024) [C1]

The Internet of Things (IoT) has many important applications in multiple domains that include home automation, smart cities, healthcare, agriculture, and environment. I... [more]

The Internet of Things (IoT) has many important applications in multiple domains that include home automation, smart cities, healthcare, agriculture, and environment. IoT comprises a wide range of sensors and actuators that communicate with each other over cloud, fog, and edge level networks. Moreover, these devices use various communication protocols and are made by different manufactures. To deal with these diversities, IoT essentially needs interoperable communication interfaces among devices. Unfortunately, existing interoperability solutions are centralized and use fog or cloud level computing resources, making IoT communications latency-prone and poorly scalable. These issues could be handled effectively, if edge level devices could be made interoperable within the edge level and without needing fog or cloud level access. This article proposes a decentralized interoperability solution that stays fully within the edge level. The solution relies on controller devices that work on the interface boundaries of the edge devices. Unlike existing solutions, the proposed solution adopts a hierarchical interoperability model to handle interoperability at network, syntactical, semantic, and organizational levels. Our solution is nonproprietary, generic over vendors and platforms, and easily extendable to new devices. We compare our proposed solution with existing interoperability solutions for edge devices and show its mobility, efficiency, and flexibility.

DOI 10.1109/JIOT.2023.3340298
Citations Scopus - 3
2023 Rahman J, Newton MAH, Hasan MAM, Sattar A, 'Real-to-bin conversion for protein residue distances', COMPUTATIONAL BIOLOGY AND CHEMISTRY, 104 (2023) [C1]

Protein Structure Prediction (PSP) has achieved significant progress lately. Prediction of inter-residue distances by machine learning and their exploitation during the... [more]

Protein Structure Prediction (PSP) has achieved significant progress lately. Prediction of inter-residue distances by machine learning and their exploitation during the conformational search is largely among the critical factors behind the progress. Real values than bin probabilities could more naturally represent inter-residue distances, while the latter, via spline curves more naturally helps obtain differentiable objective functions than the former. Consequently, PSP methods that exploit predicted binned distances perform better than those that exploit predicted real-valued distances. To leverage the advantage of bin probabilities in getting differentiable objective functions, in this work, we propose techniques to convert real-valued distances into distance bin probabilities. Using standard benchmark proteins, we then show that our real-to-bin converted distances help PSP methods obtain three-dimensional structures with 4%¿16% better root mean squared deviation (RMSD), template modeling score (TM-Score), and global distance test (GDT) values than existing similar PSP methods. Our proposed PSP method is named real to bin (R2B) inter-residue distance predictor, and its code is available from https://gitlab.com/mahnewton/r2b.

DOI 10.1016/j.compbiolchem.2023.107834
Citations Scopus - 2Web of Science - 1
2023 Mufassirin MMM, Newton MAH, Rahman J, Sattar A, 'Multi-S3P: Protein Secondary Structure Prediction With Specialized Multi-Network and Self-Attention-Based Deep Learning Model', IEEE ACCESS, 11, 57083-57096 (2023) [C1]

Protein structure prediction (PSP) is a vital challenge in bioinformatics, structural biology and drug discovery. Protein secondary structure (SS) prediction is critica... [more]

Protein structure prediction (PSP) is a vital challenge in bioinformatics, structural biology and drug discovery. Protein secondary structure (SS) prediction is critical since three-dimensional (3D) structures are primarily made up of secondary structures. With the advancement of deep learning approaches, SS classification accuracy has been significantly improved. Many existing methods use an ensemble of complex neural networks to improve SS prediction. Because of the high dimensionality of the hyperparameter space, deep neural networks with complex architectures are typically challenging to train effectively. Also, predicting secondary structures in the boundary regions between different types of SS is challenging. This study presents Multi-S3P, which employs bidirectional Long-Short-Term-Memory (BILSTM) and Convolutional Neural Networks (CNN) with a self-attention mechanism to improve the secondary structure prediction using an effective training strategy to capture the unique characteristics of each type of secondary structure and combine them more effectively. The ensemble of CNN and BILSTM can learn both contextual information and long-range interactions between the residues. In addition, using a self-attention mechanism allows the model to focus on the most important features for improving performance. We used the SPOT-1D dataset for the training and validation of our model using a set of four input features derived from amino acid sequences. Further, the model was tested on four popular independent test datasets and compared with various state-of-the-art predictors. The presented results show that Multi-S3P outperformed the other methods in terms of Q3, Q8 accuracy and other performance metrics, achieving the highest Q3 accuracy of 87.57% and a Q8 accuracy of 77.56% on the TEST2016 test set. More importantly, Multi-S3P demonstrates high performance in SS boundary regions. Our experiment also demonstrates that the combination of different input features and a multi-network-based training strategy significantly improved the performance.

DOI 10.1109/ACCESS.2023.3282702
Citations Scopus - 1Web of Science - 4
2023 Namazi M, Newton MAH, Sanderson C, Sattar A, 'Solving travelling thief problems using coordination based methods', JOURNAL OF HEURISTICS [C1]
DOI 10.1007/s10732-023-09518-7
Citations Scopus - 6Web of Science - 2
2023 Mufassirin MMM, Newton MAH, Sattar A, 'Artificial intelligence for template-free protein structure prediction: a comprehensive review', ARTIFICIAL INTELLIGENCE REVIEW, 56, 7665-7732 (2023) [C1]
DOI 10.1007/s10462-022-10350-x
Citations Scopus - 1Web of Science - 3
2022 Saha S, Sarker PS, Al Saud A, Shatabda S, Newton MAH, 'Cluster-oriented instance selection for classification problems', INFORMATION SCIENCES, 602, 143-158 (2022) [C1]

More training instances could lead to better classification accuracy. However, accuracy could also degrade if more training instances mean further noises and outliers. ... [more]

More training instances could lead to better classification accuracy. However, accuracy could also degrade if more training instances mean further noises and outliers. Additional training instances arguably need additional computational resources in future data mining operations. Instance selection algorithms identify subsets of training instances that could desirably increase accuracy or at least do not decrease accuracy significantly. There exist many instance selection algorithms, but no single algorithm, in general, dominates the others. Moreover, existing instance selection algorithms do not allow direct controlling of the instance selection rate. In this paper, we present a simple and generic cluster-oriented instance selection algorithm for classification problems. Our proposed algorithm runs an unsupervised K Means Clustering algorithm on the training instances and with a given selection rate, selects instances from the centers and the borders of the clusters. On 24 benchmark classification problems, when very similar percentages of instances are selected by various instance selection algorithms, K Nearest Neighbours classifiers achieve more than 2%¿3% better accuracy when using instances selected by our proposed method than when using those selected by other state-of-the-art generic instance selection algorithms.

DOI 10.1016/j.ins.2022.04.036
Citations Scopus - 4Web of Science - 16
2022 Rahman J, Newton MAH, Hasan MAM, Sattar A, 'A stacked meta-ensemble for protein inter-residue distance prediction', COMPUTERS IN BIOLOGY AND MEDICINE, 148 (2022) [C1]

Predicted inter-residue distances are a key behind recent success in high quality protein structure prediction (PSP). However, prediction of both short and long distanc... [more]

Predicted inter-residue distances are a key behind recent success in high quality protein structure prediction (PSP). However, prediction of both short and long distance values together is challenging. Consequently, predicted short distances are mostly used by existing PSP methods. In this paper, we use a stacked meta-ensemble method to combine deep learning models trained for different ranges of real-valued distances. On five benchmark sets of proteins, our proposed inter-residue distance prediction method improves mean Local Distance Different Test (LDDT) scores at least by 5% over existing such methods. Moreover, using a real-valued distance based conformational search algorithm, we also show that predicted long distances help obtain significantly better protein conformations than when only predicted short distances are used. Our method is named meta-ensemble for distance prediction (MDP) and its program is available from https://gitlab.com/mahnewton/mdp.

DOI 10.1016/j.compbiomed.2022.105824
Citations Scopus - 7Web of Science - 5
2022 Ben Islam MK, Newton MAH, Rahman J, Trevathan J, Sattar A, 'Long range multi-step water quality forecasting using iterative ensembling', ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 114 (2022) [C1]

Real-life water quality monitoring applications such as aquaculture domains and water resource management need long range multi-step prediction for disaster control. Ho... [more]

Real-life water quality monitoring applications such as aquaculture domains and water resource management need long range multi-step prediction for disaster control. However, prediction accuracy usually degrades gradually as the prediction target timepoint is further away from the current timepoint. To address this, recent water quality forecasting methods mostly rely on complex deep learning models. In this paper, we propose a simple time-variant iterative ensembling method that strives to significantly improve the performance of a given arbitrary long range multi-step time series predictor for water quality data with minimal increase in computational cost. With the given predictor, our proposed method iteratively uses ensembles of predicted values for preceding steps to improve the prediction accuracy for the succeeding steps. The iterative ensembling operation is performed on the trained model and only at the inference stage, and so does not need any further computing-intensive training for the performance improvement. We experimentally show that the proposed method is effective with 7 predictors and 9 water quality datasets of various types, and it outperforms the state-of-the-art results in those datasets by around 2%¿29% in mean absolute error (MAE), root mean squared error (RMSE) and mean absolute percentage error (MAPE) metrics. Similar improvement has also been found in two other metrics such as normalized Nash¿Sutcliffe model efficiency coefficient (NNSE) metric and Taylor diagram plot. Overall, the proposed iterative ensembling is a promising approach for multi-step long range water quality prediction for high-frequency water quality monitoring systems.

DOI 10.1016/j.engappai.2022.105166
Citations Scopus - 1Web of Science - 12
2022 Newton MAH, Zaman R, Mataeimoghadam F, Rahman J, Sattar A, 'Constraint Guided Beta-Sheet Refinement for Protein Structure Prediction', COMPUTATIONAL BIOLOGY AND CHEMISTRY, 101 (2022) [C1]
DOI 10.1016/j.compbiolchem.2022.107773
Citations Scopus - 1
2022 Newton MAH, Mataeimoghadam F, Zaman R, Sattar A, 'Secondary structure specific simpler prediction models for protein backbone angles', BMC BIOINFORMATICS, 23 (2022) [C1]
DOI 10.1186/s12859-021-04525-6
Citations Scopus - 7Web of Science - 4
2022 Rahman J, Newton MAH, Ben Islam MK, Sattar A, 'Enhancing protein inter-residue real distance prediction by scrutinising deep learning models', SCIENTIFIC REPORTS, 12 (2022) [C1]
DOI 10.1038/s41598-021-04441-y
Citations Scopus - 9Web of Science - 8
2022 Newton MAH, Rahman J, Zaman R, Sattar A, 'Enhancing protein contact map prediction accuracy via ensembles of inter-residue distance predictors', COMPUTATIONAL BIOLOGY AND CHEMISTRY, 99 (2022) [C1]

Protein contact maps capture coevolutionary interactions between amino acid residue pairs that are spatially within certain proximity threshold. Predicted contact maps ... [more]

Protein contact maps capture coevolutionary interactions between amino acid residue pairs that are spatially within certain proximity threshold. Predicted contact maps are used in many protein related problems that include drug design, protein design, protein function prediction, and protein structure prediction. Contact map prediction has achieved significant progress lately but still further challenges remain with prediction of contacts between residues that are separated in the amino acid residue sequence by large numbers of other residues. In this paper, with experimental results on 5 standard benchmark datasets that include membrane proteins, we show that contact map prediction could be significantly enhanced by using ensembles of various state-of-the-art short distance predictors and then by converting predicted distances into contact probabilities. Our program along with its data is available from https://gitlab.com/mahnewton/ecp.

DOI 10.1016/j.compbiolchem.2022.107700
Citations Scopus - 6Web of Science - 3
2022 Zaman R, Newton MAH, Mataeimoghadam F, Sattar A, 'Constraint Guided Neighbor Generation for Protein Structure Prediction', IEEE ACCESS, 10, 54991-55001 (2022) [C1]
DOI 10.1109/ACCESS.2022.3176945
Citations Scopus - 4Web of Science - 2
2021 Karim A, Riahi V, Mishra A, Newton MAH, Dehzangi A, Balle T, Sattar A, 'Quantitative Toxicity Prediction via Meta Ensembling of Multitask Deep Learning Models', ACS OMEGA, 6, 12306-12317 (2021) [C1]

Toxicity prediction using quantitative structure-activity relationship has achieved significant progress in recent years. However, most existing machine learning method... [more]

Toxicity prediction using quantitative structure-activity relationship has achieved significant progress in recent years. However, most existing machine learning methods in toxicity prediction utilize only one type of feature representation and one type of neural network, which essentially restricts their performance. Moreover, methods that use more than one type of feature representation struggle with the aggregation of information captured within the features since they use predetermined aggregation formulas. In this paper, we propose a deep learning framework for quantitative toxicity prediction using five individual base deep learning models and their own base feature representations. We then propose to adopt a meta ensemble approach using another separate deep learning model to perform aggregation of the outputs of the individual base deep learning models. We train our deep learning models in a weighted multitask fashion combining four quantitative toxicity data sets of LD50, IGC50, LC50, and LC50-DM and minimizing the root-mean-square errors. Compared to the current state-of-the-art toxicity prediction method TopTox on LD50, IGC50, and LC50-DM, that is, three out of four data sets, our method, respectively, obtains 5.46, 16.67, and 6.34% better root-mean-square errors, 6.41, 11.80, and 12.16% better mean absolute errors, and 5.21, 7.36, and 2.54% better coefficients of determination. We named our method QuantitativeTox, and our implementation is available from the GitHub repository https://github.com/Abdulk084/QuantitativeTox.

DOI 10.1021/acsomega.1c01247
Citations Scopus - 3Web of Science - 18
2021 Mataeimoghadam F, Newton MAH, Dehzangi A, Karim A, Jayaram B, Ranganathan S, Sattar A, 'Enhancing protein backbone angle prediction by using simpler models of deep neural networks (vol 10, 19430, 2020)', SCIENTIFIC REPORTS, 11 (2021)
DOI 10.1038/s41598-021-96666-0
2021 Newton MAH, Polash MMA, Pham DN, Thornton J, Su K, Sattar A, 'Evaluating logic gate constraints in local search for structured satisfiability problems', ARTIFICIAL INTELLIGENCE REVIEW, 54, 5347-5411 (2021) [C1]
DOI 10.1007/s10462-021-10024-0
Citations Scopus - 4Web of Science - 3
2021 Riahi V, Newton MAH, Sattar A, 'A scatter search algorithm for time-dependent prize-collecting arc routing problems', COMPUTERS & OPERATIONS RESEARCH, 134 (2021) [C1]
DOI 10.1016/j.cor.2021.105392
Citations Scopus - 1Web of Science - 7
2021 Riahi V, Newton MAH, Sattar A, 'Constraint based local search for flowshops with sequence-dependent setup times', ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 102 (2021) [C1]
DOI 10.1016/j.engappai.2021.104264
Citations Scopus - 4Web of Science - 2
2020 Mataeimoghadam F, Newton MAH, Dehzangi A, Karim A, Jayaram B, Ranganathan S, Sattar A, 'Enhancing protein backbone angle prediction by using simpler models of deep neural networks', SCIENTIFIC REPORTS, 10 (2020) [C1]
DOI 10.1038/s41598-020-76317-6
Citations Scopus - 2Web of Science - 20
2019 Riahi V, Newton MAH, Su K, Sattar A, 'Constraint guided accelerated search for mixed blocking permutation flowshop scheduling', Computers and Operations Research, 102 102-120 (2019) [C1]

Mixed Blocking Permutation Flowshop Scheduling Problem (MBPFSP) with the objective of makespan minimisation is NP-Hard. It has important industrial applications that in... [more]

Mixed Blocking Permutation Flowshop Scheduling Problem (MBPFSP) with the objective of makespan minimisation is NP-Hard. It has important industrial applications that include the cider production industry. MBPFSP has made some progress in recent years. However, within practical time limits, existing incomplete algorithms still either find low quality solutions or struggle with large problems. One key reason behind this is the typical way of using generic heuristics or metaheuristics that usually lack problem specific structural knowledge. In MBPFSP, a machine could be blocked with the currently finished job until the subsequent machine is available to process the same job. These blocking constraints affect the makespan. So MBPFSP search should naturally take explicit steps to take the blocking constraints into account. Unfortunately, existing research on MBPFSP just uses only the makespan to compare generated solutions, but the search otherwise is not aware of the blocking constraints. Moreover, existing such methods use either an exhaustive or a random neighbourhood generation strategy. In this work, we aim to advance MBPFSP search by better exploiting the problem specific structural knowledge. We use the constraint and the objective functions to obtain such problem specific knowledge and we exploit such knowledge both in a constructive search method and in a local search method. In this paper, we also present an acceleration method to efficiently evaluate insertion-based neighbourhoods of MBPFSP. Our experimental results on three standard testbeds demonstrate that our proposed algorithms significantly improve over existing best-performing algorithms.

DOI 10.1016/j.cor.2018.10.003
Citations Scopus - 24Web of Science - 21
2019 Riahi V, Newton MAH, Polash MMA, Su K, Sattar A, 'Constraint guided search for aircraft sequencing', EXPERT SYSTEMS WITH APPLICATIONS, 118, 440-458 (2019) [C1]

Aircraft sequencing problem (ASP) is an NP-Hard problem. It involves allocation of aircraft to runways for landing and takeoff, minimising total tardiness. ASP has made... [more]

Aircraft sequencing problem (ASP) is an NP-Hard problem. It involves allocation of aircraft to runways for landing and takeoff, minimising total tardiness. ASP has made significant progress in recent years. However, within practical time limits, existing incomplete algorithms still either find low quality solutions or struggle with large problems. One key reason behind this is the typical way of using generic heuristics or metaheuristics that usually lack problem specific structural knowledge. As a result, existing such methods use either an exhaustive or a random neighbourhood generation strategy. So their search guidance comes only from the evaluation function that is used mainly after the neighbourhood generation. In this work, we aim to advance ASP search by better exploiting the problem specific structural knowledge. We use the constraint and the objective functions to obtain such problem specific knowledge and we exploit such knowledge both in a constructive search method and in a local search method. Our motivation comes from the constraint optimisation paradigm in artificial intelligence, where instead of random decisions, constraint-guided more informed optimisation decisions are of particular interest. We run our experiments on a range of standard benchmark problem instances that include instances from real airports and instances crafted using real airport parameters, and contain scenarios involving multiple runways and both landing and takeoff operations. We show that our proposed algorithms significantly outperform existing state-of-the-art aircraft sequencing algorithms.

DOI 10.1016/j.eswa.2018.10.033
Citations Scopus - 1Web of Science - 7
2019 Karim A, Mishra A, Newton MAH, Sattar A, 'Efficient Toxicity Prediction via Simple Features Using Shallow Neural Networks and Decision Trees', ACS OMEGA, 4, 1874-1888 (2019) [C1]

Toxicity prediction of chemical compounds is a grand challenge. Lately, it achieved significant progress in accuracy but using a huge set of features, implementing a co... [more]

Toxicity prediction of chemical compounds is a grand challenge. Lately, it achieved significant progress in accuracy but using a huge set of features, implementing a complex blackbox technique such as a deep neural network, and exploiting enormous computational resources. In this paper, we strongly argue for the models and methods that are simple in machine learning characteristics, efficient in computing resource usage, and powerful to achieve very high accuracy levels. To demonstrate this, we develop a single task-based chemical toxicity prediction framework using only 2D features that are less compute intensive. We effectively use a decision tree to obtain an optimum number of features from a collection of thousands of them. We use a shallow neural network and jointly optimize it with decision tree taking both network parameters and input features into account. Our model needs only a minute on a single CPU for its training while existing methods using deep neural networks need about 10 min on NVidia Tesla K40 GPU. However, we obtain similar or better performance on several toxicity benchmark tasks. We also develop a cumulative feature ranking method which enables us to identify features that can help chemists perform prescreening of toxic compounds effectively.

DOI 10.1021/acsomega.8b03173
Citations Scopus - 5Web of Science - 44
2019 Riahi V, Newton MAH, Polash MMA, Sattar A, 'Tailoring customer order scheduling search algorithms', COMPUTERS & OPERATIONS RESEARCH, 108, 155-165 (2019) [C1]
DOI 10.1016/j.cor.2019.04.015
Citations Scopus - 1Web of Science - 12
2019 Newton MAH, Riahi V, Su K, Sattar A, 'Scheduling blocking flowshops with setup times via constraint guided and accelerated local search', COMPUTERS & OPERATIONS RESEARCH, 109, 64-76 (2019) [C1]
DOI 10.1016/j.cor.2019.04.024
Citations Scopus - 2Web of Science - 18
2019 Newton MAH, Riahi V, Sattar A, 'Makespan preserving flowshop reengineering via blocking constraints', COMPUTERS & OPERATIONS RESEARCH, 112 (2019) [C1]
DOI 10.1016/j.cor.2019.07.013
Citations Scopus - 1
2017 Riahi V, Khorramizadeh M, Newton MAH, Sattar A, 'Scatter search for mixed blocking flowshop scheduling', EXPERT SYSTEMS WITH APPLICATIONS, 79, 20-32 (2017) [C1]

Empty or limited storage capacities between machines introduce various types of blocking constraint in the industries with flowshop environment. While large application... [more]

Empty or limited storage capacities between machines introduce various types of blocking constraint in the industries with flowshop environment. While large applications demand flowshop scheduling with a mix of different types of blocking, research in this area mainly focuses on using only one kind of blocking in a given problem instance. In this paper, using makespan as a criterion, we study permutation flowshops with zero capacity buffers operating under mixed blocking conditions. We present a very effective scatter search (SS) algorithm for this. At the initialisation phase of SS, we use a modified version of the well-known Nawaz, Enscore and Ham (NEH) heuristic. For the improvement method in SS, we use an Iterated Local Search (ILS) algorithm that adopts a greedy job selection and a powerful NEH-based perturbation procedure. Moreover, in the reference set update phase of SS, with small probabilities, we accept worse solutions so as to increase the search diversity. On standard benchmark problems of varying sizes, our algorithm very significantly outperforms well-known existing algorithms in terms of both the solution quality and the computing time. Moreover, our algorithm has found new upper bounds for 314 out of 360 benchmark problem instances.

DOI 10.1016/j.eswa.2017.02.027
Citations Scopus - 4Web of Science - 43
2017 Polash MMA, Newton MAH, Sattar A, 'Constraint-directed search for all-interval series', Constraints, 22 403-431 (2017) [C1]

All-interval series is a standard benchmark problem for constraint satisfaction search. An all-interval series of size n is a permutation of integers [0, n) such that t... [more]

All-interval series is a standard benchmark problem for constraint satisfaction search. An all-interval series of size n is a permutation of integers [0, n) such that the differences between adjacent integers are a permutation of [1, n). Generating each such all-interval series of size n is an interesting challenge for constraint community. The problem is very difficult in terms of the size of the search space. Different approaches have been used to date to generate all the solutions of AIS but the search space that must be explored still remains huge. In this paper, we present a constraint-directed backtracking-based tree search algorithm that performs efficient lazy checking rather than immediate constraint propagation. Moreover, we prove several key properties of all-interval series that help prune the search space significantly. The reduced search space essentially results into fewer backtracking. We also present scalable parallel versions of our algorithm that can exploit the advantage of having multi-core processors and even multiple computer systems. Our new algorithm generates all the solutions of size up to 27 while a satisfiability-based state-of-the-art approach generates all solutions up to size 24.

DOI 10.1007/s10601-016-9261-y
Citations Scopus - 1Web of Science - 1
2017 Ghooshchi NG, Namazi M, Newton MAH, Sattar A, 'Encoding domain transitions for constraint-based Planning', Journal of Artificial Intelligence Research, 58 905-966 (2017) [C1]

We describe a constraint-based automated planner named Transition Constraints for Parallel Planning (TCPP). TCPP constructs its constraint model from a redefined versio... [more]

We describe a constraint-based automated planner named Transition Constraints for Parallel Planning (TCPP). TCPP constructs its constraint model from a redefined version of the domain transition graphs (DTG) of a given planning problem. TCPP encodes state transitions in the redefined DTGs by using table constraints with cells containing don't cares or wild cards. TCPP uses Minion the constraint solver to solve the constraint model and returns a parallel plan. We empirically compare TCPP with the other state-of-theart constraint-based parallel planner PaP2. PaP2 encodes action successions in the finite state automata (FSA) as table constraints with cells containing sets of values. PaP2 uses SICStus Prolog as its constraint solver. We also improve PaP2 by using don't cares and mutex constraints. Our experiments on a number of standard classical planning benchmark domains demonstrate TCPP's efficiency over the original PaP2 running on SICStus Prolog and our reconstructed and enhanced versions of PaP2 running on Minion.

DOI 10.1613/jair.5378
Citations Scopus - 5Web of Science - 2
2017 Polash MMA, Newton MAH, Sattar A, 'Constraint-based search for optimal Golomb rulers', JOURNAL OF HEURISTICS, 23, 501-532 (2017) [C1]
DOI 10.1007/s10732-017-9353-x
Citations Scopus - 9Web of Science - 2
2014 Rashid MA, Polash MMA, Newton MAH, Hoque MT, Sattar A, 'Amino Acids Pattern-Biased Spiral Search for Protein Structure Prediction', PRICAI 2014: TRENDS IN ARTIFICIAL INTELLIGENCE, 8862, 143-156 (2014)
2014 Rashid MA, Shatabda S, Newton MAH, Hoque MT, Sattar A, 'A parallel framework for multipoint spiral search in ab initio protein structure prediction', Advances in Bioinformatics, 2014 (2014)

Protein structure prediction is computationally a very challenging problem. A large number of existing search algorithms attempt to solve the problem by exploring possi... [more]

Protein structure prediction is computationally a very challenging problem. A large number of existing search algorithms attempt to solve the problem by exploring possible structures and finding the one with the minimum free energy. However, these algorithms perform poorly on large sized proteins due to an astronomically wide search space. In this paper, we present a multipoint spiral search framework that uses parallel processing techniques to expedite exploration by starting from different points. In our approach, a set of random initial solutions are generated and distributed to different threads. We allow each thread to run for a predefined period of time. The improved solutions are stored threadwise. When the threads finish, the solutions are merged together and the duplicates are removed. A selected distinct set of solutions are then split to different threads again. In our ab initio protein structure prediction method, we use the three-dimensional face-centred-cubic lattice for structure-backbone mapping. We use both the low resolution hydrophobic-polar energy model and the high-resolution 20×20 energy model for search guiding. The experimental results show that our new parallel framework significantly improves the results obtained by the state-of-the-art single-point search approaches for both energy models on three-dimensional face-centred-cubic lattice. We also experimentally show the effectiveness of mixing energy models within parallel threads. © 2014 Mahmood A. Rashid et al.

DOI 10.1155/2014/985968
Citations Scopus - 2
2014 Shatabda S, Newton MAH, Rashid MA, Pham DN, Sattar A, 'How good are simplified models for protein structure prediction?', Advances in Bioinformatics, 2014 (2014)

Protein structure prediction (PSP) has been one of the most challenging problems in computational biology for several decades. The challenge is largely due to the compl... [more]

Protein structure prediction (PSP) has been one of the most challenging problems in computational biology for several decades. The challenge is largely due to the complexity of the all-atomic details and the unknown nature of the energy function. Researchers have therefore used simplified energy models that consider interaction potentials only between the amino acid monomers in contact on discrete lattices. The restricted nature of the lattices and the energy models poses a twofold concern regarding the assessment of the models. Can a native or a very close structure be obtained when structures are mapped to lattices? Can the contact based energy models on discrete lattices guide the search towards the native structures? In this paper, we use the protein chain lattice fitting (PCLF) problem to address the first concern; we developed a constraint-based local search algorithm for the PCLF problem for cubic and face-centered cubic lattices and found very close lattice fits for the native structures. For the second concern, we use a number of techniques to sample the conformation space and find correlations between energy functions and root mean square deviation (RMSD) distance of the lattice-based structures with the native structures. Our analysis reveals weakness of several contact based energy models used that are popular in PSP. © 2014 Swakkhar Shatabda et al.

DOI 10.1155/2014/867179
Citations Scopus - 12
2013 Rashid MA, Newton MAH, Hoque MT, Sattar A, 'Mixing Energy Models in Genetic Algorithms for On-Lattice Protein Structure Prediction', BIOMED RESEARCH INTERNATIONAL, 2013 (2013)
DOI 10.1155/2013/924137
Citations Scopus - 1Web of Science - 9
2008 Shahriar AZM, Akbar MM, Rahman MS, Newton MAH, 'A multiprocessor based heuristic for multi-dimensional multiple-choice knapsack problem', JOURNAL OF SUPERCOMPUTING, 43 257-280 (2008)
DOI 10.1007/s11227-007-0144-2
Citations Scopus - 18Web of Science - 8
2005 Hosain MS, Newton MAH, Rahman MM, 'Performance analysis of global index in distributed environment', Wseas Transactions on Computers, 4, 874-881 (2005)

A multi-key index model has been proposed for fast and efficient access of data in distributed database system. The index model has tow parts global index and local ind... [more]

A multi-key index model has been proposed for fast and efficient access of data in distributed database system. The index model has tow parts global index and local index. The performance of global index greatly depends on its smooth adaptability in distributed environment, effective space utilization and operation speed. In this paper we present the algorithms for dynamic adaptation of global index. We also make comparison of global index structure to other multi-key indices such as graph structure based multi-key index.

Show 39 more journal articles
Edit

Grants and Funding

Summary

Number of grants 7
Total funding $82,430

Click on a grant title below to expand the full details for that specific grant.


20251 grants / $5,000

Global Experience Support Funding$5,000

Funding body: University of Newcastle Global Engagement and Partnerships (UON Global)

Funding body University of Newcastle Global Engagement and Partnerships (UON Global)
Project Team

M A Hakim Newton, Marcella Papini, Kyle Harrison, Saiful Islam, Nasimul Noman, Rukshan Athauda, Alex Mendes, Karen Blackmore

Scheme Global Experience Support Fund
Role Lead
Funding Start 2025
Funding Finish 2025
GNo
Type Of Funding Internal
Category INTE
UON N

20243 grants / $63,011

Next-Level Predictions: Transforming Tabular Data for High- Dimensional Precision$52,500

Funding body: Honeysuckle Health Pty Limited

Funding body Honeysuckle Health Pty Limited
Project Team Doctor Saiful Islam, Dr Ryan Gallagher, Mrs Venkata Kadiyala, Doctor Mahakim Newton
Scheme PhD Scholarship
Role Investigator
Funding Start 2024
Funding Finish 2027
GNo G2401394
Type Of Funding C3100 – Aust For Profit
Category 3100
UON Y

Global Experience Support Funding$9,000

Funding body: University of Newcastle Global Engagement and Partnerships (UON Global)

Funding body University of Newcastle Global Engagement and Partnerships (UON Global)
Project Team

M A Hakim Newton, Marcella Papini, Weijia Zhang, Saiful Islam, Alexandre Mendes, Karen Blackmore

Scheme Global Experience Support Fund
Role Lead
Funding Start 2024
Funding Finish 2024
GNo
Type Of Funding Internal
Category INTE
UON N

Course Development Funding$1,511

Funding body: College of Engineering, Science and Environment (CESE), University of Newcastle

Funding body College of Engineering, Science and Environment (CESE), University of Newcastle
Project Team

M A Hakim Newton

Scheme College of Engineering, Science, & Environment (CESE) Course Development Funding
Role Lead
Funding Start 2024
Funding Finish 2024
GNo
Type Of Funding Internal
Category INTE
UON N

20223 grants / $14,420

Start-up grant$10,000

Funding body: College of Engineering, Science & Environment (CESE) Start-up Funding

Funding body College of Engineering, Science & Environment (CESE) Start-up Funding
Project Team

M A Hakim Newton

Scheme College of Engineering, Science & Environment (CESE) Start-up Funding
Role Lead
Funding Start 2022
Funding Finish 2022
GNo
Type Of Funding Internal
Category INTE
UON N

SIPS Funding$3,000

Funding body: School of Information and Physical Sciences (SIPS) Funding

Funding body School of Information and Physical Sciences (SIPS) Funding
Project Team

M A Hakim Newton

Scheme School of Information and Physical Sciences (SIPS) Funding
Role Lead
Funding Start 2022
Funding Finish 2022
GNo
Type Of Funding Internal
Category INTE
UON N

Course Development Funding$1,420

Funding body: College of Engineering, Science and Environment (CESE), University of Newcastle

Funding body College of Engineering, Science and Environment (CESE), University of Newcastle
Project Team

M A Hakim Newton

Scheme College of Engineering, Science, & Environment (CESE) Course Development Funding
Role Lead
Funding Start 2022
Funding Finish 2022
GNo
Type Of Funding Internal
Category INTE
UON N
Edit

Research Supervision

Number of supervisions

Completed10
Current10

Current Supervision

Commenced Level of Study Research Title Program Supervisor Type
2025 PhD AI-guided phenomic and genomic prediction for yield performance in macadamia Computer Science, Griffith University Co-Supervisor
2025 PhD HSC Course Prediction using AI Methods Education, School of Education, The University of Newcastle Co-Supervisor
2025 PhD HSC Course Prediction Model PhD (Education), College of Human and Social Futures, The University of Newcastle Co-Supervisor
2024 PhD Ethical AI for Enhanced Health Insurance Risk Assessment and Decision-Making PhD (Computer Science), College of Engineering, Science and Environment, The University of Newcastle Co-Supervisor
2024 PhD Explainable Machine Learning for Cyber Security Computer Science, Griffith University Co-Supervisor
2024 PhD Deep Learning for Water Quality Monitoring Computer Science, Griffith University Co-Supervisor
2024 PhD Next-Level Predictions: Transforming Tabular Data for High-Dimensional Precision PhD (Computer Science), College of Engineering, Science and Environment, The University of Newcastle Co-Supervisor
2023 PhD Deep Transformation of Tabular Categorical Data for Enhanced Predictive Analytics in Health Informatics PhD (Computer Science), College of Engineering, Science and Environment, The University of Newcastle Co-Supervisor
2022 PhD Deep Learning for Protein-Ligand Affinity Prediction Computer Science, Griffith University Co-Supervisor
2021 PhD Deep Learning Approaches for Motor Neuron Diseases Computer Science, Griffith University Consultant Supervisor

Past Supervision

Year Level of Study Research Title Program Supervisor Type
2025 PhD Deep Learning Methods for Secondary Protein Structure Prediction Computer Science, Griffith University Principal Supervisor
2024 PhD Simplified Deep Learning Models for Protein Backbone Angle Prediction Computer Science, Griffith University Co-Supervisor
2024 PhD Distance Maps for Prediction of Protein Structures and Protein-ligand Affinity Computer Science, Griffith University Principal Supervisor
2023 PhD Constraint Guided Local Search for Protein Structure Prediction Computer Science, Griffith University Co-Supervisor
2022 PhD Learning in Combinatorial Constraint Optimisation Computer Science, Griffith University Co-Supervisor
2020 PhD Constraint-based Automated Planning and Business Process Modelling Computer Science, Griffith University Principal Supervisor
2020 PhD Molecular toxicity prediction using deep learning Computer Science, Griffith University Co-Supervisor
2019 PhD Constraint Directed Scheduling Computer Science, Griffith University Co-Supervisor
2017 PhD Exploiting Structures in Combinatorial Search Computer Science, Griffith University Co-Supervisor
2014 PhD Local Search Heuristics for Protein Structure Prediction Computer Science, Griffith University Co-Supervisor
Edit

Research Opportunities

Intelligent Search and Optimisation

Developing intelligent algorithms for search and optimisation: planning, scheduling, constraints, satisfiability problems

PHD

School of Information and Physical Sciences

1/1/2024 - 31/12/2030

Contact

Doctor Mahakim Newton
University of Newcastle
School of Information and Physical Sciences
mahakim.newton@newcastle.edu.au

Constraint-Based Explainable Machine Learning

Using constraints in achieving explainability in machine learning.

PHD

School of Information and Physical Sciences

1/1/2024 - 31/12/2030

Contact

Doctor Mahakim Newton
University of Newcastle
School of Information and Physical Sciences
mahakim.newton@newcastle.edu.au

Artificial Intelligence for Environment

Using internet of things, machine learning and search and optimisation for the environment, agriculture, and disaster management

PHD

School of Information and Physical Sciences

1/1/2024 - 31/12/2030

Contact

Doctor Mahakim Newton
University of Newcastle
School of Information and Physical Sciences
mahakim.newton@newcastle.edu.au

Computational Drug Discovery

Using machine learning and search and optimisation in various stages of computational drug discovery

PHD

School of Information and Physical Sciences

1/1/2024 - 31/12/2030

Contact

Doctor Mahakim Newton
University of Newcastle
School of Information and Physical Sciences
mahakim.newton@newcastle.edu.au

Artificial Intelligence for Health

Using big data, machine learning and search and optimisation for predictive and prescriptive health informatics

PHD

School of Information and Physical Sciences

1/1/2024 - 31/12/2030

Contact

Doctor Mahakim Newton
University of Newcastle
School of Information and Physical Sciences
mahakim.newton@newcastle.edu.au

Edit

Dr Mahakim Newton

Position

Lecturer - Data Science
School of Information and Physical Sciences
College of Engineering, Science and Environment

Focus area

Data Science and Statistics

Contact Details

Email mahakim.newton@newcastle.edu.au
Phone 0249216850

Office

Room EF124
Building Engineering F
Location Callaghan Campus
University Drive
Callaghan, NSW 2308
Australia
Edit