A Novel Approach to Semi-Supervised Statistical Machine Learning. Recent successes in the construction of classifiers for making diagnoses and predictions are due in part to their using much data labelled with respect to their class of origin. But typically there are little labelled data but plentiful unlabelled data. The goal of semi-supervised learning (SSL) is to leverage large amounts of unlabelled data to improve the performance using only small labelled datasets and so SSL is of paramount ....A Novel Approach to Semi-Supervised Statistical Machine Learning. Recent successes in the construction of classifiers for making diagnoses and predictions are due in part to their using much data labelled with respect to their class of origin. But typically there are little labelled data but plentiful unlabelled data. The goal of semi-supervised learning (SSL) is to leverage large amounts of unlabelled data to improve the performance using only small labelled datasets and so SSL is of paramount importance to applications where it is expensive or impractical to obtain much labelled data. The project is to develop a novel SSL approach that adopts a missingness mechanism for the missing labels to build a classifier that not only improves accuracy but it can be greater than if the missing labels were known.
Read moreRead less
Discovery Early Career Researcher Award - Grant ID: DE200100200
Funder
Australian Research Council
Funding Amount
$418,398.00
Summary
Next generation causal inference methods for biological data. This project aims to develop next generation causal inference methods for analysing biological data especially the single cell sequencing data and their applications in cell biology. Although Artificial Intelligence and Statistical Machine Learning have been applied successfully in many fields, including biological research, there is still a serious lack of methods for interpreting and reasoning about the mechanism of biological syste ....Next generation causal inference methods for biological data. This project aims to develop next generation causal inference methods for analysing biological data especially the single cell sequencing data and their applications in cell biology. Although Artificial Intelligence and Statistical Machine Learning have been applied successfully in many fields, including biological research, there is still a serious lack of methods for interpreting and reasoning about the mechanism of biological systems, the ultimate goal of research in many areas. Efficient data-driven causality discovery approaches developed by the project will be a timely and significant contribution to the knowledge of biology and statistics as well as the battle against health threats.
Read moreRead less
Scalable and Robust Bayesian Inference for Implicit Statistical Models. This project aims to develop the next generation of efficient methods for fitting complex simulation-based statistical models to data. Practitioners and scientists are interested in such implicit models to enable discoveries, produce accurate predictions and inform decisions under uncertainty. However, the associated computational cost has restricted researchers to implicit models that must have a small number of parameters ....Scalable and Robust Bayesian Inference for Implicit Statistical Models. This project aims to develop the next generation of efficient methods for fitting complex simulation-based statistical models to data. Practitioners and scientists are interested in such implicit models to enable discoveries, produce accurate predictions and inform decisions under uncertainty. However, the associated computational cost has restricted researchers to implicit models that must have a small number of parameters and be well specified, impeding scientific progress. This project will develop new computational methods and algorithms for implicit models that scale to high dimensions and are robust to misspecification. Benefits will arise from the more routine use of implicit models in epidemiology, biology, ecology and other fields.Read moreRead less
Advances in Sequential Monte Carlo Methods for Complex Bayesian Models. This project aims to develop efficient statistical algorithms for parameter estimation of complex stochastic models that currently cannot be handled. Parameter estimation is an essential component of mathematical modelling for answering scientific questions and revealing new insights. Current parameter estimation methods can be inefficient and require too much user intervention. This project will develop novel Bayesian alg ....Advances in Sequential Monte Carlo Methods for Complex Bayesian Models. This project aims to develop efficient statistical algorithms for parameter estimation of complex stochastic models that currently cannot be handled. Parameter estimation is an essential component of mathematical modelling for answering scientific questions and revealing new insights. Current parameter estimation methods can be inefficient and require too much user intervention. This project will develop novel Bayesian algorithms that are optimally automated and efficient by exploiting ever-improving parallel computing devices. The new methods will allow practitioners to process realistic models, enabling new scientific discoveries in a wide range of disciplines such as biology, ecology, agriculture, hydrology and finance.Read moreRead less
Statistical methods for quantifying variation in spatiotemporal areal data. This project aims to develop new statistical methods for extracting insights into spatial and temporal variation in areal data. These tools will extend the Australian Cancer Atlas which provides small area estimates for 20 cancers across Australia. The project is significant because it will allow government and other organisations to reap dividends from investment in collecting spatial information and it will enable mode ....Statistical methods for quantifying variation in spatiotemporal areal data. This project aims to develop new statistical methods for extracting insights into spatial and temporal variation in areal data. These tools will extend the Australian Cancer Atlas which provides small area estimates for 20 cancers across Australia. The project is significant because it will allow government and other organisations to reap dividends from investment in collecting spatial information and it will enable modelled small-area estimates to be released without compromising confidentiality. The expected outcomes include new statistical knowledge and new insights into cancer. The results will benefit the many disciplines, managers and policy makers that make decisions based on geographic data mapped over space and time. Read moreRead less
Stochastic majorization--minimization algorithms for data science. The changing nature of acquisition and storage data has made the process of drawing inference infeasible with traditional statistical and machine learning methods. Modern data are often acquired in real time, in an incremental nature, and are often available in too large a volume to process on conventional machinery. The project proposes to study the family of stochastic majorisation-minimisation algorithms for computation of inf ....Stochastic majorization--minimization algorithms for data science. The changing nature of acquisition and storage data has made the process of drawing inference infeasible with traditional statistical and machine learning methods. Modern data are often acquired in real time, in an incremental nature, and are often available in too large a volume to process on conventional machinery. The project proposes to study the family of stochastic majorisation-minimisation algorithms for computation of inferential quantities in an incremental manner. The proposed stochastic algorithms encompass and extend upon a wide variety of current algorithmic frameworks for fitting statistical and machine learning models, and can be used to produce feasible and practical algorithms for complex models, both current and future.
Read moreRead less
In for the count: Maximising trust and reliability in Australian elections. This project aims to develop innovative approaches to identifying, measuring, and evaluating errors and purposeful intervention in the uniquely complex elections at the basis of Australian democracy. Such methods can underpin a world-class election auditing system, which contends with the risks that are emerging at the intersection of election digitisation, cybersecurity and foreign interference. The project’s expected o ....In for the count: Maximising trust and reliability in Australian elections. This project aims to develop innovative approaches to identifying, measuring, and evaluating errors and purposeful intervention in the uniquely complex elections at the basis of Australian democracy. Such methods can underpin a world-class election auditing system, which contends with the risks that are emerging at the intersection of election digitisation, cybersecurity and foreign interference. The project’s expected outcomes are new auditing methods, tested on real Australian election data, with their benefits quantified against global best practice. The research outputs should help reinforce the community’s trust in Australian elections, which are a foundation for our security, social cohesion, and political resilience.Read moreRead less
Large Markov decision processes and combinatorial optimisation. Markov decision processes continue to gain in popularity for modelling a wide range of applications ranging from analysis of supply chains and queueing networks to cognitive science and control of autonomous vehicles. Nonetheless, they tend to become numerically intractable as the size of the model grows fast. Recent works use machine learning techniques to overcome this crucial issue, but with no convergence guarantee. This project ....Large Markov decision processes and combinatorial optimisation. Markov decision processes continue to gain in popularity for modelling a wide range of applications ranging from analysis of supply chains and queueing networks to cognitive science and control of autonomous vehicles. Nonetheless, they tend to become numerically intractable as the size of the model grows fast. Recent works use machine learning techniques to overcome this crucial issue, but with no convergence guarantee. This project aims to provide theoretically sound frameworks for solving large Markov decision processes, and exploit them to solve important combinatorial optimisation problems. This timely project can promote Australia's position in the development of such novel frameworks for many scientific and industrial applications.Read moreRead less
Discovery Early Career Researcher Award - Grant ID: DE200101253
Funder
Australian Research Council
Funding Amount
$349,586.00
Summary
Making Machine Learning Fair(er). This project aims to develop and implement statistical methods to fight against algorithm bias. In doing so, this project expects to generate new knowledge in the mathematical sciences by employing innovative and interdisciplinary approaches to the development of fairness constraints on machine learning algorithms. Fairness will be seen through the lens of invariance, allowing the developed conceptual framework to find broad applications. Expected outcomes of t ....Making Machine Learning Fair(er). This project aims to develop and implement statistical methods to fight against algorithm bias. In doing so, this project expects to generate new knowledge in the mathematical sciences by employing innovative and interdisciplinary approaches to the development of fairness constraints on machine learning algorithms. Fairness will be seen through the lens of invariance, allowing the developed conceptual framework to find broad applications. Expected outcomes of this project include improved techniques for imposing invariance on deep learning algorithms. This should provide significant benefits to the general public by contributing to the advancement of socially responsible and conscientious machine learning.Read moreRead less
Precision ecology: the modern era of designed experiments in plant ecology. This project aims to develop the field of precision ecology, forging a new era of designed experiments where sampling is informed by research questions and what is known about the ecological process being studied. Through the development of novel statistical methods, new experiments globally will be designed to answer important ecological questions including what influence abiotic and biotic factors have on plant commun ....Precision ecology: the modern era of designed experiments in plant ecology. This project aims to develop the field of precision ecology, forging a new era of designed experiments where sampling is informed by research questions and what is known about the ecological process being studied. Through the development of novel statistical methods, new experiments globally will be designed to answer important ecological questions including what influence abiotic and biotic factors have on plant communities over time and different spatial scales. Expected outcomes include new methods and tools that will modernise how future experiments will be conducted in plant ecology. This will provide significant transdisciplinary benefits including new statistical methods that target scientific discovery in ecological studies.Read moreRead less