High Predictive Performance Models via Semi-Parametric Survival Regression. This project will develop novel statistical models for high prediction performance. When applied to help doctor to treat patients, these models allow the users to include gene or other biomarkers for predicting effectiveness of a treatment. When applied to risk management in finance, these models are capable to include an organization's or individual's ongoing finance status to predict, for example, the probability of or ....High Predictive Performance Models via Semi-Parametric Survival Regression. This project will develop novel statistical models for high prediction performance. When applied to help doctor to treat patients, these models allow the users to include gene or other biomarkers for predicting effectiveness of a treatment. When applied to risk management in finance, these models are capable to include an organization's or individual's ongoing finance status to predict, for example, the probability of or time to loan default. Innovative computational methods will be developed for fitting these models. Compared to traditional prediction method, this approach allows greater flexibility while being superior in terms of statistical accuracy and bias. Extensive analyses of healthcare data from diverse fields will be undertaken.Read moreRead less
Principled statistical methods for high-dimensional correlation networks. This project aims to develop a novel and principled approach for building correlation networks. Correlation networks aim to identify the most significant associations present in modern massive datasets, and have numerous applications, ranging from the biomedical and environmental sciences to the social sciences. Nodes of such networks represent features, and edges represent associations, or the lack thereof. Current method ....Principled statistical methods for high-dimensional correlation networks. This project aims to develop a novel and principled approach for building correlation networks. Correlation networks aim to identify the most significant associations present in modern massive datasets, and have numerous applications, ranging from the biomedical and environmental sciences to the social sciences. Nodes of such networks represent features, and edges represent associations, or the lack thereof. Current methods are not readily scalable to modern ultra-high dimensional settings, and do not account for uncertainty in the estimated associations. This project will develop a principled, highly scalable methodology for building such networks, which incorporates uncertainty quantification. Emphasis is placed on modern ultra-high dimensional settings in which differentiating a true correlation from a spurious one is a notoriously difficult task.Read moreRead less
Bayesian inversion and computation applied to atmospheric flux fields. This project aims to make use of unprecedented sources of measurements, from remote sensing and in situ data, to estimate the sources and sinks of greenhouse gases. An overabundance of greenhouse gases in Earth's atmosphere is arguably the most serious long-term threat to the planet's ecosystems. This project will combine measurement uncertainties, process uncertainties in the physical transport models, and any parameter unce ....Bayesian inversion and computation applied to atmospheric flux fields. This project aims to make use of unprecedented sources of measurements, from remote sensing and in situ data, to estimate the sources and sinks of greenhouse gases. An overabundance of greenhouse gases in Earth's atmosphere is arguably the most serious long-term threat to the planet's ecosystems. This project will combine measurement uncertainties, process uncertainties in the physical transport models, and any parameter uncertainties, to provide reliable uncertainty quantification for the estimates. This will be achieved with new Bayesian spatio-temporal inversions and big-data computational strategies. The resulting statistical inferences on greenhouse-gas flux fields will enable the development of critical mitigation strategies. These new statistical inferences will be a valuable resource to policy-makers worldwide, who are assessing progress towards global commitments. Further, the final product may assist in developing cost-effective mitigation strategies in the presence of uncertainty.Read moreRead less
Discovery Early Career Researcher Award - Grant ID: DE180101252
Funder
Australian Research Council
Funding Amount
$343,450.00
Summary
Statistical theory and algorithms for joint inference of complex networks. This project aims to address the challenges in jointly modelling complex networks by applying an integrated approach encompassing statistical theory, computation, and applications. The project expects to contribute to core statistical methodology development for complex inference and generate new knowledge in the fields of genomics, neuroscience, and social science through in-depth analyses of large-scale multilayered net ....Statistical theory and algorithms for joint inference of complex networks. This project aims to address the challenges in jointly modelling complex networks by applying an integrated approach encompassing statistical theory, computation, and applications. The project expects to contribute to core statistical methodology development for complex inference and generate new knowledge in the fields of genomics, neuroscience, and social science through in-depth analyses of large-scale multilayered network data. Expected outcomes include enhanced theoretical and computational frameworks for probabilistic network models to better utilise the power of multiple observations. This should foster international and interdisciplinary collaborations and add significant value to the rapidly progressing field of networks research.Read moreRead less
Discovery Early Career Researcher Award - Grant ID: DE200100944
Funder
Australian Research Council
Funding Amount
$427,068.00
Summary
Statistical frameworks for high-parameter imaging cytometry data. The project aims to develop statistical and bioinformatics methodology for characterising the complex interactions between cells in their native environment. Recent advances in imaging cytometry technologies have made it possible to observe the behaviour of multiple cell-types in tissue concurrently. The intended outcome is a suite of statistical methodologies that are crucial for addressing a variety of biological problems with t ....Statistical frameworks for high-parameter imaging cytometry data. The project aims to develop statistical and bioinformatics methodology for characterising the complex interactions between cells in their native environment. Recent advances in imaging cytometry technologies have made it possible to observe the behaviour of multiple cell-types in tissue concurrently. The intended outcome is a suite of statistical methodologies that are crucial for addressing a variety of biological problems with these state-of-the-art technologies. This work will advance knowledge in bioinformatics, statistics and image analysis, providing benefits to scientists studying the fundamental behaviour of cells and underlying disease mechanisms.Read moreRead less
Fast flexible feature selection for high dimensional challenging data. The project aims to provide new frameworks for fast flexible feature selection and appropriate modelling of heterogeneous data through structural varying-coefficient regression models. The outcomes will be a series of new statistical methods and concepts enabling more powerful modelling of complex bioscience data. The project will create the science for building reliable statistical models taking model uncertainty into accoun ....Fast flexible feature selection for high dimensional challenging data. The project aims to provide new frameworks for fast flexible feature selection and appropriate modelling of heterogeneous data through structural varying-coefficient regression models. The outcomes will be a series of new statistical methods and concepts enabling more powerful modelling of complex bioscience data. The project will create the science for building reliable statistical models taking model uncertainty into account, impacting how results will be interpreted, and with accompanying software. This will be a significant improvement in the assessment of model confidence in the food and health research priority areas including areas such as meat science, Huntington’s disease, and kidney transplantation.Read moreRead less
Computational methods for population-size-dependent branching processes. Branching processes are the primary mathematical tool used to model populations that evolve randomly in time. Most key results in the theory are derived under the simplifying assumption that individuals reproduce and die independently of each other. However, this assumption fails in most real-life situations, in particular when the environment has limited resources or when the habitat has a restricted capacity. This project ....Computational methods for population-size-dependent branching processes. Branching processes are the primary mathematical tool used to model populations that evolve randomly in time. Most key results in the theory are derived under the simplifying assumption that individuals reproduce and die independently of each other. However, this assumption fails in most real-life situations, in particular when the environment has limited resources or when the habitat has a restricted capacity. This project aims to develop novel and effective algorithmic techniques and statistical methods for a class of branching processes with dependences. We will use these results to study significant problems in the conservation of endangered island bird populations in Oceania, and to help inform their conservation management.Read moreRead less
New universality in stochastic systems. This project aims to uncover new analyses and effects in the complex behaviour of non-linear systems with random noise. Many systems originate near an unstable equilibrium. This project will develop a new mathematical theory that establishes a universality in the way the long term effect of noise expresses itself as random initial conditions in the dynamics. It will fill gaps in Mathematics and make refinements to existing fundamental scientific laws by in ....New universality in stochastic systems. This project aims to uncover new analyses and effects in the complex behaviour of non-linear systems with random noise. Many systems originate near an unstable equilibrium. This project will develop a new mathematical theory that establishes a universality in the way the long term effect of noise expresses itself as random initial conditions in the dynamics. It will fill gaps in Mathematics and make refinements to existing fundamental scientific laws by including random initial conditions as predicted by our theory. This will advance our understanding of complex systems subjected to noise and will provide significant benefits in the scientific discoveries in Biology, Ecology, Physics and other Sciences where such systems are frequently met.Read moreRead less
Large Markov decision processes and combinatorial optimisation. Markov decision processes continue to gain in popularity for modelling a wide range of applications ranging from analysis of supply chains and queueing networks to cognitive science and control of autonomous vehicles. Nonetheless, they tend to become numerically intractable as the size of the model grows fast. Recent works use machine learning techniques to overcome this crucial issue, but with no convergence guarantee. This project ....Large Markov decision processes and combinatorial optimisation. Markov decision processes continue to gain in popularity for modelling a wide range of applications ranging from analysis of supply chains and queueing networks to cognitive science and control of autonomous vehicles. Nonetheless, they tend to become numerically intractable as the size of the model grows fast. Recent works use machine learning techniques to overcome this crucial issue, but with no convergence guarantee. This project aims to provide theoretically sound frameworks for solving large Markov decision processes, and exploit them to solve important combinatorial optimisation problems. This timely project can promote Australia's position in the development of such novel frameworks for many scientific and industrial applications.Read moreRead less
Technology-Driven and Scalable Regression Methodology, Computing and Theory. Regression is a mainstay of data analysis, statistics, machine learning and data science but is in continual need of enhancement in the face of technological change. Scalability and flexibility for the handling of non-linear signals are fundamental to the practical utility of new regression methodology. Several streams of research aimed at confronting data from specific technologies as well as generic types of data are ....Technology-Driven and Scalable Regression Methodology, Computing and Theory. Regression is a mainstay of data analysis, statistics, machine learning and data science but is in continual need of enhancement in the face of technological change. Scalability and flexibility for the handling of non-linear signals are fundamental to the practical utility of new regression methodology. Several streams of research aimed at confronting data from specific technologies as well as generic types of data are proposed. The project is to be networked with researchers in the United States of America and aims to have Australia-based researchers providing leadership in terms of methodological, theoretical, computational and software development.Read moreRead less