Inference for Hawkes processes with challenging data. The Hawkes processes are statistical models for the analysis of high-impact event sequences, such as bushfires, earthquakes, infectious diseases, and cyber attacks. When the times and/or marks are missing for some events or when the data is otherwise incomplete, it is challenging to fit these models and perform diagnostic checks on the fitted models. This project aims to develop novel statistical methods to fit these models in the presence of ....Inference for Hawkes processes with challenging data. The Hawkes processes are statistical models for the analysis of high-impact event sequences, such as bushfires, earthquakes, infectious diseases, and cyber attacks. When the times and/or marks are missing for some events or when the data is otherwise incomplete, it is challenging to fit these models and perform diagnostic checks on the fitted models. This project aims to develop novel statistical methods to fit these models in the presence of incomplete data and to check the goodness-of-fit of the fitted models. The expected outcomes include publications documenting these methods and software packages implementing them. The primary benefits include the advancement of statistical methodology and the training of junior research personnel. Read moreRead less
Technology-Driven and Scalable Regression Methodology, Computing and Theory. Regression is a mainstay of data analysis, statistics, machine learning and data science but is in continual need of enhancement in the face of technological change. Scalability and flexibility for the handling of non-linear signals are fundamental to the practical utility of new regression methodology. Several streams of research aimed at confronting data from specific technologies as well as generic types of data are ....Technology-Driven and Scalable Regression Methodology, Computing and Theory. Regression is a mainstay of data analysis, statistics, machine learning and data science but is in continual need of enhancement in the face of technological change. Scalability and flexibility for the handling of non-linear signals are fundamental to the practical utility of new regression methodology. Several streams of research aimed at confronting data from specific technologies as well as generic types of data are proposed. The project is to be networked with researchers in the United States of America and aims to have Australia-based researchers providing leadership in terms of methodological, theoretical, computational and software development.Read moreRead less
Statistical and computational methods using a multiscale approach for protein identification and quantification. Proteins are critically important in the onset and ongoing illness associated with disease. Key proteins may serve as markers to diagnose or predict the course of a disease, or even become the target of pharmaceuticals. Accurate, efficient and robust algorithms are a critical component in protein identification. This research provides novel statistical algorithms for protein identific ....Statistical and computational methods using a multiscale approach for protein identification and quantification. Proteins are critically important in the onset and ongoing illness associated with disease. Key proteins may serve as markers to diagnose or predict the course of a disease, or even become the target of pharmaceuticals. Accurate, efficient and robust algorithms are a critical component in protein identification. This research provides novel statistical algorithms for protein identification using multiscale analysis techniques. Their applications in the bio-medical field will enable Australian and international researchers to identify key proteins more accurately, than current methods, leading to improve health, medical, and biological research outcomes.Read moreRead less
New statistical methods for identifying micro-ribonucleic acid (miRNA) regulatory networks. Understanding gene regulatory networks is critical in the understanding of fundamental biological systems. These networks have important implications for the discovery of fundamental mechanisms relating to the diagnosis and management of many illnesses. This research will provide new statistical methods to identify regulatory micro-ribonucleic acid modules and to understand their relationship in gene regu ....New statistical methods for identifying micro-ribonucleic acid (miRNA) regulatory networks. Understanding gene regulatory networks is critical in the understanding of fundamental biological systems. These networks have important implications for the discovery of fundamental mechanisms relating to the diagnosis and management of many illnesses. This research will provide new statistical methods to identify regulatory micro-ribonucleic acid modules and to understand their relationship in gene regulatory networks through multiple covariance estimation and multivariate classification techniques. My results should enable researchers to better understand the regulation underlying biological systems, leading to improved human health, medical and biological research outcomes.Read moreRead less
Innovations in Bayesian likelihood-free inference. Bayesian inference is a statistical method of choice in applied science. This project will develop innovative tools which permit Bayesian inference in problems considered intractable only a few years ago. These methods will expedite advances in multidisciplinary research across a range of applications. With these foundations, this project will accelerate national research efforts into improving frameworks for projecting trends in water availabil ....Innovations in Bayesian likelihood-free inference. Bayesian inference is a statistical method of choice in applied science. This project will develop innovative tools which permit Bayesian inference in problems considered intractable only a few years ago. These methods will expedite advances in multidisciplinary research across a range of applications. With these foundations, this project will accelerate national research efforts into improving frameworks for projecting trends in water availability and management, the impact of climate extremes, telecommunications engineering, HIV and infectious disease modelling and biostatistics. With many sectors unable to recruit appropriately trained statisticians within Australia, this project will train four PhD students in Bayesian statistics.
Read moreRead less
Statistical methods for analysing multi-source microarray data and building gene regulatory networks. I will devise a statistical learning technique that does not force a gene to be assigned to exactly one category. This technique reflects the biological reality that a gene can belong to two or more functional categories. Therefore, the new technique will improve a model's ability to identify regulatory genes in different types of cancer; these regulatory genes can be targeted by new anti-cancer ....Statistical methods for analysing multi-source microarray data and building gene regulatory networks. I will devise a statistical learning technique that does not force a gene to be assigned to exactly one category. This technique reflects the biological reality that a gene can belong to two or more functional categories. Therefore, the new technique will improve a model's ability to identify regulatory genes in different types of cancer; these regulatory genes can be targeted by new anti-cancer drugs resulting in a more effective treatment. I will model gene regulatory networks using microarray data from multiple sources. These networks will be used to identify regulatory cliques - a group of genes that are vital for a cellular function. This will improve our understanding of debilitating conditions such as asthma.Read moreRead less
Optimising experimental design for robust product development: a case study for high-efficiency energy generation. This project tackles key mathematical challenges to provide a powerful new methodology and tool for optimal product design, making smarter use of limited information, minimising costly trials, shortening the product cycle, and boosting the competitiveness of both the Australian manufacturing and alternative energy production industries.
New methods for small group analysis from sample surveys. National and state averages of statistics on issues such as unemployment, salinity, drought impact, and health often hide large differences between population sub-groups and between small areas. This local variation needs to be understood so that effective policies can be developed and carried out efficiently and their impact monitored. This project will provide, for the first time, robust and efficient methods for providing information o ....New methods for small group analysis from sample surveys. National and state averages of statistics on issues such as unemployment, salinity, drought impact, and health often hide large differences between population sub-groups and between small areas. This local variation needs to be understood so that effective policies can be developed and carried out efficiently and their impact monitored. This project will provide, for the first time, robust and efficient methods for providing information on these variations using data from large-scale national and state surveys. This will lead to significant improvements in the data available for small population groups and small areas, allowing better targeting of policies aimed at addressing local differences.Read moreRead less
Handling Missing Data in Complex Household Surveys. The Australian Bureau of Statistics (ABS) has an extensive program of household surveys that is a key source of information on the social and economic conditions of the population. They provide statistics and data on a large range of social and economic topics, such as health, education, the labour force, income and expenditure. Analysis of household survey data by a variety of organisations underpins policy development and evaluation and the e ....Handling Missing Data in Complex Household Surveys. The Australian Bureau of Statistics (ABS) has an extensive program of household surveys that is a key source of information on the social and economic conditions of the population. They provide statistics and data on a large range of social and economic topics, such as health, education, the labour force, income and expenditure. Analysis of household survey data by a variety of organisations underpins policy development and evaluation and the expenditure of billions of dollars. This project will substantially improve the cost-efficiency and reliability of Australian household survey data, by creating new approaches for handling missing data that deal with the realities of typical household surveys.Read moreRead less
Discovery Early Career Researcher Award - Grant ID: DE130101670
Funder
Australian Research Council
Funding Amount
$370,410.00
Summary
Scalable Bayesian model selection for massive data sets. This project will develop highly innovative, efficient and ultimately effective methodology for Bayesian model selection for large-scale problems which commonly arise in biostatistics and bioinformatics. The resulting methodology will dramatically reduce the duration of analyses in these areas from days or weeks to minutes or hours.