Industrial Transformation Training Centres - Grant ID: IC190100031
Funder
Australian Research Council
Funding Amount
$3,973,202.00
Summary
ARC Training Centre in Data Analytics for Resources and Environments (DARE). Understanding the cumulative impact of actions regarding the use of our resources has important long-term consequences for Australia’s economic, societal and environmental health. Yet despite the importance of these cumulative impacts, and the availability of data, many decisions and policies are based on limited amounts of data and rudimentary data analysis, with little appreciation of the critical role that understand ....ARC Training Centre in Data Analytics for Resources and Environments (DARE). Understanding the cumulative impact of actions regarding the use of our resources has important long-term consequences for Australia’s economic, societal and environmental health. Yet despite the importance of these cumulative impacts, and the availability of data, many decisions and policies are based on limited amounts of data and rudimentary data analysis, with little appreciation of the critical role that understanding and quantifying uncertainty plays in the process. The aim of Data Analytics in Resources and Environment (DARE) is to develop and deliver the data science skills and tools for Australia’s resource industries to make the best possible evidence-based decisions in exploiting and stewarding the nation’s natural resources.Read moreRead less
Leaves in 3D: photosynthesis and water-use efficiency. This project aims to develop leaf anatomical ideotypes with improved photosynthesis and water-use efficiency for wheat, rice, chickpea and cotton using novel three dimensional imaging and modelling techniques. This project expects to generate new understanding of the role of leaf anatomy on leaf function. Expected outcomes of this project include the world's first 3D spatially-explicit, anatomically accurate model of leaves of crop plants to ....Leaves in 3D: photosynthesis and water-use efficiency. This project aims to develop leaf anatomical ideotypes with improved photosynthesis and water-use efficiency for wheat, rice, chickpea and cotton using novel three dimensional imaging and modelling techniques. This project expects to generate new understanding of the role of leaf anatomy on leaf function. Expected outcomes of this project include the world's first 3D spatially-explicit, anatomically accurate model of leaves of crop plants to allow virtual experiments identifying optimized anatomy for improved photosynthetic performance. Benefits to the agricultural industry include increased crop productivity and water-use efficiency to meet future global food demand and to make the most of Australia's limited water resourcesRead moreRead less
Principled statistical methods for high-dimensional correlation networks. This project aims to develop a novel and principled approach for building correlation networks. Correlation networks aim to identify the most significant associations present in modern massive datasets, and have numerous applications, ranging from the biomedical and environmental sciences to the social sciences. Nodes of such networks represent features, and edges represent associations, or the lack thereof. Current method ....Principled statistical methods for high-dimensional correlation networks. This project aims to develop a novel and principled approach for building correlation networks. Correlation networks aim to identify the most significant associations present in modern massive datasets, and have numerous applications, ranging from the biomedical and environmental sciences to the social sciences. Nodes of such networks represent features, and edges represent associations, or the lack thereof. Current methods are not readily scalable to modern ultra-high dimensional settings, and do not account for uncertainty in the estimated associations. This project will develop a principled, highly scalable methodology for building such networks, which incorporates uncertainty quantification. Emphasis is placed on modern ultra-high dimensional settings in which differentiating a true correlation from a spurious one is a notoriously difficult task.Read moreRead less
New methods for modelling real-world extremes. This project aims to develop new theory and methods for analysing and predicting extreme values observed in real-world processes. Many existing techniques are limited by convenient mathematical assumptions that commonly do not hold in practice: dependence at asymptotic levels, process stationarity, and that the observed data are direct measurements of the process of interest. As a result, using these techniques may produce undesirable results. Expec ....New methods for modelling real-world extremes. This project aims to develop new theory and methods for analysing and predicting extreme values observed in real-world processes. Many existing techniques are limited by convenient mathematical assumptions that commonly do not hold in practice: dependence at asymptotic levels, process stationarity, and that the observed data are direct measurements of the process of interest. As a result, using these techniques may produce undesirable results. Expected outcomes of this project include theoretically justified data analysis techniques that can accurately model extreme values seen in the real world. Project benefits include more realistic analyses of nationally important applications in climate, bushfire insurance risk, and anomaly detection.Read moreRead less
Innovative statistical methods for analysing high-dimensional counts. The aim is to develop fast, modern statistical methods for analysing high dimensional data in ecology at large scales, in particular, for visualising, classifying and predicting ecological communities. The benefit of the project is a set of multivariate tools that can be used to better understand biodiversity and its response to environmental drivers, a challenging statistical problem. The proposed methods for analysing high d ....Innovative statistical methods for analysing high-dimensional counts. The aim is to develop fast, modern statistical methods for analysing high dimensional data in ecology at large scales, in particular, for visualising, classifying and predicting ecological communities. The benefit of the project is a set of multivariate tools that can be used to better understand biodiversity and its response to environmental drivers, a challenging statistical problem. The proposed methods for analysing high dimensional data can provide insight into large scale questions in ecology, such as automated identification of biogeographic boundaries. The expected outcome is a powerful statistical toolset for model-based analysis of high dimensional data, introducing modern multivariate approaches to a high-impact area of ecology.Read moreRead less
Fast flexible feature selection for high dimensional challenging data. The project aims to provide new frameworks for fast flexible feature selection and appropriate modelling of heterogeneous data through structural varying-coefficient regression models. The outcomes will be a series of new statistical methods and concepts enabling more powerful modelling of complex bioscience data. The project will create the science for building reliable statistical models taking model uncertainty into accoun ....Fast flexible feature selection for high dimensional challenging data. The project aims to provide new frameworks for fast flexible feature selection and appropriate modelling of heterogeneous data through structural varying-coefficient regression models. The outcomes will be a series of new statistical methods and concepts enabling more powerful modelling of complex bioscience data. The project will create the science for building reliable statistical models taking model uncertainty into account, impacting how results will be interpreted, and with accompanying software. This will be a significant improvement in the assessment of model confidence in the food and health research priority areas including areas such as meat science, Huntington’s disease, and kidney transplantation.Read moreRead less
Bayesian inversion and computation applied to atmospheric flux fields. This project aims to make use of unprecedented sources of measurements, from remote sensing and in situ data, to estimate the sources and sinks of greenhouse gases. An overabundance of greenhouse gases in Earth's atmosphere is arguably the most serious long-term threat to the planet's ecosystems. This project will combine measurement uncertainties, process uncertainties in the physical transport models, and any parameter unce ....Bayesian inversion and computation applied to atmospheric flux fields. This project aims to make use of unprecedented sources of measurements, from remote sensing and in situ data, to estimate the sources and sinks of greenhouse gases. An overabundance of greenhouse gases in Earth's atmosphere is arguably the most serious long-term threat to the planet's ecosystems. This project will combine measurement uncertainties, process uncertainties in the physical transport models, and any parameter uncertainties, to provide reliable uncertainty quantification for the estimates. This will be achieved with new Bayesian spatio-temporal inversions and big-data computational strategies. The resulting statistical inferences on greenhouse-gas flux fields will enable the development of critical mitigation strategies. These new statistical inferences will be a valuable resource to policy-makers worldwide, who are assessing progress towards global commitments. Further, the final product may assist in developing cost-effective mitigation strategies in the presence of uncertainty.Read moreRead less
Fast approximate inference methods: new algorithms, applications and theory. This project aims to develop new algorithms and theory for fast approximate inference and lay down infrastructure to aid future extensions. Fast approximate inference methods are a principled and extensible means of fitting large and complex statistical models to big data sets. They come into their own in applications where speed is paramount and traditional approaches are not feasible. The project aims to lead to prac ....Fast approximate inference methods: new algorithms, applications and theory. This project aims to develop new algorithms and theory for fast approximate inference and lay down infrastructure to aid future extensions. Fast approximate inference methods are a principled and extensible means of fitting large and complex statistical models to big data sets. They come into their own in applications where speed is paramount and traditional approaches are not feasible. The project aims to lead to practical outcomes from better business decision-making for insurance data warehouses, to improved medical imaging technology.Read moreRead less
Computational methods for population-size-dependent branching processes. Branching processes are the primary mathematical tool used to model populations that evolve randomly in time. Most key results in the theory are derived under the simplifying assumption that individuals reproduce and die independently of each other. However, this assumption fails in most real-life situations, in particular when the environment has limited resources or when the habitat has a restricted capacity. This project ....Computational methods for population-size-dependent branching processes. Branching processes are the primary mathematical tool used to model populations that evolve randomly in time. Most key results in the theory are derived under the simplifying assumption that individuals reproduce and die independently of each other. However, this assumption fails in most real-life situations, in particular when the environment has limited resources or when the habitat has a restricted capacity. This project aims to develop novel and effective algorithmic techniques and statistical methods for a class of branching processes with dependences. We will use these results to study significant problems in the conservation of endangered island bird populations in Oceania, and to help inform their conservation management.Read moreRead less
Empirical and computational solutions for multi-omics single-cell assays. Emerging single-cell sequencing technologies are transforming molecular cell biology, but identifying novel cell types and their functions requires the integration of highly heterogeneous data. The development of computational methods able to extract biologically relevant results is hindered by the lack of high-quality datasets. This project aims to develop novel sequencing methodologies and generate data to drive our dime ....Empirical and computational solutions for multi-omics single-cell assays. Emerging single-cell sequencing technologies are transforming molecular cell biology, but identifying novel cell types and their functions requires the integration of highly heterogeneous data. The development of computational methods able to extract biologically relevant results is hindered by the lack of high-quality datasets. This project aims to develop novel sequencing methodologies and generate data to drive our dimension reduction multivariate method developments for data integration. By combining in silico and in vivo approaches, the project is anticipated to benefit scientists willing to work in cutting-edge single-cell research by providing useful protocols and tools to generate novel insights in cell biology. Read moreRead less