ARC Centre of Excellence for Mathematical and Statistical Frontiers of Big Data, Big Models, New Insights. In today's world, massive amounts of data in a variety of forms are collected daily from a multitude of sources. Many of the resulting data sets have the potential to make vital contributions to society, business and government, as well as impact on international developments, but are so large or complex that they are difficult to process and analyse using traditional tools. The aim of this ....ARC Centre of Excellence for Mathematical and Statistical Frontiers of Big Data, Big Models, New Insights. In today's world, massive amounts of data in a variety of forms are collected daily from a multitude of sources. Many of the resulting data sets have the potential to make vital contributions to society, business and government, as well as impact on international developments, but are so large or complex that they are difficult to process and analyse using traditional tools. The aim of this Centre is to create innovative mathematical and statistical models that can uncover the knowledge concealed within the size and complexity of these big data sets, with a focus on using the models to deliver insight into problems vital to the Centre's Collaborative Domains: Healthy People, Sustainable Environments and Prosperous Societies.Read moreRead less
Finite Markov chains in statistical mechanics and combinatorics. Finite Markov chains can be viewed as random walks in a finite set. In applications, this set often consists of certain combinatorial objects whose typical properties are to be understood. If the set is large, obtaining exact solutions to such problems is generally infeasible. Markov chains can provide a highly efficient method to generate randomised approximations in such cases, but only if they equilibrate at a rate that grows sl ....Finite Markov chains in statistical mechanics and combinatorics. Finite Markov chains can be viewed as random walks in a finite set. In applications, this set often consists of certain combinatorial objects whose typical properties are to be understood. If the set is large, obtaining exact solutions to such problems is generally infeasible. Markov chains can provide a highly efficient method to generate randomised approximations in such cases, but only if they equilibrate at a rate that grows slowly with the size of the set of objects under study. The project will study several classes of Markov chains that have been developed to study a number of notoriously difficult problems in statistical mechanics and combinatorics, and determine under what conditions they provide efficient approximation schemes.Read moreRead less
Bayesian estimation of flexible spatial models with applications in medical imaging and econometric modeling. This project aims to develop statistical methodology for estimating flexible highly parameterised Bayesian spatial models. The flexible models examined will include regression, choice and time series models for data that is spatially registered. Spatial smoothing of parameters in the models will involve application of hierarchical spatial prior distributions. The resulting methodology wi ....Bayesian estimation of flexible spatial models with applications in medical imaging and econometric modeling. This project aims to develop statistical methodology for estimating flexible highly parameterised Bayesian spatial models. The flexible models examined will include regression, choice and time series models for data that is spatially registered. Spatial smoothing of parameters in the models will involve application of hierarchical spatial prior distributions. The resulting methodology will be applied to the analysis of medical imaging data and to the estimation of spatial econometric models of residential real estate prices. The expected outcomes include developments in the frontier framework of Bayesian computational estimation methodology, improved methods for medical image processing and estimation of high resolution spatial models of residential real estate prices in Australian metropolitan centres.Read moreRead less
The estimation of genotype-phenotype relationships from family data and of animal abundance from capture-recapture data with frequent capture occasions: A semiparametric approach. Semiparametric statistical methods allow researchers to only model those features of their data that are of interest, but still allow standard statistical inferences to be made about these features. The aim here is to develop non standard applications of semiparametric statistical methods in the estimation of genotype ....The estimation of genotype-phenotype relationships from family data and of animal abundance from capture-recapture data with frequent capture occasions: A semiparametric approach. Semiparametric statistical methods allow researchers to only model those features of their data that are of interest, but still allow standard statistical inferences to be made about these features. The aim here is to develop non standard applications of semiparametric statistical methods in the estimation of genotype-phenotype relationships from family data and the estimation of animal abundance from capture-recapture data. The methods will be applied to real data and their theoretical properties developed. The practical significance of the project is the flexible new statistical methods that will become available to researchers. The theoretical significance will be the insights into semiparametric methods gained by developing these nonstandard applications. The expected outcomes are the new statistical procedures and the resulting theoretical insights into semiparametric statistics.Read moreRead less
Phase transitions in stochastic systems. This project aims to understand models of physical and biological phenomena in the presence of uncertainty/randomness. Such models often exhibit phase transitions if a variable defining the model is modified. For example, a population explosion can occur if the average number of offspring per individual is larger than one, while macroscopic defects can occur in a material if the density of microscopic defects is larger than some threshold. This research c ....Phase transitions in stochastic systems. This project aims to understand models of physical and biological phenomena in the presence of uncertainty/randomness. Such models often exhibit phase transitions if a variable defining the model is modified. For example, a population explosion can occur if the average number of offspring per individual is larger than one, while macroscopic defects can occur in a material if the density of microscopic defects is larger than some threshold. This research could lead to strategies for directing physical and biological systems towards preferred states or phases, and better prediction of adverse events such as fracturing of Antarctic sea ice.Read moreRead less
Australian Laureate Fellowships - Grant ID: FL150100150
Funder
Australian Research Council
Funding Amount
$2,413,112.00
Summary
Bayesian learning for decision making in the big data era. Bayesian learning for decision making in the big data era: This fellowship project aims to develop new techniques in evidence-based learning and decision-making in the big data era. Big data has arrived, and with it a huge global demand for statistical knowledge and skills to analyse these data for improved learning and decision-making. This project will seek to address this need by creating a step-change in knowledge in Bayesian statist ....Bayesian learning for decision making in the big data era. Bayesian learning for decision making in the big data era: This fellowship project aims to develop new techniques in evidence-based learning and decision-making in the big data era. Big data has arrived, and with it a huge global demand for statistical knowledge and skills to analyse these data for improved learning and decision-making. This project will seek to address this need by creating a step-change in knowledge in Bayesian statistics and translating this knowledge to real-world challenges in industry, environment and health. The new big data statistical analysts trained through the project could also create much needed capacity at national and international levels.Read moreRead less
Inverse and related problems in statistics. Modern statistical inverse problems arise in fields from astronomy and biology to engineering and finance. Sometimes the problems involve the analysis of small samples of very high dimensional data, and are central to information aquisition in areas such as genomics and signal analysis. All these topics are of significant national importance, and their solution will bring national and community benefits. In addition, the program to which the proposa ....Inverse and related problems in statistics. Modern statistical inverse problems arise in fields from astronomy and biology to engineering and finance. Sometimes the problems involve the analysis of small samples of very high dimensional data, and are central to information aquisition in areas such as genomics and signal analysis. All these topics are of significant national importance, and their solution will bring national and community benefits. In addition, the program to which the proposal will lead will be used extensively for research training. In Australia, where the demand for research-trained statisticians greatly exceeds supply, this contribution to the nation and the community will be particularly important. Read moreRead less
New and computationally feasible methods of constructing efficient and exact confidence limits from count data. Biological and health science data is commonly in the form of counts. The statistical analysis of such data should be (a) efficient i.e. it should not, in effect, throw away valuable data, (b) exact i.e. it should have precisely known statistical properties and (c) computationally feasible. Kabaila and Lloyd (1997-2001) have proposed and analysed a radically new method of confidence li ....New and computationally feasible methods of constructing efficient and exact confidence limits from count data. Biological and health science data is commonly in the form of counts. The statistical analysis of such data should be (a) efficient i.e. it should not, in effect, throw away valuable data, (b) exact i.e. it should have precisely known statistical properties and (c) computationally feasible. Kabaila and Lloyd (1997-2001) have proposed and analysed a radically new method of confidence limit construction which, for the first time, possesses all of these requirements. The purpose of the project is to establish further theoretical support for the new method, to develop efficient computational algorithms and to write easy-to-use computer programs for its practical use.Read moreRead less
Classification methods for providing personalised and class decisions. This project provides a novel approach to the clustering of multivariate samples on entities in a class that automatically matches the sample clusters across the entities, allowing for inter-sample variation between the samples in a class. The project aims to develop a widely applicable, mixture-model-based framework for the simultaneous clustering of multivariate samples with inter-sample variation in a class and for the mat ....Classification methods for providing personalised and class decisions. This project provides a novel approach to the clustering of multivariate samples on entities in a class that automatically matches the sample clusters across the entities, allowing for inter-sample variation between the samples in a class. The project aims to develop a widely applicable, mixture-model-based framework for the simultaneous clustering of multivariate samples with inter-sample variation in a class and for the matching of the clusters across the entities in the class. The project will use a statistical approach to automatically match the clusters, since the overall mixture model provides a template for the class. It will provide a basis for discriminating between different classes in addition to the identification of atypical data points within a sample and of anomalous samples within a class. Key applications include biological image analysis and the analysis of data in flow cytometry which is one of the fundamental research tools for the life scientist.Read moreRead less
Statistical methods for quantifying variation in spatiotemporal areal data. This project aims to develop new statistical methods for extracting insights into spatial and temporal variation in areal data. These tools will extend the Australian Cancer Atlas which provides small area estimates for 20 cancers across Australia. The project is significant because it will allow government and other organisations to reap dividends from investment in collecting spatial information and it will enable mode ....Statistical methods for quantifying variation in spatiotemporal areal data. This project aims to develop new statistical methods for extracting insights into spatial and temporal variation in areal data. These tools will extend the Australian Cancer Atlas which provides small area estimates for 20 cancers across Australia. The project is significant because it will allow government and other organisations to reap dividends from investment in collecting spatial information and it will enable modelled small-area estimates to be released without compromising confidentiality. The expected outcomes include new statistical knowledge and new insights into cancer. The results will benefit the many disciplines, managers and policy makers that make decisions based on geographic data mapped over space and time. Read moreRead less