ARC Centre of Excellence for Mathematical and Statistical Frontiers of Big Data, Big Models, New Insights. In today's world, massive amounts of data in a variety of forms are collected daily from a multitude of sources. Many of the resulting data sets have the potential to make vital contributions to society, business and government, as well as impact on international developments, but are so large or complex that they are difficult to process and analyse using traditional tools. The aim of this Centre is to create innovative mathematical and statistical models that can uncover the knowledge concealed within the size and complexity of these big data sets, with a focus on using the models to deliver insight into problems vital to the Centre's Collaborative Domains: Healthy People, Sustainable Environments and Prosperous Societies.
Complex data, model selection and bootstrap inference. The project will provide new statistical methods and associated software for the analysis and modelling of complex data, as well as quality research training. This project will benefit researchers in statistics and users of statistics who encounter the complex data considered in this project and who need to model and make inferences from these data. Since these kinds of data arise in many areas (such as medicine, genetics and chemistry), Australia and Australian industry will ultimately benefit from the proposed research. The strengthening of international links and the training of highly skilled research scientists in an area of national importance will also benefit Australia.
Building models for complex data. The purpose of this project is to better understand the process of building statistical models and construct new methods for building models for particular kinds of complex data. The expected outcomes include a new way of thinking about model building and practical tools which together enable us to get more value out of analysing complex data.
Novel statistical methods for data with non-Euclidean geometric structure. This project aims to develop new flexible regression models and classification algorithms, along with robust and efficient inference methods, applicable to a wide range of non-Euclidean data types which arise in many fields of science, business and technology. There are serious flaws with currently available methods of analysis for non-Euclidean data. This project expects to transform such analyses by providing new quantitative tools within a unifying framework. The anticipated project outcomes will be of mathematical interest and valuable in applications such as finance (predicting Australian stock returns); modelling electroencephalography data; Australian geochemical data, relating to sediments; and Australian X-ray tumour image data.
Discovery Early Career Researcher Award - Grant ID: DE180100220
Funder
Australian Research Council
Funding Amount
$369,075.00
Summary
Statistics for manifold-valued data. This project aims to develop, and then implement, a new suite of fully flexible, interpretable and tractable models for manifold-valued data, along with robust and accurate estimation techniques for their parameters. Multivariate data with complicated constraints, such as manifold-valued data, is frequently encountered in the physical, biological and medical sciences; however, it is difficult to define tractable statistical models and estimate their parameters due to the curvature and nonlinear geometry of the sample space. The outcomes of the project are of direct mathematical interest and of significant interest to science and business disciplines where manifold-valued data is commonly observed.
Prediction, inference and their application to modelling correlated data. This project aims to create new, improved methods for prediction and making inference about predictions for a variety of correlated data types through inventing sophisticated and novel resampling schemes such as the generalised fast bootstrap and repeated partial permutation. The research will impact on both the theory and practice of statistics and on substantive fields which use mixed or compositional models to analyse dependent data. This will be a significant improvement in the assessment and stability of statistical models in areas such as social, ecological and geological sciences.
Dimension reduction and model selection for statistically challenging data. This project aims to develop a deep theoretical understanding of the relationship between various dimension reduction and model selection methods used in statistical model building, and then use this understanding to develop new, improved methods of model building for statistically challenging data. The research will impact on both the theory and practice of statistics, and on substantive fields which collect and analyse these kinds of data. This will provide a significant improvement in statistical model building in areas such as epidemiology and the chemical and ecological sciences. The project is timely because of the increasing collection of large-dimensional, complex, correlated data sets in these and many other fields.
New methods for small group analysis from sample surveys. National and state averages of statistics on issues such as unemployment, salinity, drought impact, and health often hide large differences between population sub-groups and between small areas. This local variation needs to be understood so that effective policies can be developed and carried out efficiently and their impact monitored. This project will provide, for the first time, robust and efficient methods for providing information on these variations using data from large-scale national and state surveys. This will lead to significant improvements in the data available for small population groups and small areas, allowing better targeting of policies aimed at addressing local differences.
Theory and Applications of Computer-Intensive Statistical Methods. The availability of powerful computing equipment has had a dramatic impact on statistical methods and thinking. It has motivated development of novel approaches to data analysis, whose conception and appreciation, even their application, often demand sophisticated and complex theoretical methods. In this context, the project will develop new approaches to solving non-standard statistical problems. These techniques will either have direct application to solving practical problems of national or community concern, or provide a better understanding of the nature of such problems.
NONPARAMETRIC STATISTICS. Nonparametric statistical methods are techniques that implicitly choose statistical models from exceptionally large and highly adaptive classes. The project aims to develop innovative and practicable nonparametric methods in four areas: Statistical Smoothing, Data Mining, Mixture Methods and Robust Inference. The significance of the work lies in its novelty, the breadth of its practical motivation, and its position at the leading edge of contemporary work in statistics. Expected outcomes include new technologies for data analysis.