Classification of Microarray Gene-Expression Data. The broad aim is to provide statistical methodology for the classification of microarray gene-expression data. Microarrays are part of a new biotechnology that allows the monitoring of expression levels for thousands of genes simultaneously. The explosion in microarrays has produced massive quantities of data that require new statistical techniques for analysis in order to exploit their enormous scientific potential. One of the main uses of ....Classification of Microarray Gene-Expression Data. The broad aim is to provide statistical methodology for the classification of microarray gene-expression data. Microarrays are part of a new biotechnology that allows the monitoring of expression levels for thousands of genes simultaneously. The explosion in microarrays has produced massive quantities of data that require new statistical techniques for analysis in order to exploit their enormous scientific potential. One of the main uses of the methodology to be developed is to expedite the discovery of new subclasses of diseases. Another is to provide prediction rules for the diagnosis and treatment of diseases.Read moreRead less
Discovery Early Career Researcher Award - Grant ID: DE160101565
Funder
Australian Research Council
Funding Amount
$330,000.00
Summary
Flexible data modelling via skew mixture models:challenges and applications. This project seeks to explore new models for handling data with non-normal features. Parametric distributions are fundamental to statistical modelling and inference. For centuries, the ‘normal’ distribution has been the dominant model for continuous data. However, real data rarely satisfy the assumption of normality. There is thus a strong demand for more flexible distributions. This project aims to develop new methodol ....Flexible data modelling via skew mixture models:challenges and applications. This project seeks to explore new models for handling data with non-normal features. Parametric distributions are fundamental to statistical modelling and inference. For centuries, the ‘normal’ distribution has been the dominant model for continuous data. However, real data rarely satisfy the assumption of normality. There is thus a strong demand for more flexible distributions. This project aims to develop new methodologies in finite mixture modelling using skew component distributions to provide better models for handling data with non-normal features (such as skewness, heavy/light tails, and multimodality). Applications may include security intrusion detection, clinical diagnosis and prognosis, and flow and mass cytometry.Read moreRead less
On-line and Incremental EM-based Neural Networks: Application to Hospital Utlilization and Gene Expression Data. Artificial neural networks have been widely applied as universal classifiers in many fields, such as biomedicine. However, misunderstanding of fundamental statistical principles, which can cause misleading findings, has been frequently observed in the literature. This project aims to integrate statistical methodologies in neural networks to provide a unified approach to improve its ....On-line and Incremental EM-based Neural Networks: Application to Hospital Utlilization and Gene Expression Data. Artificial neural networks have been widely applied as universal classifiers in many fields, such as biomedicine. However, misunderstanding of fundamental statistical principles, which can cause misleading findings, has been frequently observed in the literature. This project aims to integrate statistical methodologies in neural networks to provide a unified approach to improve its applicability and efficiency in implementation. The system developed from this proposed cross-disciplinary research will be applied to hospital utilization data (hospital morbidity database, Western Australia) and gene expression data (DNA microarrays databases, Harvard University). This collaborative research will advance the international standard of Australian research communities.
Read moreRead less
Goodness-of-fit Testing Of Log-link Models For Categorical Outcome Data
Funder
National Health and Medical Research Council
Funding Amount
$260,863.00
Summary
Information about the health consequences of exposure to causal factors is obtained from mathematical models of observed data. Incorrect inferences are possible if the model does not adequately represent the data. Relative risk models are recommended for observations over time on a cohort of subjects, but it is not known how best to assess the adequacy of such models. This project will assess the performance of summary measures of goodness-of-fit when applied to relative risk models.
Design And Analysis Of Interrupted Time Series Studies In Health Care Research: Resolution Of Methodological Issues
Funder
National Health and Medical Research Council
Funding Amount
$307,125.00
Summary
An interrupted time series (ITS) study involves a population observed on multiple occasions before and after the implementation of an intervention program. However, methods for statistical analysis and designing such studies have not been well developed and many statistical analyses of such studies are flawed. This proposal will investigate appropriate methods for design and analysis, and develop guidelines and software for its implementation by health researchers.
Prof Speed is a statistician specializing in bioinformatics and computational biology, applying my skills in support of basic research in molecular and cell biology and genetics.
I am a statistician specializing in bioinformatics and computational biology, applying my skills in support of basic research in molecular and cell biology and genetics.