Understanding somatic mutation in plants: new methods, new software, new data. Somatic mutations accumulate as plants grow, affecting everything from short-term ecological interactions to long-term evolutionary dynamics. These mutations have important consequences for plant industry and conservation, but because they are so hard to measure almost nothing is known about them. This project aims to develop new methods and software to detect, analyse, and compare the genome-wide history of somatic m ....Understanding somatic mutation in plants: new methods, new software, new data. Somatic mutations accumulate as plants grow, affecting everything from short-term ecological interactions to long-term evolutionary dynamics. These mutations have important consequences for plant industry and conservation, but because they are so hard to measure almost nothing is known about them. This project aims to develop new methods and software to detect, analyse, and compare the genome-wide history of somatic mutation in individual plants, providing an unprecedented level of detail into an important but understudied source of biological variation. By applying these methods to an iconic experimental population, This project aims to provide the first insights into the genome-wide causes and consequences of somatic mutation in plants.Read moreRead less
Visual Analytics for Next Generation Sequencing. Next-generation sequencing technologies have brought a revolution in biology and healthcare, while taxing the ability of scientists and clinicians to identify and process relevant data, to make sense of it all and communicate it to others in a concise and meaningful way. This project aims to tackle this problem through fundamentally new approaches to data selection and visualisation at very large scale, actively encoding for insight into underlyin ....Visual Analytics for Next Generation Sequencing. Next-generation sequencing technologies have brought a revolution in biology and healthcare, while taxing the ability of scientists and clinicians to identify and process relevant data, to make sense of it all and communicate it to others in a concise and meaningful way. This project aims to tackle this problem through fundamentally new approaches to data selection and visualisation at very large scale, actively encoding for insight into underlying biological and biomedical processes, bringing sustainable discovery of new relationships and variations within the data. The project aims to support new approaches to medical diagnosis and treatment, and offer crucial lessons to address the broader challenge of understanding large, complex data sets.Read moreRead less
Deep correction of DNA sequencing errors by data mining algorithms. This project aims to investigate the many layers of error correction problems in the terabytes of genomic sequence data, and aims to solve these problems by novel data mining algorithms. High-throughput sequencing platforms have generated massive amounts of useful raw data, but also made widespread errors. The new algorithms are capable of correcting errors at deeper layers to further enhance data quality. Expected outcome inclu ....Deep correction of DNA sequencing errors by data mining algorithms. This project aims to investigate the many layers of error correction problems in the terabytes of genomic sequence data, and aims to solve these problems by novel data mining algorithms. High-throughput sequencing platforms have generated massive amounts of useful raw data, but also made widespread errors. The new algorithms are capable of correcting errors at deeper layers to further enhance data quality. Expected outcome includes the knowledge advancement of genomic data industry and interdisciplinary collaboration between biotechnology and data mining. This also provides significant benefit for genomic decisions in forensics and personalised medicine which demand accurate genomic information.Read moreRead less
Searching for near-exact protein models. This project aims to develop novel and efficient heuristic-based algorithms leading to near accurate protein tertiary structure models. Knowledge about protein structures is fundamental to our understanding of living systems. The progress on experimental determination of these structures has been extremely limited and remains an open challenge in molecular biology. Computational prediction of protein structures from sequences is emerging as a promising ap ....Searching for near-exact protein models. This project aims to develop novel and efficient heuristic-based algorithms leading to near accurate protein tertiary structure models. Knowledge about protein structures is fundamental to our understanding of living systems. The progress on experimental determination of these structures has been extremely limited and remains an open challenge in molecular biology. Computational prediction of protein structures from sequences is emerging as a promising approach, but its accuracy is far from satisfactory. The software systems developed in this project will be used in structural identification of target proteins in drug design. This will make drug design process more efficient, saving time and cost, potentially saving lives.Read moreRead less
Learning Software Security Analysers with Imperfect Data. This project aims to systematically investigate next-generation learning-based software security analysis to detect vulnerabilities in real-world large-scale software. The expected learning-based foundation will support the handling of imperfect data in order to provide a precise, scalable and adaptive security analysis of the critical software components, thus capturing important security vulnerabilities missed by existing approaches. Th ....Learning Software Security Analysers with Imperfect Data. This project aims to systematically investigate next-generation learning-based software security analysis to detect vulnerabilities in real-world large-scale software. The expected learning-based foundation will support the handling of imperfect data in order to provide a precise, scalable and adaptive security analysis of the critical software components, thus capturing important security vulnerabilities missed by existing approaches. The success of this project will further enhance the international competitiveness of Australian research in this important field and will benefit any Australian industry and business where software systems are deeply-rooted, such as transportation, smart homes, medical devices, defence and finance.Read moreRead less
Viewable Measures for Component-Based Software Engineering. Software costs over US$300 billion per year worldwide with around 70% of large projects failing to complete in time. Reliable estimation of development effort is a great challenge in Software Engineering. This project aims to develop a reliable multi-dimensional software size measure and an effort estimation model for a new method of development called component based software engineering. We will validate our results theoretically and ....Viewable Measures for Component-Based Software Engineering. Software costs over US$300 billion per year worldwide with around 70% of large projects failing to complete in time. Reliable estimation of development effort is a great challenge in Software Engineering. This project aims to develop a reliable multi-dimensional software size measure and an effort estimation model for a new method of development called component based software engineering. We will validate our results theoretically and test it against empirical data from software industry. We will provide novel visualization techniques to comprehend measurements of large systems. The outcomes will help software projects better estimate deadlines and budgets thus reducing costs significantly.Read moreRead less
Real-time phylogenetics for food-borne outbreak surveillance. The project aims to introduce, for the first time, real-time evolutionary analysis of agricultural pathogens so that outbreaks affecting crops and the food supply can be managed precisely and rapidly. An expert team will implement a large-scale data analytics framework in user-friendly software that integrates Australian infectious disease genomics data with global data. Underpinning this work are new theory and algorithms that apply ....Real-time phylogenetics for food-borne outbreak surveillance. The project aims to introduce, for the first time, real-time evolutionary analysis of agricultural pathogens so that outbreaks affecting crops and the food supply can be managed precisely and rapidly. An expert team will implement a large-scale data analytics framework in user-friendly software that integrates Australian infectious disease genomics data with global data. Underpinning this work are new theory and algorithms that apply Sequential Monte Carlo to update phylogenetic analyses continuously as new data arrives. Expected outcomes include new knowledge of statistical algorithms for evolutionary analysis, relevant to biological disciplines beyond infectious disease; and enhanced capacity for infectious disease analysis. Read moreRead less
Achieving Tailored Industrial Software Process Improvement with Experience Repositories and Electronic Process Guides. There is a well-recognised need for organisations to improve their software development processes in order to achieve higher software quality and greater efficiencies in development. The use of electronic process guides and experience repositories have been two technologies independently proposed to achieve these aims. This research will develop a framework, methods and tools ....Achieving Tailored Industrial Software Process Improvement with Experience Repositories and Electronic Process Guides. There is a well-recognised need for organisations to improve their software development processes in order to achieve higher software quality and greater efficiencies in development. The use of electronic process guides and experience repositories have been two technologies independently proposed to achieve these aims. This research will develop a framework, methods and tools to allow integration of experience repositories and electronic process guides to facilitate process tailoring, process improvement, and project management. The result will be significant improvements in software development productivity and quality.Read moreRead less
Ownership-based Alias Analysis for Securing Unsafe Rust Programs. This project aims to develop an ownership-based alias analysis as a complement to Rust's ownership type system for improving Rust's memory safety. This project, therefore, expects to deliver an alias analysis foundation that can provide stronger memory safety guarantees than the state-of-the-art in detecting memory-safety violations and security vulnerabilities in real-world Rust programs that use unsafe language features. The exp ....Ownership-based Alias Analysis for Securing Unsafe Rust Programs. This project aims to develop an ownership-based alias analysis as a complement to Rust's ownership type system for improving Rust's memory safety. This project, therefore, expects to deliver an alias analysis foundation that can provide stronger memory safety guarantees than the state-of-the-art in detecting memory-safety violations and security vulnerabilities in real-world Rust programs that use unsafe language features. The expected outcomes are a deployable ownership-based alias analysis in the Rust compiler and an industrial-strength open-source framework. These outcomes are expected to provide significant benefits in improving software quality and security in Rust, an emerging language that offers both performance and safety.Read moreRead less
Exploring novel coding genomic features through integrative proteogenomics. Knowledge of the full extent to which the human genome is made into proteins is of fundamental importance in the study of health and disease. New technological advances are now enabling functional studies of genomes with increasing detail. This project aims to develop and apply cutting edge bioinformatics methods to perform an integrative and comprehensive exploration of the extent to which the genes of a human cell line ....Exploring novel coding genomic features through integrative proteogenomics. Knowledge of the full extent to which the human genome is made into proteins is of fundamental importance in the study of health and disease. New technological advances are now enabling functional studies of genomes with increasing detail. This project aims to develop and apply cutting edge bioinformatics methods to perform an integrative and comprehensive exploration of the extent to which the genes of a human cell line are made into proteins. The project will improve our understanding of the human genome and deliver cutting edge methodology applicable for genome annotation in all living organisms.Read moreRead less