Automated assessment of data quality in biological knowledge resources. This project aims to develop methods for identifying poor quality data in biological databases. Research in biomedicine is underpinned by massive databases of biological data. Data quality is largely managed through manual curation, but automated methods to assess quality are critically needed. This project expects to develop a suite of computational tools for assessing biological data quality, utilising an innovative approach based on network analysis of database record connectivity. These tools will enable quantifying data quality at scale. Researchers, evidence-based decision-makers in biomedicine, and the analytical or predictive tools that use this data will make more reliable inferences and decisions.
Deep Pattern Mining for Brain Graph Analysis: A Data Mining Perspective. This project brings together experts in the fields of data mining and cognitive neuroscience. This project aims to develop new data analytics tools, algorithms, and models to combine complex multi-source neuroimage brain data and non-imaging data, to explore the interplays among these different data structures and identify novel functional patterns from complex brain graph structures. The research undertaken in this project expects to provide practical data analysis approaches and establish the theoretical foundations for data mining with multiple sources of brain data.
Discovery Early Career Researcher Award - Grant ID: DE200101610
Funder
Australian Research Council
Funding Amount
$403,398.00
Summary
Towards Explainable Multi-source Multivariate Time-series Analysis. The aim of this project is to build deep learning models with transparent reasoning behind the results that can be easily interpreted by humans. The research rests on translating pertinent knowledge from multiple sources of complex data containing event sequences into graph form and embedding those knowledge graphs into a sophisticated deep learning model. Such an accomplishment represents the next great advance in machine intelligence and will lay the theoretical foundations for building intelligent analysis tools that truly work in tandem with people. The potential benefits to science, society, and the Australian economy, particularly in finance, sensor technologies, and emergency health services would be appreciable.
Discovery Early Career Researcher Award - Grant ID: DE210101458
Funder
Australian Research Council
Funding Amount
$387,141.00
Summary
Scalable and Deep Anomaly Detection from Big Data with Similarity Hashing. Anomaly detection, aiming to identify anomalous but insightful patterns in data mining, is an important big data analytics technique. The nature of big data requires a detection method that can handle fast-evolving data of diverse types. However, existing methods suffer from either high computational cost or low detection performance. This project aims to develop a detection framework to advance detection performance and efficiency, based on a novel deep learning model called deep isolation forest which is different from the traditional artificial neural network based models. The outcome will bring huge benefits to various applications such as real-time predictive maintenance in smart manufacturing, and intrusion detection in cybersecurity.
Discovery Early Career Researcher Award - Grant ID: DE190101118
Funder
Australian Research Council
Funding Amount
$339,000.00
Summary
High performance density-based clustering in parallel environments. This project aims to conduct a comprehensive study on density-based clustering to improve data management in parallel computing environments. Clustering, a fundamental task in data management, is to group a set of objects such that objects in the same group (called a cluster) are more similar to each other than those in other groups in order to simplify retrieval of similar information. Clustering is widely used in many fields including machine learning, pattern recognition, information retrieval, bioinformatics and image analysis. It is expected that the developed clustering techniques will provide significant performance improvements in industry sectors where decisions are made based on clustering data analytics, such as the sectors of finance, renewable energy and artificial intelligence.
Efficient spatial data management for enabling true ride-sharing. This data management project aims to examine ride-sharing as a model of a complex decision system that can be optimised to deliver better outcomes. Popular ride-sharing apps have quickly evolved into ride-sourcing services that are comparable to calling a taxi on a mobile phone. Such arrangements miss many of the key benefits of true ride-sharing for society. The project will model incentives by helping people agree on points of interest rather than directly seeking trips from others to set destinations. It also aims to introduce privacy-aware dynamic matching of sharers, and expand to transportation at large, to generate new shared transportation services. The expected outcome of this project is to elevate today's taxi-like ride-sharing services to true ride-sharing arrangements. This is expected to provide benefits such as reduced traffic and emissions, as well as addressing parking issues and other traffic problems.
Personalised data analytics for the Internet of Me. This project aims to develop data mining methods for extracting comprehensive personalised knowledge, without breaching trust. The Internet of Things will lead to the Internet of Me. Billions of smart devices connected to the Internet record people’s lives. Companies wish to provide highly personalised services that engage their customers, while individuals wish to understand their health, lifestyle, education and personal performance. The challenge is to analyse individuals’ personal data, and discover how they differentiate from and overlap with others’. This project expects to enable businesses to deepen customer satisfaction and individuals to better understand their personal place in a connected world.
Fast effective clustering technologies for highly dynamic massive networks. Clustering is a fundamental data mining and analysis task. In an interconnected evolving world, friendships and information flows are modelled as large dynamic networks. Structural clustering and correlation clustering are important and well-studied approaches for static networks; for evolving networks, where links appear and disappear over time, we lack efficient techniques. Anticipated outcomes are new practical clustering algorithms for dynamic networks – with performance guarantees of efficiency and clustering quality – and prototype software, guiding us to pick a good clustering. Expected benefits include better understanding of spread in evolving social networks, accelerating the software testing cycle, and improved topic detection.
Developing key vision technology for automation of aquaculture factory. This project aims to investigate structural, coloured textural, and hyperspectral analysis approaches to achieve automated lobster molt-cycle staging and classification to the level required for commercial production. High labour cost, water contamination, and disease transmission are major barriers in Australian bay lobster aquaculture inhibiting its large-scale production. Automation of the production process and reducing human contact with animals are of high priority in the development of this Australian-led emerging industry. The project aims to develop technology to bring this world-first aquaculture factory to large-scale production, and create new export opportunities for lobsters and production systems.