Automated assessment of data quality in biological knowledge resources. This project aims to develop methods for identifying poor quality data in biological databases. Research in biomedicine is underpinned by massive databases of biological data. Data quality is largely managed through manual curation, but automated methods to assess quality are critically needed. This project expects to develop a suite of computational tools for assessing biological data quality, utilising an innovative approa ....Automated assessment of data quality in biological knowledge resources. This project aims to develop methods for identifying poor quality data in biological databases. Research in biomedicine is underpinned by massive databases of biological data. Data quality is largely managed through manual curation, but automated methods to assess quality are critically needed. This project expects to develop a suite of computational tools for assessing biological data quality, utilising an innovative approach based on network analysis of database record connectivity. These tools will enable quantifying data quality at scale. Researchers, evidence-based decision-makers in biomedicine, and the analytical or predictive tools that use this data will make more reliable inferences and decisions.Read moreRead less
Efficient spatial data management for enabling true ride-sharing. This data management project aims to examine ride-sharing as a model of a complex decision system that can be optimised to deliver better outcomes. Popular ride-sharing apps have quickly evolved into ride-sourcing services that are comparable to calling a taxi on a mobile phone. Such arrangements miss many of the key benefits of true ride-sharing for the society. The project will model incentives by helping people agree on points ....Efficient spatial data management for enabling true ride-sharing. This data management project aims to examine ride-sharing as a model of a complex decision system that can be optimised to deliver better outcomes. Popular ride-sharing apps have quickly evolved into ride-sourcing services that are comparable to calling a taxi on a mobile phone. Such arrangements miss many of the key benefits of true ride-sharing for the society. The project will model incentives by helping people agree on points of interest rather than directly seeking trips from others to set destinations. It also aims to introduce privacy-aware dynamic matching of sharers, and expand to transportation at large, to generate new shared transportation services. The expected outcome of this project is to elevate today's taxi-like ride-sharing services to true ride-sharing arrangements. This is expected to provide benefits such as reduced traffic and emissions, as well as addressing parking issues and other traffic problems.Read moreRead less
Personalised data analytics for the Internet of Me. This project aims to develop data mining methods for extracting comprehensive personalised knowledge, without breaching trust. The Internet of Things will lead to the Internet of Me. Billions of smart devices connected to the Internet record people’s lives. Companies wish to provide highly personalised services that engage their customers, while individuals wish to understand their health, lifestyle, education and personal performance. The chal ....Personalised data analytics for the Internet of Me. This project aims to develop data mining methods for extracting comprehensive personalised knowledge, without breaching trust. The Internet of Things will lead to the Internet of Me. Billions of smart devices connected to the Internet record people’s lives. Companies wish to provide highly personalised services that engage their customers, while individuals wish to understand their health, lifestyle, education and personal performance. The challenge is to analyse individuals’ personal data, and discover how they differentiate from and overlap with others’. This project expects to enable businesses to deepen customer satisfaction and individuals to better understand their personal place in a connected world.Read moreRead less
Deep Pattern Mining for Brain Graph Analysis: A Data Mining Perspective. This project brings together experts in the fields of data mining and cognitive neuroscience. This project aims to develop new data analytics tools, algorithms, and models to combine complex multi-source neuroimage brain data and non-imaging data, to explore the interplays among these different data structures and identify novel functional patterns from complex brain graph structures. The research undertaken in this project ....Deep Pattern Mining for Brain Graph Analysis: A Data Mining Perspective. This project brings together experts in the fields of data mining and cognitive neuroscience. This project aims to develop new data analytics tools, algorithms, and models to combine complex multi-source neuroimage brain data and non-imaging data, to explore the interplays among these different data structures and identify novel functional patterns from complex brain graph structures. The research undertaken in this project expects to provide practical data analysis approaches and establish the theoretical foundations for data mining with multiple sources of brain data.Read moreRead less
Discovery Early Career Researcher Award - Grant ID: DE210101458
Funder
Australian Research Council
Funding Amount
$387,141.00
Summary
Scalable and Deep Anomaly Detection from Big Data with Similarity Hashing. Anomaly detection, aiming to identify anomalous but insightful patterns in data mining, is an important big data analytics technique. The nature of big data requires a detection method that can handle fast-evolving data of diverse types. However, existing methods suffer from either high computational cost or low detection performance. This project aims to develop a detection framework to advance detection performance and ....Scalable and Deep Anomaly Detection from Big Data with Similarity Hashing. Anomaly detection, aiming to identify anomalous but insightful patterns in data mining, is an important big data analytics technique. The nature of big data requires a detection method that can handle fast-evolving data of diverse types. However, existing methods suffer from either high computational cost or low detection performance. This project aims to develop a detection framework to advance detection performance and efficiency, based on a novel deep learning model called deep isolation forest which is different from the traditional artificial neural network based models. The outcome will bring huge benefits to various applications such as real-time predictive maintenance in smart manufacturing, and intrusion detection in cybersecurity.Read moreRead less
Searching Cohesive Subgraphs in Big Attributed Graph Data. The availability of big attributed graph data brings great opportunities for realizing big values of data. Making sense of such big attributed graph data finds many applications, including health, science, engineering, business, environment, etc. A cohesive subgraph, one of key components that captures the latent properties in a graph, is essential to graph analysis. This project aims to invent effective models of cohesive subgraphs and ....Searching Cohesive Subgraphs in Big Attributed Graph Data. The availability of big attributed graph data brings great opportunities for realizing big values of data. Making sense of such big attributed graph data finds many applications, including health, science, engineering, business, environment, etc. A cohesive subgraph, one of key components that captures the latent properties in a graph, is essential to graph analysis. This project aims to invent effective models of cohesive subgraphs and efficient algorithms for searching and monitoring cohesive subgraphs in big and dynamic attributed graphs from both structure and attribute perspectives. The methods, techniques, and prototype systems developed in this project can be deployed to facilitate the smart use of big graph data across the nation. Read moreRead less
Modelling and Searching Cohesive Groups over Heterogeneous Graphs . Heterogeneous information networks (HINs) contain richer structural and semantic information represented as different types of objects and links. Searching cohesive groups from HINs finds many applications and also brings challenges at both conceptual and technical levels. This project aims to investigate the effective modelling of cohesive groups that take both homogeneous and heterogeneous information into account for differen ....Modelling and Searching Cohesive Groups over Heterogeneous Graphs . Heterogeneous information networks (HINs) contain richer structural and semantic information represented as different types of objects and links. Searching cohesive groups from HINs finds many applications and also brings challenges at both conceptual and technical levels. This project aims to investigate the effective modelling of cohesive groups that take both homogeneous and heterogeneous information into account for different applications and devise efficient algorithms for searching and monitoring those cohesive groups based on different models. The methods, techniques, and evaluation systems developed in this project can be deployed to facilitate the smart use of heterogeneous information networks across the nation.Read moreRead less
Next-generation Intelligent Explorations of Geo-located Data . This project aims to build a next-generation intelligent exploration framework over massive geo-located data, varying from points-of-interest to areas-of-interest data, in order to dramatically enhance user experiences when interacting with various forms of geo-located data over maps. Expected outcomes include novel exploration models, efficient and scalable algorithms for retrieving and visualizing the exploration results, online up ....Next-generation Intelligent Explorations of Geo-located Data . This project aims to build a next-generation intelligent exploration framework over massive geo-located data, varying from points-of-interest to areas-of-interest data, in order to dramatically enhance user experiences when interacting with various forms of geo-located data over maps. Expected outcomes include novel exploration models, efficient and scalable algorithms for retrieving and visualizing the exploration results, online updating of personal preferences during the life cycle of exploration, as well as a prototype system to evaluate and demonstrate practical value of the research. It will complement existing map services and significantly benefit many location-aware services, e.g., logistics, health services and urban planning.Read moreRead less
Secure and efficient data leak prevention on cloud. The leak of sensitive data on cloud not only poses serious threats to both public and private organisations but also puts their employees and clients at risk, e.g., economic loss and social impact. The aim of this project is to develop a secure and efficient solution that can detect and prevent leak of data in real-time. Uniquely, the proposed research will develop novel techniques that can monitor data leak security incidents happening over ti ....Secure and efficient data leak prevention on cloud. The leak of sensitive data on cloud not only poses serious threats to both public and private organisations but also puts their employees and clients at risk, e.g., economic loss and social impact. The aim of this project is to develop a secure and efficient solution that can detect and prevent leak of data in real-time. Uniquely, the proposed research will develop novel techniques that can monitor data leak security incidents happening over time and captured by different sensors and identify correlations between historic security incidents and current data attacks. This project will significantly help to secure data on cloud for organisations in Australia and benefit fast-growing security sensitive data hosting and applications on cloud.Read moreRead less
Effective and Efficient Data Quality Management for Data Lakes. This project aims to enhance the quality and completeness for data in data lakes by innovative and judicious use of Database and Artificial Intelligence techniques. To achieve the aim, we will develop knowledge-enhanced error correction during data ingestion, flexible and efficient data exploration, and heterogeneity-tolerant scalable data integration solutions. Its significance lies in integrating techniques from both database and ....Effective and Efficient Data Quality Management for Data Lakes. This project aims to enhance the quality and completeness for data in data lakes by innovative and judicious use of Database and Artificial Intelligence techniques. To achieve the aim, we will develop knowledge-enhanced error correction during data ingestion, flexible and efficient data exploration, and heterogeneity-tolerant scalable data integration solutions. Its significance lies in integrating techniques from both database and artificial intelligence areas to deliver effective solutions for challenging problems in data lakes. The outcome of this project will provide new knowledge in this cutting-edge domain, and provide additional value and immediate benefits to all applications built upon data lakes. Read moreRead less