Towards knowledge discovery from imperfect and evolving data. Information extraction from data is critical, both to analyse and protect consumer data. However, many learning techniques are developed using perfect, static datasets, quite different to messy, ever-changing real-world data. This project aims to develop data analytics techniques that can extract accurate information in complex structures from imperfect/incomplete data that changes over time. Expected outcomes are a prototype tool, te ....Towards knowledge discovery from imperfect and evolving data. Information extraction from data is critical, both to analyse and protect consumer data. However, many learning techniques are developed using perfect, static datasets, quite different to messy, ever-changing real-world data. This project aims to develop data analytics techniques that can extract accurate information in complex structures from imperfect/incomplete data that changes over time. Expected outcomes are a prototype tool, tested on real datasets, that combines new techniques in data modelling, algorithm development, and system design. Likely benefits are enhanced Australia's competence in data science through student training and new, robust data tools relevant to critical sectors such as cybersecurity, healthcare, and defence.Read moreRead less
Short Sequence Representation Learning with Limited Supervision . Predicting events based on short text and video data is widely found in real-world applications such as online crime detection, cyber-attack identification, and public security protection. However, to develop such an effective prediction model is very difficult due to the problems such as limited supervision, heterogeneous multiple sources, and missing and low-quality data. This project is to tackle these challenges. Expected outc ....Short Sequence Representation Learning with Limited Supervision . Predicting events based on short text and video data is widely found in real-world applications such as online crime detection, cyber-attack identification, and public security protection. However, to develop such an effective prediction model is very difficult due to the problems such as limited supervision, heterogeneous multiple sources, and missing and low-quality data. This project is to tackle these challenges. Expected outcome of this project will lay a theoretical foundation for effective short sequence representation learning and build next-generation intelligent systems. This should benefit our society and economy through the applications of multimodality-integrated video technologies for cybersecurity and public safety. Read moreRead less
Responsible modelling respecting privacy, data quality, and green computing. With the unprecedented growing impact of data on science, the economy and society, there comes the need for responsible data science practices which are accountable for the social good. This project aims to investigate the challenging problem of how to provide responsible data management, spanning across privacy-aware data exploration, resilient modelling to cope with imperfect data, and efficient model architectures fo ....Responsible modelling respecting privacy, data quality, and green computing. With the unprecedented growing impact of data on science, the economy and society, there comes the need for responsible data science practices which are accountable for the social good. This project aims to investigate the challenging problem of how to provide responsible data management, spanning across privacy-aware data exploration, resilient modelling to cope with imperfect data, and efficient model architectures for resource-constrained environments. This will be achieved by developing theories and techniques for complex real-world multi-modal data retrieval throughout the data life-cycle. The expected outcomes will significantly contribute to building capability in emerging technologies in the context of responsible data science. Read moreRead less