Towards knowledge discovery from imperfect and evolving data. Information extraction from data is critical, both to analyse and protect consumer data. However, many learning techniques are developed using perfect, static datasets, quite different to messy, ever-changing real-world data. This project aims to develop data analytics techniques that can extract accurate information in complex structures from imperfect/incomplete data that changes over time. Expected outcomes are a prototype tool, te ....Towards knowledge discovery from imperfect and evolving data. Information extraction from data is critical, both to analyse and protect consumer data. However, many learning techniques are developed using perfect, static datasets, quite different to messy, ever-changing real-world data. This project aims to develop data analytics techniques that can extract accurate information in complex structures from imperfect/incomplete data that changes over time. Expected outcomes are a prototype tool, tested on real datasets, that combines new techniques in data modelling, algorithm development, and system design. Likely benefits are enhanced Australia's competence in data science through student training and new, robust data tools relevant to critical sectors such as cybersecurity, healthcare, and defence.Read moreRead less
Short Sequence Representation Learning with Limited Supervision . Predicting events based on short text and video data is widely found in real-world applications such as online crime detection, cyber-attack identification, and public security protection. However, to develop such an effective prediction model is very difficult due to the problems such as limited supervision, heterogeneous multiple sources, and missing and low-quality data. This project is to tackle these challenges. Expected outc ....Short Sequence Representation Learning with Limited Supervision . Predicting events based on short text and video data is widely found in real-world applications such as online crime detection, cyber-attack identification, and public security protection. However, to develop such an effective prediction model is very difficult due to the problems such as limited supervision, heterogeneous multiple sources, and missing and low-quality data. This project is to tackle these challenges. Expected outcome of this project will lay a theoretical foundation for effective short sequence representation learning and build next-generation intelligent systems. This should benefit our society and economy through the applications of multimodality-integrated video technologies for cybersecurity and public safety. Read moreRead less