Techniques for active conceptual modelling and guided data mining for rapid knowledge discovery. Quick, accurate responses to rapidly evolving phenomena are essential. This project will develop a platform able to accept data from a variety of sources in advance of the full definition of the associated conceptual model. The project will facilitate rapid querying and direct manipulation of the mining process allowing fast, user-oriented results.
Online Learning for Large Scale Structured Data in Complex Situations. Online Learning (OL) is the process of predicting answers for a sequence of questions. OL has enjoyed much attention in recent years due to its natural ability of processing large scale non-structured data and adapting to a changing environment. However, OL has three weaknesses: it does not scale for structured data; it often assumes that all of the data are equally important; it often considers that all of the data are compl ....Online Learning for Large Scale Structured Data in Complex Situations. Online Learning (OL) is the process of predicting answers for a sequence of questions. OL has enjoyed much attention in recent years due to its natural ability of processing large scale non-structured data and adapting to a changing environment. However, OL has three weaknesses: it does not scale for structured data; it often assumes that all of the data are equally important; it often considers that all of the data are complete and noise-free. These weaknesses limit its utility, because real data such as those that must be analysed in processing social networks, fraud detection do not satisfy the restrictions. The aim of this project is to develop theoretical and practical advances in OL that overcome the existing weaknesses.Read moreRead less
Probabilistic Graphical Models For Interventional Queries. The project intends to develop methods to suggest how to optimally intervene so that the future state of the system will best suit our interests. The power of probabilistic graphical models to model complex relationships and interactions among a large number of variables facilitates many applications. However, such models only aim to understand the underlying environment. What is ultimately needed in many real-world applications is to su ....Probabilistic Graphical Models For Interventional Queries. The project intends to develop methods to suggest how to optimally intervene so that the future state of the system will best suit our interests. The power of probabilistic graphical models to model complex relationships and interactions among a large number of variables facilitates many applications. However, such models only aim to understand the underlying environment. What is ultimately needed in many real-world applications is to suggest how we ought to intervene or act, so as to alter the environment to best suit our interests. The proposed project aims to achieve this using probabilistic graphical models on massive real-world data sets, thus facilitating a variety of applications from health care to commerce and the environment.Read moreRead less
Intelligent Technologies for Smart Cryptography. This project aims to improve cybersecurity by automating the process of generating cryptographic software for smart devices. The expected outcomes are tools that automatically produce efficient cryptographic software that resists attacks. The main benefit of this project is to reduce the amount of expert labour required when developing secure software.
Privacy-Preserving Classification for Big-Data Driven Network Traffic. Protecting sensitive information in large network traffic flows while ensuring data usability for classification emerges as a critical problem of increasing significance. Existing techniques do not work on highly heterogeneous traffic from big-data applications for both privacy protection and classification (such as port-based and load- based methods). This project investigates new theories, methods and techniques for solving ....Privacy-Preserving Classification for Big-Data Driven Network Traffic. Protecting sensitive information in large network traffic flows while ensuring data usability for classification emerges as a critical problem of increasing significance. Existing techniques do not work on highly heterogeneous traffic from big-data applications for both privacy protection and classification (such as port-based and load- based methods). This project investigates new theories, methods and techniques for solving this problem. It proposes to develop a set of effective methods for privacy-preserving data publication through combining randomisation with anonymisation, and for classifying the published data through uncertainty leveraging by probabilistic reasoning and accuracy lifting by inter-flow correlation analysis and active learning.Read moreRead less