Build competency aware and assuring machine learning systems. Recent development in machine learning (ML) has seen ML models with extremely high prediction accuracy. However, to support human-machine partnership in decision-making in complex environments, beyond accuracy, it is essential for ML systems to be competency aware and reliable, and at the same time be exploratory. This project aims to develop novel techniques to equip a ML system with the ability to identify own competency, to justify ....Build competency aware and assuring machine learning systems. Recent development in machine learning (ML) has seen ML models with extremely high prediction accuracy. However, to support human-machine partnership in decision-making in complex environments, beyond accuracy, it is essential for ML systems to be competency aware and reliable, and at the same time be exploratory. This project aims to develop novel techniques to equip a ML system with the ability to identify own competency, to justify its competency and decisions, to explore unknown situations and fully utilise existing expertise to deal with unknowns. The expected outcomes of the project will enable ML systems to become truely intelligent and reliable machine partners for human decision makers in a wide range of applications.Read moreRead less
Towards knowledge discovery from imperfect and evolving data. Information extraction from data is critical, both to analyse and protect consumer data. However, many learning techniques are developed using perfect, static datasets, quite different to messy, ever-changing real-world data. This project aims to develop data analytics techniques that can extract accurate information in complex structures from imperfect/incomplete data that changes over time. Expected outcomes are a prototype tool, te ....Towards knowledge discovery from imperfect and evolving data. Information extraction from data is critical, both to analyse and protect consumer data. However, many learning techniques are developed using perfect, static datasets, quite different to messy, ever-changing real-world data. This project aims to develop data analytics techniques that can extract accurate information in complex structures from imperfect/incomplete data that changes over time. Expected outcomes are a prototype tool, tested on real datasets, that combines new techniques in data modelling, algorithm development, and system design. Likely benefits are enhanced Australia's competence in data science through student training and new, robust data tools relevant to critical sectors such as cybersecurity, healthcare, and defence.Read moreRead less
Techniques for active conceptual modelling and guided data mining for rapid knowledge discovery. Quick, accurate responses to rapidly evolving phenomena are essential. This project will develop a platform able to accept data from a variety of sources in advance of the full definition of the associated conceptual model. The project will facilitate rapid querying and direct manipulation of the mining process allowing fast, user-oriented results.
Discovery Early Career Researcher Award - Grant ID: DE120101161
Funder
Australian Research Council
Funding Amount
$375,000.00
Summary
Compressive sensing based probabilistic graphical models (PGM). The aim of the project is to develop fast, large scale probabilistic graphical models (PGM) learning and inference methods. The resulting system will be able to process large scale PGMs on a standard PC, and will be easily extendable to computer clustering for larger scale PGMs requiring higher precision.
Efficient causal discovery from observational data. Discovering cause-effect relationships is the ultimate goal for many applications. Randomised control trial is the gold standard for discovering causal relationships. However, conducting such trials is impossible in many cases due to cost and/or ethical concerns. In contrast, a large amount of data has been accumulated in all areas. It is desirable to infer causal relationships from data directly and automatically. This project aims to develop ....Efficient causal discovery from observational data. Discovering cause-effect relationships is the ultimate goal for many applications. Randomised control trial is the gold standard for discovering causal relationships. However, conducting such trials is impossible in many cases due to cost and/or ethical concerns. In contrast, a large amount of data has been accumulated in all areas. It is desirable to infer causal relationships from data directly and automatically. This project aims to develop fast and scalable data mining methods for identifying causal relationships from large and/or high dimensional data sets. The developed methods will mainly be evaluated in real world biological applications. The research outcomes will be useful in many areas for causal reasoning and decision making.Read moreRead less
Fairness aware data mining for discrimination free decision-making. This project aims to develop data mining methods to detect algorithmic discriminations and to build fair decision models. It expects to provide techniques for regulatory organisations to detect discriminations in algorithmic decisions, and for various companies and organisations to build fair decision systems. Expected outcomes are novel and accurate methods for discrimination detection, practical and versatile techniques for fa ....Fairness aware data mining for discrimination free decision-making. This project aims to develop data mining methods to detect algorithmic discriminations and to build fair decision models. It expects to provide techniques for regulatory organisations to detect discriminations in algorithmic decisions, and for various companies and organisations to build fair decision systems. Expected outcomes are novel and accurate methods for discrimination detection, practical and versatile techniques for fair decision model building, and improved understanding of the relationships between privacy preservation and discrimination prevention to enable new techniques to achieve both goals. The developed techniques enable society to tackle ethical challenges in the big data era where many decisions are analytics based. Read moreRead less
Developing novel data mining methods to reveal complex group relationships from heterogeneous data. This project aims to develop novel and effective data mining methods that will enable us to unravel the relationships between multiple, rather than individual, components of complex systems (such as genes, gene regulators and cancer), which is crucial to understanding how such systems work. Potential applications for such methods are extensive.
Online Learning for Large Scale Structured Data in Complex Situations. Online Learning (OL) is the process of predicting answers for a sequence of questions. OL has enjoyed much attention in recent years due to its natural ability of processing large scale non-structured data and adapting to a changing environment. However, OL has three weaknesses: it does not scale for structured data; it often assumes that all of the data are equally important; it often considers that all of the data are compl ....Online Learning for Large Scale Structured Data in Complex Situations. Online Learning (OL) is the process of predicting answers for a sequence of questions. OL has enjoyed much attention in recent years due to its natural ability of processing large scale non-structured data and adapting to a changing environment. However, OL has three weaknesses: it does not scale for structured data; it often assumes that all of the data are equally important; it often considers that all of the data are complete and noise-free. These weaknesses limit its utility, because real data such as those that must be analysed in processing social networks, fraud detection do not satisfy the restrictions. The aim of this project is to develop theoretical and practical advances in OL that overcome the existing weaknesses.Read moreRead less
Probabilistic Graphical Models For Interventional Queries. The project intends to develop methods to suggest how to optimally intervene so that the future state of the system will best suit our interests. The power of probabilistic graphical models to model complex relationships and interactions among a large number of variables facilitates many applications. However, such models only aim to understand the underlying environment. What is ultimately needed in many real-world applications is to su ....Probabilistic Graphical Models For Interventional Queries. The project intends to develop methods to suggest how to optimally intervene so that the future state of the system will best suit our interests. The power of probabilistic graphical models to model complex relationships and interactions among a large number of variables facilitates many applications. However, such models only aim to understand the underlying environment. What is ultimately needed in many real-world applications is to suggest how we ought to intervene or act, so as to alter the environment to best suit our interests. The proposed project aims to achieve this using probabilistic graphical models on massive real-world data sets, thus facilitating a variety of applications from health care to commerce and the environment.Read moreRead less
Efficient data mining methods for evidence-based decision making. This project aims to develop efficient data mining methods for causal predictions. Evidence-based decision making (EBD), such as evidence-based medicine and policy, is always preferable. To support EBD, causal predictions forecast how outcomes change when conditions are manipulated. Progress has been made in theoretical research on causal inference based on observational data, but few methods can automatically mine causal signals ....Efficient data mining methods for evidence-based decision making. This project aims to develop efficient data mining methods for causal predictions. Evidence-based decision making (EBD), such as evidence-based medicine and policy, is always preferable. To support EBD, causal predictions forecast how outcomes change when conditions are manipulated. Progress has been made in theoretical research on causal inference based on observational data, but few methods can automatically mine causal signals from the data and methods for efficient causal predictions based on data are even fewer. This project will apply its methods to biomedical problems. The outcomes could support smart and data-driven evidence based decision making in many areas, such as therapeutics and government policy making.Read moreRead less