Uncertainty, Risk and Related Concepts in Machine Learning. Machine learning is the science of making sense of data. It does not and cannot remove all risk and uncertainty. This project proposes to study the foundations of how machine learning uses, represents and communicates risk and uncertainty. It aims to do so by finding new theoretical connections between diverse notions that have arisen in allied disciplines. These include risk, uncertainty, scoring rules and loss functions, divergences, ....Uncertainty, Risk and Related Concepts in Machine Learning. Machine learning is the science of making sense of data. It does not and cannot remove all risk and uncertainty. This project proposes to study the foundations of how machine learning uses, represents and communicates risk and uncertainty. It aims to do so by finding new theoretical connections between diverse notions that have arisen in allied disciplines. These include risk, uncertainty, scoring rules and loss functions, divergences, statistics and different ways of aggregating information. By building a more complete theoretical map it is expected that new machine learning methods will be developed, but more importantly that machine learning will be able to be better integrated into larger socio-technical systems.Read moreRead less
Discovery Early Career Researcher Award - Grant ID: DE170101134
Funder
Australian Research Council
Funding Amount
$360,000.00
Summary
Feasible algorithms for big inference. This project aims to develop algorithms for computationally-intensive statistical tools to analyse Big Data. Big Data is ubiquitous in science, engineering, industry and finance, but needs special machine learning to conduct correct inferential analysis. Computational bottlenecks make many tried-and-true tools of statistical inference inadequate. This project will develop tools including false discovery rate control, heteroscedastic and robust regression an ....Feasible algorithms for big inference. This project aims to develop algorithms for computationally-intensive statistical tools to analyse Big Data. Big Data is ubiquitous in science, engineering, industry and finance, but needs special machine learning to conduct correct inferential analysis. Computational bottlenecks make many tried-and-true tools of statistical inference inadequate. This project will develop tools including false discovery rate control, heteroscedastic and robust regression and mixture models, via Big Data-appropriate optimisation and composite-likelihood estimation. It will make open, well-documented, and accessible software available for the scalable and distributable analysis of Big Data. The expected outcome is a suite of scalable algorithms to analyse Big Data.Read moreRead less
Theoretical Foundations of Ethical Machine Learning. The project aims to develop a systematic theory of ethical machine learning. Machine learning is a powerful and pervasive technology that is already having a huge impact on Australia. When applied to data about people there are a range of ethical harms that can arise (fairness, and privacy are two of them). The project aims to develop a rigorously grounded foundation for managing such ethical harms. For example it will allow the quantification ....Theoretical Foundations of Ethical Machine Learning. The project aims to develop a systematic theory of ethical machine learning. Machine learning is a powerful and pervasive technology that is already having a huge impact on Australia. When applied to data about people there are a range of ethical harms that can arise (fairness, and privacy are two of them). The project aims to develop a rigorously grounded foundation for managing such ethical harms. For example it will allow the quantification of the inevitable trade-offs between fairness and utility. The benefits of the project should include better ways of managing these trade-offs, a competitive advantage for Australian firms developing the technology, and will ensure that the country retains a social license to use the technology.Read moreRead less
Rating and ranking sports players and teams using minimum message length. All sorts of games and sports could use better systems for rating and ranking teams. This is as true in sports-mad Australia as any other country. Improved and more accessible rating systems across a variety of activities should encourage the general public to take a greater interest in the mathematics, statistics, information theory and machine learning behind the systems. With Cadability as our Australia-based interna ....Rating and ranking sports players and teams using minimum message length. All sorts of games and sports could use better systems for rating and ranking teams. This is as true in sports-mad Australia as any other country. Improved and more accessible rating systems across a variety of activities should encourage the general public to take a greater interest in the mathematics, statistics, information theory and machine learning behind the systems. With Cadability as our Australia-based international industry partner, the global use of these systems will be to Australia's economic advantage. Having a more accurate rating system which is wider-reaching both in the number of sports and games and the number of participants per sport and game should also encourage greater participation from the general public.Read moreRead less
Machine learning in adversarial environments. Machine learning underpins the technologies driving the economies of both Silicon Valley and Wall Street, from web search and ad placement, to stock predictions and efforts in fighting cybercrime. This project aims to answer the question: How can machines learn from data when contributors act maliciously for personal gain?
Australian Laureate Fellowships - Grant ID: FL200100176
Funder
Australian Research Council
Funding Amount
$3,128,080.00
Summary
Theoretical Foundations of Ethical Machine Learning. The project will develop a systematic theory of ethical machine learning. Machine learning is a powerful and pervasive technology that is already having a huge impact on Australia. When applied to data about people there are a range of ethical harms that can arise (fairness, and privacy are two of them). The project will develop a rigorously grounded foundation for managing such ethical harms. For example it will allow the quantification of t ....Theoretical Foundations of Ethical Machine Learning. The project will develop a systematic theory of ethical machine learning. Machine learning is a powerful and pervasive technology that is already having a huge impact on Australia. When applied to data about people there are a range of ethical harms that can arise (fairness, and privacy are two of them). The project will develop a rigorously grounded foundation for managing such ethical harms. For example it will allow the quantification of the inevitable trade-offs between fairness and utility. The benefits of the project will include the best possible ways of managing these trade-offs, competitive advantage for Australian firms developing the technology, and will ensure that the country retains a social license to use the technology.Read moreRead less
Privacy-preserving cloud data mining-as-a-service. This project aims to explore practical privacy-preserving solutions for cloud data mining-as-a-service based on the Intel Software Guard Extensions (SGX) technology. The research addresses privacy concerns of users when outsourcing data mining needs to the cloud. These concerns have increased as more businesses evaluate data mining-as-an outsourced service due to lack of expertise or computation resources. The expected outcomes from the research ....Privacy-preserving cloud data mining-as-a-service. This project aims to explore practical privacy-preserving solutions for cloud data mining-as-a-service based on the Intel Software Guard Extensions (SGX) technology. The research addresses privacy concerns of users when outsourcing data mining needs to the cloud. These concerns have increased as more businesses evaluate data mining-as-an outsourced service due to lack of expertise or computation resources. The expected outcomes from the research will include new data privacy models, new privacy-preserving data mining algorithms, and a prototype of cloud data mining software. These will help businesses cut costs for data mining and privacy protection, and provide significant benefits toward helping Australia achieve its national cyber security strategy and potentially provide economic impact from commercialisation of new software technology for the industry partner.Read moreRead less
Identifying and Tracking Influential Events in Large Social Networks. This project aims to invent a novel model and techniques for identifying and tracking influential events in large and dynamic social networks in real time. The proposed model would take into account the structure and content of social networks, and the influence of events. The project also plans to develop efficient strategies for identifying and tracking events in large and dynamic social network environments based on the mod ....Identifying and Tracking Influential Events in Large Social Networks. This project aims to invent a novel model and techniques for identifying and tracking influential events in large and dynamic social networks in real time. The proposed model would take into account the structure and content of social networks, and the influence of events. The project also plans to develop efficient strategies for identifying and tracking events in large and dynamic social network environments based on the model, In particular, the project plans to investigate flexible social network query methods to make users’ event search easy. Finally the project plans to build an evaluation system to demonstrate the efficiency of the algorithms and effectiveness of the model.Read moreRead less
Biclique discovery in Big Data. This project aims to design algorithms to capture Big Data. Biclique is a popular graph model that can capture important cohesive structures in many applications. However, traditional biclique discovery algorithms which only focus on simple, small-scale, static and deterministic data are inadequate in the era of Big Data where data has Variety (various formats), Volume (large quantity), Velocity (dynamic update) and Veracity (uncertainty). This project expects to ....Biclique discovery in Big Data. This project aims to design algorithms to capture Big Data. Biclique is a popular graph model that can capture important cohesive structures in many applications. However, traditional biclique discovery algorithms which only focus on simple, small-scale, static and deterministic data are inadequate in the era of Big Data where data has Variety (various formats), Volume (large quantity), Velocity (dynamic update) and Veracity (uncertainty). This project expects to benefit real applications in both public and private sectors and add value to Australian manufactured products.Read moreRead less
Learning the Focus of Attention to Detect Distributed Coordinated Attacks. Cyber security analysts need to detect and respond to attacks as soon as possible, to minimise the damage attackers can inflict. However, the growth in highly distributed attacks that span multiple networks has meant that massive volumes of data need to be analysed. While machine learning techniques can help filter the data, we need techniques that can automatically provide a focus of attention for analysts on the most re ....Learning the Focus of Attention to Detect Distributed Coordinated Attacks. Cyber security analysts need to detect and respond to attacks as soon as possible, to minimise the damage attackers can inflict. However, the growth in highly distributed attacks that span multiple networks has meant that massive volumes of data need to be analysed. While machine learning techniques can help filter the data, we need techniques that can automatically provide a focus of attention for analysts on the most relevant observations. Our aim is to devise a novel suite of attention mechanisms that can focus the search of machine learning techniques for cyber security. The results of this project will improve the accuracy and efficiency of detecting distributed attacks across multiple networks.Read moreRead less