Investigation and Development of Parallel Large Scale Record Linkage Techniques. Record linkage aims at matching records of the same entity (like customer or patient) in large (administrative) databases. The outcomes of the proposed research will improve current techniques in terms of efficiency, accuracy and the need for human intervention. Through experimental studies and stochastic modelling the performance of traditional and new methods for data cleaning, standardisation and linkage will be assessed. The effect of the statistical dependency of attribute values will be studied. New methods using clustering for blocking large datasets, and predictive models including interaction terms will be implemented, analysed and evaluated on high-performance computers and office-based PC clusters.
Creating the social genome: Advanced techniques for linking dynamic data. This project aims to develop novel efficient and effective models and techniques that enable record linkage of large dynamic databases while preserving the privacy of sensitive personal data. Social genomes are the digital footprints of our society. They are the basis of population informatics, which is revolutionising how researchers in various domains conduct studies, governments plan services and expenditures, and businesses advertise and interact with their customers. A core requirement of population informatics is the linking of large dynamic databases that contain details about people from diverse sources. The expected outcomes of this project will provide novel solutions to the challenges of population informatics faced by Australian organisations.
Special Research Initiatives - Grant ID: SR0354744
Funder
Australian Research Council
Funding Amount
$20,000.00
Summary
Improving Australia's Data Mining and Knowledge Discovery Research. The network will bring together over 50 active researchers in data mining and knowledge discovery to enhance and better coordinate Australia's impressive research performance in these dual disciplines. Specifically, the network will (a) facilitate communication and collaboration between researchers, (b) fund or underwrite opportunities for international collaboration, (c) run a number of specialist workshops and symposia and (d) establish a national annual conference.