Investigation and Development of Parallel Large Scale Record Linkage Techniques. Record linkage aims at matching records of the same entity (like customer or patient) in large (administrative) databases. The outcomes of the proposed research will improve current techniques in terms of efficiency, accuracy and the need for human intervention. Through experimental studies and stochastic modelling the performance of traditional and new methods for data cleaning, standardisation and linkage will be ....Investigation and Development of Parallel Large Scale Record Linkage Techniques. Record linkage aims at matching records of the same entity (like customer or patient) in large (administrative) databases. The outcomes of the proposed research will improve current techniques in terms of efficiency, accuracy and the need for human intervention. Through experimental studies and stochastic modelling the performance of traditional and new methods for data cleaning, standardisation and linkage will be assessed. The effect of the statistical dependency of attribute values will be studied. New methods using clustering for blocking large datasets, and predictive models including interaction terms will be implemented, analysed and evaluated on high-performance computers and office-based PC clusters.
Read moreRead less
Creating the social genome: Advanced techniques for linking dynamic data. This project aims to develop novel efficient and effective models and techniques that enable record linkage of large dynamic databases while preserving the privacy of sensitive personal data. Social genomes are the digital footprints of our society. They are the basis of population informatics, which is revolutionising how researchers in various domains conduct studies, governments plan services and expenditures, and busin ....Creating the social genome: Advanced techniques for linking dynamic data. This project aims to develop novel efficient and effective models and techniques that enable record linkage of large dynamic databases while preserving the privacy of sensitive personal data. Social genomes are the digital footprints of our society. They are the basis of population informatics, which is revolutionising how researchers in various domains conduct studies, governments plan services and expenditures, and businesses advertise and interact with their customers. A core requirement of population informatics is the linking of large dynamic databases that contain details about people from diverse sources. The expected outcomes of this project will provide novel solutions to the challenges of population informatics faced by Australian organisations.Read moreRead less