Investigation and Development of Parallel Large Scale Record Linkage Techniques. Record linkage aims at matching records of the same entity (like customer or patient) in large (administrative) databases. The outcomes of the proposed research will improve current techniques in terms of efficiency, accuracy and the need for human intervention. Through experimental studies and stochastic modelling the performance of traditional and new methods for data cleaning, standardisation and linkage will be ....Investigation and Development of Parallel Large Scale Record Linkage Techniques. Record linkage aims at matching records of the same entity (like customer or patient) in large (administrative) databases. The outcomes of the proposed research will improve current techniques in terms of efficiency, accuracy and the need for human intervention. Through experimental studies and stochastic modelling the performance of traditional and new methods for data cleaning, standardisation and linkage will be assessed. The effect of the statistical dependency of attribute values will be studied. New methods using clustering for blocking large datasets, and predictive models including interaction terms will be implemented, analysed and evaluated on high-performance computers and office-based PC clusters.
Read moreRead less
Efficient Techniques for Mining Exceptional Patterns. This research will develop totally new techniques for exceptional pattern discovery that are useful for deeper understanding data mining and capturing the hidden interactions (class-bridge rules and out-expectation patterns) within data. This will enable Australian data marketers to access valuable implicit information that is contained in their data, but not currently accessible. The outcomes will keep Australia in the international leading ....Efficient Techniques for Mining Exceptional Patterns. This research will develop totally new techniques for exceptional pattern discovery that are useful for deeper understanding data mining and capturing the hidden interactions (class-bridge rules and out-expectation patterns) within data. This will enable Australian data marketers to access valuable implicit information that is contained in their data, but not currently accessible. The outcomes will keep Australia in the international leading edge and preserve its competitive status in preemptively defining the information market of tomorrow. To 'Frontier Technologies for Building and Transforming Australian Industries', discovering new exceptional patterns within data will lead to increased efficiency in Australian Industries.Read moreRead less
New Directions in Mining Complex Spatial Relationships in Large Scientific Databases. International and Australian organizations are investing in large projects involving the collection of terabytes of scientific data. The Anglo-Australian Galaxy Redshift Survey in eastern Australia has obtained data for a quarter of a million galaxies. Similarly the Tropical Ocean Global Atmophere(TOGA) program is being expanded to collect data from the equatorial pacific region which will help better understa ....New Directions in Mining Complex Spatial Relationships in Large Scientific Databases. International and Australian organizations are investing in large projects involving the collection of terabytes of scientific data. The Anglo-Australian Galaxy Redshift Survey in eastern Australia has obtained data for a quarter of a million galaxies. Similarly the Tropical Ocean Global Atmophere(TOGA) program is being expanded to collect data from the equatorial pacific region which will help better understand the El Nino/Southern Oscillation Cycle. We are developing powerful spatial data mining tools which will go a long way in finding potential nuggets of useful information in these large databases and help Australian and international scientists hypothesise new theories to explain the underlying phenomenon.Read moreRead less