Defence Against Phishing Attacks. Australian businesses and citizens are losing millions of dollars in cybercrimes every year. Rural and regional businesses depend on the integrity of their Internet banking service, and yet, cybercriminals are working hard to defraud these users. This project aims to build a reliable defence against phishing attacks which rely on social engineering to steal online identities, using intelligence gathered from the brazen trade of credentials in the public domain.
Ask the Net: Intelligent Natural Language Learning. Natural Language Processing (NLP) has progressed rapidly using corpus-based machine learning techniques. However, corpus development costs cause a ?data bottleneck? which prevents systems from reaching human competence. This project overcomes the difficulties of creating huge corpora by employing the innate language ability of untrained contributors. We will show how to automatically select and present examples, containing informative lingui ....Ask the Net: Intelligent Natural Language Learning. Natural Language Processing (NLP) has progressed rapidly using corpus-based machine learning techniques. However, corpus development costs cause a ?data bottleneck? which prevents systems from reaching human competence. This project overcomes the difficulties of creating huge corpora by employing the innate language ability of untrained contributors. We will show how to automatically select and present examples, containing informative linguistic structures, which are most beneficial for training NLP systems. These examples will be analysed by many contributors whose responses will be automatically collated into corpora. Huge corpora are vital to emerging language technologies for managing textual information in the global economy.
Read moreRead less
Effective Information Retrieval for Partitioned Document Collections. Current information retrieval services make use of massive indexes in order to resolve content-based queries. Monolithic approaches like this have been effective until now because the volume of data stored has been manageable on a single machine or tightly-coupled cluster of machines, and because the data has been available for collection. But with an increasing amount of automatically generated data, and an increasing diversi ....Effective Information Retrieval for Partitioned Document Collections. Current information retrieval services make use of massive indexes in order to resolve content-based queries. Monolithic approaches like this have been effective until now because the volume of data stored has been manageable on a single machine or tightly-coupled cluster of machines, and because the data has been available for collection. But with an increasing amount of automatically generated data, and an increasing diversity of information sources, other approaches are required. In this project we will investigate mechanisms for handling retrieval tasks when the indexes to the data are stored locally with the data, and when no central index is viable.Read moreRead less
Achieving higher availability of storage subsystems through application of a self learning expert system. In todays global business environment the management, storage and security of enterprise data (data unavailability, data loss and corruption, systems performance) has become the heart of so-called Enterprise computing. The storage subsystems increasingly have become the critical subcomponent and single point of failure. Discovering the cause of failure in complex environments involving mul ....Achieving higher availability of storage subsystems through application of a self learning expert system. In todays global business environment the management, storage and security of enterprise data (data unavailability, data loss and corruption, systems performance) has become the heart of so-called Enterprise computing. The storage subsystems increasingly have become the critical subcomponent and single point of failure. Discovering the cause of failure in complex environments involving multiple vendors, machines, software products, topologies and cultures (languages) is in many cases time consuming and difficult resulting in unacceptable systems downtime and high maintenance costs. A more sophisticated tool is needed allowing the accumulation of knowledge, the ability to deal with complexity and change, the ability to interface with unlike knowledge bases and predict solution probability based on experience and feedback. Multi-lingual support and capability through the development of a Natural Language interface would provide a functional capability suited to managing enterprise data in todays global businesses.Read moreRead less
Parsing the web: Exploiting redundancy to understand language. This project will automatically learn the grammatical structure of language by exploiting redundancy of facts, like 'Mozart was born in 1756', from a trillion words of web text. These facts will be used to understand more complex sentences. This will enable smart information use of text with grammatical information for large-scale information access for the first time. This project will strengthen Australia's world-class expertise, ....Parsing the web: Exploiting redundancy to understand language. This project will automatically learn the grammatical structure of language by exploiting redundancy of facts, like 'Mozart was born in 1756', from a trillion words of web text. These facts will be used to understand more complex sentences. This will enable smart information use of text with grammatical information for large-scale information access for the first time. This project will strengthen Australia's world-class expertise, providing opportunities for future researchers in this area. Our expanded C&C tools and trillion word corpus will be used by academics, companies and governments, in Australia and internationally, aiding applications including financial surveillance and fraud detection.
Read moreRead less
Automatic Ontology Learning and Data Reasoning in Web Mining. This research has an impact on both research and practical applications. In research, it provides opportunities for research students to carry out research using both data mining and data reasoning to solving Web based application problems. In practical, it can help IT industry to design the new generation of Web mining systems in order to provide invaluable service to users. This research also develops new techniques for data automa ....Automatic Ontology Learning and Data Reasoning in Web Mining. This research has an impact on both research and practical applications. In research, it provides opportunities for research students to carry out research using both data mining and data reasoning to solving Web based application problems. In practical, it can help IT industry to design the new generation of Web mining systems in order to provide invaluable service to users. This research also develops new techniques for data automatic processing within areas of smart information use in Australia. In particular it further develops data mining techniques by introducing data reasoning models for using discovered knowledge. It must be useful to improve the efficiency of the existing data mining systems. Read moreRead less
Exploring Scientific Information with Advanced New Search Tools. The rapidly growth of scientific literature in many fields makes finding information a challenge. For example, biologists produce over 1 million articles each year. Existing search tools have only limited success satisfying the demands of scientists' queries. This project will deliver intelligent e-research assistants capable of answering scientists' questions directly rather than returning a list of documents. This will allow scie ....Exploring Scientific Information with Advanced New Search Tools. The rapidly growth of scientific literature in many fields makes finding information a challenge. For example, biologists produce over 1 million articles each year. Existing search tools have only limited success satisfying the demands of scientists' queries. This project will deliver intelligent e-research assistants capable of answering scientists' questions directly rather than returning a list of documents. This will allow scientists to more efficiently exploit the literature enabling them to be more innovative and productive. This technology is applicable where ever finding facts in large volumes of text is critical, e.g. analysing surveillance material. Advanced search tools will have considerable academic and industrial impact.Read moreRead less