Ask the Net: Intelligent Natural Language Learning. Natural Language Processing (NLP) has progressed rapidly using corpus-based machine learning techniques. However, corpus development costs cause a ?data bottleneck? which prevents systems from reaching human competence. This project overcomes the difficulties of creating huge corpora by employing the innate language ability of untrained contributors. We will show how to automatically select and present examples, containing informative lingui ....Ask the Net: Intelligent Natural Language Learning. Natural Language Processing (NLP) has progressed rapidly using corpus-based machine learning techniques. However, corpus development costs cause a ?data bottleneck? which prevents systems from reaching human competence. This project overcomes the difficulties of creating huge corpora by employing the innate language ability of untrained contributors. We will show how to automatically select and present examples, containing informative linguistic structures, which are most beneficial for training NLP systems. These examples will be analysed by many contributors whose responses will be automatically collated into corpora. Huge corpora are vital to emerging language technologies for managing textual information in the global economy.
Read moreRead less
Understanding Indonesian: developing a machine-usable grammar, dictionary and corpus. Australia's relationship with Indonesia is of great significance. The need for good relationships founded on appreciation of the range of societies and views in modern Indonesia is widely acknowledged. A better knowledge of the languages is essential for this, and so are fast, efficient information gathering systems for processing multilingual sources (including Indonesian text), that can analyse large volumes ....Understanding Indonesian: developing a machine-usable grammar, dictionary and corpus. Australia's relationship with Indonesia is of great significance. The need for good relationships founded on appreciation of the range of societies and views in modern Indonesia is widely acknowledged. A better knowledge of the languages is essential for this, and so are fast, efficient information gathering systems for processing multilingual sources (including Indonesian text), that can analyse large volumes of text. The skills to build such systems exist internationally. Through collaboration with established international teams, we plan to transfer cutting-edge skills in the development of machine-useable grammars to Australian researchers, and to create the language resources essential for understanding Indonesian.Read moreRead less
Exploring Scientific Information with Advanced New Search Tools. The rapidly growth of scientific literature in many fields makes finding information a challenge. For example, biologists produce over 1 million articles each year. Existing search tools have only limited success satisfying the demands of scientists' queries. This project will deliver intelligent e-research assistants capable of answering scientists' questions directly rather than returning a list of documents. This will allow scie ....Exploring Scientific Information with Advanced New Search Tools. The rapidly growth of scientific literature in many fields makes finding information a challenge. For example, biologists produce over 1 million articles each year. Existing search tools have only limited success satisfying the demands of scientists' queries. This project will deliver intelligent e-research assistants capable of answering scientists' questions directly rather than returning a list of documents. This will allow scientists to more efficiently exploit the literature enabling them to be more innovative and productive. This technology is applicable where ever finding facts in large volumes of text is critical, e.g. analysing surveillance material. Advanced search tools will have considerable academic and industrial impact.Read moreRead less