Information Delivery from Segmented Textual Data Streams. This project will contribute to the advancement of ICT innovation in Australia by developing a robust, reusable language understanding engine. The technology will be tailored to web applications, in the form of a conceptually-aware web search engine capable of tracking cross-document dialogues and identifying the core semantic thread of the dialogue. It will place Australia at the forefront of next-generation language technology developme ....Information Delivery from Segmented Textual Data Streams. This project will contribute to the advancement of ICT innovation in Australia by developing a robust, reusable language understanding engine. The technology will be tailored to web applications, in the form of a conceptually-aware web search engine capable of tracking cross-document dialogues and identifying the core semantic thread of the dialogue. It will place Australia at the forefront of next-generation language technology development, with applications in areas including concept-based multi-document summarisation and email surveillance.Read moreRead less
A scalable and portable question-answering system. The current availability of large volumes of free text digitally stored demands the development of methodologies that can automatically find specific answers to user questions about this "unstructured" information. The goal of this project is to develop a scalable portable and domain-independent real-time natural-language question-answering system that explores the logical contents of the text. To achieve this we will fuse current approaches to ....A scalable and portable question-answering system. The current availability of large volumes of free text digitally stored demands the development of methodologies that can automatically find specific answers to user questions about this "unstructured" information. The goal of this project is to develop a scalable portable and domain-independent real-time natural-language question-answering system that explores the logical contents of the text. To achieve this we will fuse current approaches to question answering with approaches that look at the logical contents of the questions and answer candidates. A central part of the project will be the characterisation of the optimal logical forms, the determination of efficient methods to create and store sentence logical forms of potentially large volumes of text, and the treatment of difficult questions by incorporating summarisation and text generation techniques.Read moreRead less
A Layered Controlled Natural Language for Knowledge Representation. In this research project we will develop a controlled natural language for knowledge representation that has the potential to bridge the gap between fragments of natural language and formal languages. This controlled language will be based on a variety of increasing sophisticated layers, each building upon those below it by providing enhancements in expressive power. Sentences of the controlled language will be unambiguously tra ....A Layered Controlled Natural Language for Knowledge Representation. In this research project we will develop a controlled natural language for knowledge representation that has the potential to bridge the gap between fragments of natural language and formal languages. This controlled language will be based on a variety of increasing sophisticated layers, each building upon those below it by providing enhancements in expressive power. Sentences of the controlled language will be unambiguously translatable into a corresponding formal language. Anyone who can read and write English can immediately use the controlled language with the help an intelligent text editor. This technology will make it possible for non-specialists to write problem specifications in terms of the application domain without the need to formally encode the information.Read moreRead less
Query interpretation and response generation in large on-line resources. The unprecedented information explosion associated with the evolution of the Internet makes salient the challenge of providing users with answers to queries posed to Internet resources. The proposed project will apply machine learning and reasoning under uncertainty techniques to leverage the large amount of data found in the Internet in order to perform three tasks: (1) infer users' informational goals from their questions ....Query interpretation and response generation in large on-line resources. The unprecedented information explosion associated with the evolution of the Internet makes salient the challenge of providing users with answers to queries posed to Internet resources. The proposed project will apply machine learning and reasoning under uncertainty techniques to leverage the large amount of data found in the Internet in order to perform three tasks: (1) infer users' informational goals from their questions, (2) modify questions to improve the accuracy of retrieval engines, and (3) compose concise replies from the retrieved documents. The envisioned outcome of this project is a system that will generate replies to questions posed to on-line resources.Read moreRead less
A Minimum Message Length Approach for Discourse Interpretation. The ability to communicate with computer systems in Natural Language has great potential to improve the overall experience for users. However, current systems support only limited means of communication. In this project, we propose to investigate the application of model-selection techniques for interpreting human discourse. In particular, we will consider situations where the computer interprets users' discourse in the context of i ....A Minimum Message Length Approach for Discourse Interpretation. The ability to communicate with computer systems in Natural Language has great potential to improve the overall experience for users. However, current systems support only limited means of communication. In this project, we propose to investigate the application of model-selection techniques for interpreting human discourse. In particular, we will consider situations where the computer interprets users' discourse in the context of its own knowledge. The versatility of our approach will be demonstrated by using it to (1) interpret discourse in a human-computer dialogue, (2) provide feedback to short essays, and (3) determine the impact of a document on a model.Read moreRead less
Automatic Ontology Learning and Data Reasoning in Web Mining. This research has an impact on both research and practical applications. In research, it provides opportunities for research students to carry out research using both data mining and data reasoning to solving Web based application problems. In practical, it can help IT industry to design the new generation of Web mining systems in order to provide invaluable service to users. This research also develops new techniques for data automa ....Automatic Ontology Learning and Data Reasoning in Web Mining. This research has an impact on both research and practical applications. In research, it provides opportunities for research students to carry out research using both data mining and data reasoning to solving Web based application problems. In practical, it can help IT industry to design the new generation of Web mining systems in order to provide invaluable service to users. This research also develops new techniques for data automatic processing within areas of smart information use in Australia. In particular it further develops data mining techniques by introducing data reasoning models for using discovered knowledge. It must be useful to improve the efficiency of the existing data mining systems. Read moreRead less
Supporting adaptive, interactive documents. The project will improve comprehensibility of technical material, reduce paper usage, encourage collaborative science, improve the reliability of published science (by allowing post-publication annotation and correction), and improve the accessibility of technical material for readers who are blind or have poor vision. The project also holds considerable potential for supporting Australian companies in the publishing and document processing industries.
Parsing the web: Exploiting redundancy to understand language. This project will automatically learn the grammatical structure of language by exploiting redundancy of facts, like 'Mozart was born in 1756', from a trillion words of web text. These facts will be used to understand more complex sentences. This will enable smart information use of text with grammatical information for large-scale information access for the first time. This project will strengthen Australia's world-class expertise, ....Parsing the web: Exploiting redundancy to understand language. This project will automatically learn the grammatical structure of language by exploiting redundancy of facts, like 'Mozart was born in 1756', from a trillion words of web text. These facts will be used to understand more complex sentences. This will enable smart information use of text with grammatical information for large-scale information access for the first time. This project will strengthen Australia's world-class expertise, providing opportunities for future researchers in this area. Our expanded C&C tools and trillion word corpus will be used by academics, companies and governments, in Australia and internationally, aiding applications including financial surveillance and fraud detection.
Read moreRead less
A study of the potential for the public to be involved in the design of large scale public works. Public acceptability of infrastructure such as desalination plants or new public spaces, is a concern for the Australian Commonwealth and State Governments. However, tensions exist between the need for expedient planning and development of critical public infrastructure and Australian principles of democratic social and economic participation. The instrument developed by this research will inform pu ....A study of the potential for the public to be involved in the design of large scale public works. Public acceptability of infrastructure such as desalination plants or new public spaces, is a concern for the Australian Commonwealth and State Governments. However, tensions exist between the need for expedient planning and development of critical public infrastructure and Australian principles of democratic social and economic participation. The instrument developed by this research will inform public policy to negotiate and understand arrangements that balance social participation with Government objectives.Read moreRead less
Incremental Knowledge Acquisition for Machine Translation from Multiple Experts. With increasing globalisation and an increasing amount of electronically available documents the need for machine translation is growing dramatically. The state-of-the-art in machine translation is still far from satisfactory. Substantial post-editing is necessary for most non-technical texts and even for many technical documents to make the translation really understandable. This project will develop a new approach ....Incremental Knowledge Acquisition for Machine Translation from Multiple Experts. With increasing globalisation and an increasing amount of electronically available documents the need for machine translation is growing dramatically. The state-of-the-art in machine translation is still far from satisfactory. Substantial post-editing is necessary for most non-technical texts and even for many technical documents to make the translation really understandable. This project will develop a new approach for buildingmachine translation systems by extending the unorthodox approach of Ripple-Down Rules, which proved very successful for building expert systems in the medical domain.It is intended to build a machine translation system by integrating the knowledge from many experts.Read moreRead less