Sentiment detection from opinion surveys -- the quest for customer and employee satisfaction. The research will yield improved international standing through scientific advances disseminated through high impact refereed publications and open source software. The advances made through the application of sophisticated probabilistic techniques to Language Technology problems will attract post-graduate students, and promote commercial interest. The demonstration prototype will provide proof of conce ....Sentiment detection from opinion surveys -- the quest for customer and employee satisfaction. The research will yield improved international standing through scientific advances disseminated through high impact refereed publications and open source software. The advances made through the application of sophisticated probabilistic techniques to Language Technology problems will attract post-graduate students, and promote commercial interest. The demonstration prototype will provide proof of concept of an application that enables business intelligence to automatically process free-form feedback from customers and employees, with resultant recommendations leading to increased customer and employee satisfaction. The applicability of the outcomes of this research to service industries will further improve Australia's service reputation.Read moreRead less
Efficient data manipulation in document classification. Document Classification has an enormous relevance in an era where large amounts of textual information is available. Document Classification is based on statistical and machine learning techniques that model documents represented as points in a multidimensional space. The Computer Engineering Laboratory (CEL) has ongoing projects using neural networks and other techniques for document classification. We are developing a development environm ....Efficient data manipulation in document classification. Document Classification has an enormous relevance in an era where large amounts of textual information is available. Document Classification is based on statistical and machine learning techniques that model documents represented as points in a multidimensional space. The Computer Engineering Laboratory (CEL) has ongoing projects using neural networks and other techniques for document classification. We are developing a development environment for large classification tasks, and Prof. Lee¡¯s work will focus in managing large amounts of data for them. Using his experience in data compression, databases and web applications, he will produce a set of tools for handling Gigabytes of textual data in our classification environment.Read moreRead less
Defence Against Phishing Attacks. Australian businesses and citizens are losing millions of dollars in cybercrimes every year. Rural and regional businesses depend on the integrity of their Internet banking service, and yet, cybercriminals are working hard to defraud these users. This project aims to build a reliable defence against phishing attacks which rely on social engineering to steal online identities, using intelligence gathered from the brazen trade of credentials in the public domain.
Ask the Net: Intelligent Natural Language Learning. Natural Language Processing (NLP) has progressed rapidly using corpus-based machine learning techniques. However, corpus development costs cause a ?data bottleneck? which prevents systems from reaching human competence. This project overcomes the difficulties of creating huge corpora by employing the innate language ability of untrained contributors. We will show how to automatically select and present examples, containing informative lingui ....Ask the Net: Intelligent Natural Language Learning. Natural Language Processing (NLP) has progressed rapidly using corpus-based machine learning techniques. However, corpus development costs cause a ?data bottleneck? which prevents systems from reaching human competence. This project overcomes the difficulties of creating huge corpora by employing the innate language ability of untrained contributors. We will show how to automatically select and present examples, containing informative linguistic structures, which are most beneficial for training NLP systems. These examples will be analysed by many contributors whose responses will be automatically collated into corpora. Huge corpora are vital to emerging language technologies for managing textual information in the global economy.
Read moreRead less
A Layered Controlled Natural Language for Knowledge Representation. In this research project we will develop a controlled natural language for knowledge representation that has the potential to bridge the gap between fragments of natural language and formal languages. This controlled language will be based on a variety of increasing sophisticated layers, each building upon those below it by providing enhancements in expressive power. Sentences of the controlled language will be unambiguously tra ....A Layered Controlled Natural Language for Knowledge Representation. In this research project we will develop a controlled natural language for knowledge representation that has the potential to bridge the gap between fragments of natural language and formal languages. This controlled language will be based on a variety of increasing sophisticated layers, each building upon those below it by providing enhancements in expressive power. Sentences of the controlled language will be unambiguously translatable into a corresponding formal language. Anyone who can read and write English can immediately use the controlled language with the help an intelligent text editor. This technology will make it possible for non-specialists to write problem specifications in terms of the application domain without the need to formally encode the information.Read moreRead less
A scalable and portable question-answering system. The current availability of large volumes of free text digitally stored demands the development of methodologies that can automatically find specific answers to user questions about this "unstructured" information. The goal of this project is to develop a scalable portable and domain-independent real-time natural-language question-answering system that explores the logical contents of the text. To achieve this we will fuse current approaches to ....A scalable and portable question-answering system. The current availability of large volumes of free text digitally stored demands the development of methodologies that can automatically find specific answers to user questions about this "unstructured" information. The goal of this project is to develop a scalable portable and domain-independent real-time natural-language question-answering system that explores the logical contents of the text. To achieve this we will fuse current approaches to question answering with approaches that look at the logical contents of the questions and answer candidates. A central part of the project will be the characterisation of the optimal logical forms, the determination of efficient methods to create and store sentence logical forms of potentially large volumes of text, and the treatment of difficult questions by incorporating summarisation and text generation techniques.Read moreRead less
An knowledge-based approach to multi-document text summarisation for automated meta-analysis of the scientific literature. The biomedical sciences produce literature at an exponential rate, and the size of this knowledge base far exceeds the capacity of humans to keep up with the growth in new knowledge. This project will develop computational text summarisation methods to abstract the content of scientific journal articles reporting clinical trials, and develop multi-document summarisation meth ....An knowledge-based approach to multi-document text summarisation for automated meta-analysis of the scientific literature. The biomedical sciences produce literature at an exponential rate, and the size of this knowledge base far exceeds the capacity of humans to keep up with the growth in new knowledge. This project will develop computational text summarisation methods to abstract the content of scientific journal articles reporting clinical trials, and develop multi-document summarisation methods to synthesise these abstracts using automated statistical meta-analysis methods. These methods have broad potential to improve text-summarisation technologies in general, to profoundly enhance our ability to integrate published knowledge, and to make a highly significant and specific contribution to improving the quality of evidence used in health decision-making. Read moreRead less
Query interpretation and response generation in large on-line resources. The unprecedented information explosion associated with the evolution of the Internet makes salient the challenge of providing users with answers to queries posed to Internet resources. The proposed project will apply machine learning and reasoning under uncertainty techniques to leverage the large amount of data found in the Internet in order to perform three tasks: (1) infer users' informational goals from their questions ....Query interpretation and response generation in large on-line resources. The unprecedented information explosion associated with the evolution of the Internet makes salient the challenge of providing users with answers to queries posed to Internet resources. The proposed project will apply machine learning and reasoning under uncertainty techniques to leverage the large amount of data found in the Internet in order to perform three tasks: (1) infer users' informational goals from their questions, (2) modify questions to improve the accuracy of retrieval engines, and (3) compose concise replies from the retrieved documents. The envisioned outcome of this project is a system that will generate replies to questions posed to on-line resources.Read moreRead less
A study of the potential for the public to be involved in the design of large scale public works. Public acceptability of infrastructure such as desalination plants or new public spaces, is a concern for the Australian Commonwealth and State Governments. However, tensions exist between the need for expedient planning and development of critical public infrastructure and Australian principles of democratic social and economic participation. The instrument developed by this research will inform pu ....A study of the potential for the public to be involved in the design of large scale public works. Public acceptability of infrastructure such as desalination plants or new public spaces, is a concern for the Australian Commonwealth and State Governments. However, tensions exist between the need for expedient planning and development of critical public infrastructure and Australian principles of democratic social and economic participation. The instrument developed by this research will inform public policy to negotiate and understand arrangements that balance social participation with Government objectives.Read moreRead less
Concept-Based Multilingual Web Content Mining. Towards smart use of Web information, this pioneer project will develop an innovative concept-based approach for discovering global knowledge embedded within multilingual Web documents. Departing from the traditional bilingual term-to-term machine translation techniques, the approach overcomes the notorious vocabulary mismatch problem by enabling synchronised lexical mapping of multiple languages. A series of intelligent concept-based techniques usi ....Concept-Based Multilingual Web Content Mining. Towards smart use of Web information, this pioneer project will develop an innovative concept-based approach for discovering global knowledge embedded within multilingual Web documents. Departing from the traditional bilingual term-to-term machine translation techniques, the approach overcomes the notorious vocabulary mismatch problem by enabling synchronised lexical mapping of multiple languages. A series of intelligent concept-based techniques using fuzzy logic and neural networks will be investigated to support smart Web information browsing and exploration. This project will provide valuable new insights into developing state-of-the-art multilingual Web mining applications for enhancing business intelligence in Australia's knowledge driven industries.Read moreRead less