Parsing the web: Exploiting redundancy to understand language. This project will automatically learn the grammatical structure of language by exploiting redundancy of facts, like 'Mozart was born in 1756', from a trillion words of web text. These facts will be used to understand more complex sentences. This will enable smart information use of text with grammatical information for large-scale information access for the first time. This project will strengthen Australia's world-class expertise, ....Parsing the web: Exploiting redundancy to understand language. This project will automatically learn the grammatical structure of language by exploiting redundancy of facts, like 'Mozart was born in 1756', from a trillion words of web text. These facts will be used to understand more complex sentences. This will enable smart information use of text with grammatical information for large-scale information access for the first time. This project will strengthen Australia's world-class expertise, providing opportunities for future researchers in this area. Our expanded C&C tools and trillion word corpus will be used by academics, companies and governments, in Australia and internationally, aiding applications including financial surveillance and fraud detection.
Read moreRead less
Exploring Scientific Information with Advanced New Search Tools. The rapidly growth of scientific literature in many fields makes finding information a challenge. For example, biologists produce over 1 million articles each year. Existing search tools have only limited success satisfying the demands of scientists' queries. This project will deliver intelligent e-research assistants capable of answering scientists' questions directly rather than returning a list of documents. This will allow scie ....Exploring Scientific Information with Advanced New Search Tools. The rapidly growth of scientific literature in many fields makes finding information a challenge. For example, biologists produce over 1 million articles each year. Existing search tools have only limited success satisfying the demands of scientists' queries. This project will deliver intelligent e-research assistants capable of answering scientists' questions directly rather than returning a list of documents. This will allow scientists to more efficiently exploit the literature enabling them to be more innovative and productive. This technology is applicable where ever finding facts in large volumes of text is critical, e.g. analysing surveillance material. Advanced search tools will have considerable academic and industrial impact.Read moreRead less