Exploring Scientific Information with Advanced New Search Tools. The rapidly growth of scientific literature in many fields makes finding information a challenge. For example, biologists produce over 1 million articles each year. Existing search tools have only limited success satisfying the demands of scientists' queries. This project will deliver intelligent e-research assistants capable of answering scientists' questions directly rather than returning a list of documents. This will allow scie ....Exploring Scientific Information with Advanced New Search Tools. The rapidly growth of scientific literature in many fields makes finding information a challenge. For example, biologists produce over 1 million articles each year. Existing search tools have only limited success satisfying the demands of scientists' queries. This project will deliver intelligent e-research assistants capable of answering scientists' questions directly rather than returning a list of documents. This will allow scientists to more efficiently exploit the literature enabling them to be more innovative and productive. This technology is applicable where ever finding facts in large volumes of text is critical, e.g. analysing surveillance material. Advanced search tools will have considerable academic and industrial impact.Read moreRead less
Parsing the web: Exploiting redundancy to understand language. This project will automatically learn the grammatical structure of language by exploiting redundancy of facts, like 'Mozart was born in 1756', from a trillion words of web text. These facts will be used to understand more complex sentences. This will enable smart information use of text with grammatical information for large-scale information access for the first time. This project will strengthen Australia's world-class expertise, ....Parsing the web: Exploiting redundancy to understand language. This project will automatically learn the grammatical structure of language by exploiting redundancy of facts, like 'Mozart was born in 1756', from a trillion words of web text. These facts will be used to understand more complex sentences. This will enable smart information use of text with grammatical information for large-scale information access for the first time. This project will strengthen Australia's world-class expertise, providing opportunities for future researchers in this area. Our expanded C&C tools and trillion word corpus will be used by academics, companies and governments, in Australia and internationally, aiding applications including financial surveillance and fraud detection.
Read moreRead less
Special Research Initiatives - Grant ID: SR0567353
Funder
Australian Research Council
Funding Amount
$98,035.00
Summary
An Intelligent Search Infrastructure for Language Resources on the Web
. Language occupies a central role on the web: most content is expressed in language, and most access takes place via natural language search. Today, investigation of human language depends on access to this vast store of language data. This project will develop new infrastructure for accessing language resources, namely a language-aware search engine. Language technologies will be employed to classify web content, and a ....An Intelligent Search Infrastructure for Language Resources on the Web
. Language occupies a central role on the web: most content is expressed in language, and most access takes place via natural language search. Today, investigation of human language depends on access to this vast store of language data. This project will develop new infrastructure for accessing language resources, namely a language-aware search engine. Language technologies will be employed to classify web content, and a special search keyword 'lang:' will constrain search results to be in the specified language. The system will be integrated with major language archives in Australia and overseas, and deployed on the high performance computing infrastructure at Melbourne University's Advanced Research Computing Centre.
Read moreRead less