New methodologies for representing and accessing resources on endangered languages: a case study from South Efate. Linguists produce material which has immense cultural significance as it is often the only record of endangered cultures. With new technologies come new ways of working with indigenous languages. This APD will develop an innovative methodology for documenting and archiving data from a language of the Pacific. It will do this by linking a dictionary, texts, audio, video, images and a ....New methodologies for representing and accessing resources on endangered languages: a case study from South Efate. Linguists produce material which has immense cultural significance as it is often the only record of endangered cultures. With new technologies come new ways of working with indigenous languages. This APD will develop an innovative methodology for documenting and archiving data from a language of the Pacific. It will do this by linking a dictionary, texts, audio, video, images and a grammar to facilitate presentation of both the data and its analysis to speakers, fellow linguists, and the general public. The methodology developed in this APD will result in innovative linguistic data management techniques conformant to emerging international standards.Read moreRead less
Special Research Initiatives - Grant ID: SR0567353
Funder
Australian Research Council
Funding Amount
$98,035.00
Summary
An Intelligent Search Infrastructure for Language Resources on the Web
. Language occupies a central role on the web: most content is expressed in language, and most access takes place via natural language search. Today, investigation of human language depends on access to this vast store of language data. This project will develop new infrastructure for accessing language resources, namely a language-aware search engine. Language technologies will be employed to classify web content, and a ....An Intelligent Search Infrastructure for Language Resources on the Web
. Language occupies a central role on the web: most content is expressed in language, and most access takes place via natural language search. Today, investigation of human language depends on access to this vast store of language data. This project will develop new infrastructure for accessing language resources, namely a language-aware search engine. Language technologies will be employed to classify web content, and a special search keyword 'lang:' will constrain search results to be in the specified language. The system will be integrated with major language archives in Australia and overseas, and deployed on the high performance computing infrastructure at Melbourne University's Advanced Research Computing Centre.
Read moreRead less
Parsing the web: Exploiting redundancy to understand language. This project will automatically learn the grammatical structure of language by exploiting redundancy of facts, like 'Mozart was born in 1756', from a trillion words of web text. These facts will be used to understand more complex sentences. This will enable smart information use of text with grammatical information for large-scale information access for the first time. This project will strengthen Australia's world-class expertise, ....Parsing the web: Exploiting redundancy to understand language. This project will automatically learn the grammatical structure of language by exploiting redundancy of facts, like 'Mozart was born in 1756', from a trillion words of web text. These facts will be used to understand more complex sentences. This will enable smart information use of text with grammatical information for large-scale information access for the first time. This project will strengthen Australia's world-class expertise, providing opportunities for future researchers in this area. Our expanded C&C tools and trillion word corpus will be used by academics, companies and governments, in Australia and internationally, aiding applications including financial surveillance and fraud detection.
Read moreRead less
Exploring Scientific Information with Advanced New Search Tools. The rapidly growth of scientific literature in many fields makes finding information a challenge. For example, biologists produce over 1 million articles each year. Existing search tools have only limited success satisfying the demands of scientists' queries. This project will deliver intelligent e-research assistants capable of answering scientists' questions directly rather than returning a list of documents. This will allow scie ....Exploring Scientific Information with Advanced New Search Tools. The rapidly growth of scientific literature in many fields makes finding information a challenge. For example, biologists produce over 1 million articles each year. Existing search tools have only limited success satisfying the demands of scientists' queries. This project will deliver intelligent e-research assistants capable of answering scientists' questions directly rather than returning a list of documents. This will allow scientists to more efficiently exploit the literature enabling them to be more innovative and productive. This technology is applicable where ever finding facts in large volumes of text is critical, e.g. analysing surveillance material. Advanced search tools will have considerable academic and industrial impact.Read moreRead less