Language engineering in the field: preserving 100 endangered languages in New Guinea. Efforts to preserve the world's endangered linguistic heritage are labour-intensive, and unable to keep up with the pace of language loss. This project investigates a new approach to language preservation, using techniques from language engineering, and leveraging the labour of mother-tongue speakers.
Biochemical text mining for advancing chemical and pharmaceutical knowledge. The project aims to develop novel natural language processing methods to find, extract and structure complex chemical reaction information in scientific literature. The project addresses a recognised bottleneck to efficiency in the drug discovery process, by enabling biochemical research results to be turned into actionable information. This has the potential to inform and accelerate development of effective drug treatments through the linking of relevant biochemical information. By delivering new methods that improve the compilation of knowledge about chemicals and drugs from textual information resources, the project hopes to enable faster drug discovery.
Automated assessment of data quality in biological knowledge resources. This project aims to develop methods for identifying poor quality data in biological databases. Research in biomedicine is underpinned by massive databases of biological data. Data quality is largely managed through manual curation, but automated methods to assess quality are critically needed. This project expects to develop a suite of computational tools for assessing biological data quality, utilising an innovative approach based on network analysis of database record connectivity. These tools will enable quantifying data quality at scale. Researchers, evidence-based decision-makers in biomedicine, and the analytical or predictive tools that use this data will make more reliable inferences and decisions.
Spoken conversational search: contextual interactive techniques to support effective information search over a speech-only communication channel. This project will develop new techniques for effective information search using speech only, supporting improved information access for visually impaired people or in situations that require focused visual attention (e.g. driving). The techniques are based on a conversational approach to information search and presentation of results.
Making human place knowledge digestible by computers. This project aims to develop the tools that will enable people to interact intuitively with computers about places and the relations between places. People understand their environment in a different way to computers; they think of places and their relations, while computers use coordinates and maps. People’s interaction with maps is cognitively costly and error-prone, which is becoming untenable in situations needing time-critical decision making. The project will revolutionise the design of information services where computers deal with humans and location in time-critical or stressful situations, including emergency calls, disaster response and local search queries. The uptake of this design by industry will lead to economic benefits as well as a safer society living in a smarter environment.
Personalised topic modelling and sentiment analysis for enhanced information discovery over document streams. This project will develop personalised information discovery, navigation and management systems of online content for the creative industries, e.g. to help advertising agencies understand market trends, and enable designers to discover and analyse information relating to new product concepts.
Natural language processing for automated validation of protein databases. The project aims to use natural language processing and information retrieval to reconcile and improve sources of biological information. Biological research has produced vast volumes of information about proteins, captured in structured resources (databases) and unstructured documents. However, the accuracy of much of this information is questionable. The project proposes to develop methods to validate data and reduce the dramatic inconsistencies in protein information resources by leveraging observed correlations and complementarity between them, and specifically through targeted fact extraction from the biomedical literature. These methods will be applied at scale across millions of published articles, to infer and validate functional information.
Talking about place: tapping human knowledge to enrich national spatial data sets. Place descriptions are a common way for people to describe a location, but no current tools are smart enough to understand them. Emergency call centres are risking lives, users of navigation or web services are frustrated and addressing these problems costs billions of dollars per year. This project comes with a novel, interdisciplinary approach to automatically interpret human place descriptions and will develop novel methods to capture placenames with their meaning for smarter databases and automatic interpretation procedures. This acquired knowledge will be an important step forward for Australia's data custodians and users. Australia's location information industry will gain a significant advantage on a highly competitive global market.