Accurate Context-Aware Search for Managed Document Collections. Search is a key component of a vast range of computing applications. However, while search on large collections of general-purpose text is well-understood, and the best systems are highly effective, search on smaller or special-purpose collections is much less reliable and has attracted relatively little research. By identifying general ways of using context such as user history or prior page usefulness, the quality of search in s ....Accurate Context-Aware Search for Managed Document Collections. Search is a key component of a vast range of computing applications. However, while search on large collections of general-purpose text is well-understood, and the best systems are highly effective, search on smaller or special-purpose collections is much less reliable and has attracted relatively little research. By identifying general ways of using context such as user history or prior page usefulness, the quality of search in such cases can be greatly improved. Products that make use of these principles will provide greater workplace efficiency and be able to locate information that other search tools cannot identify.Read moreRead less
Using Past Queries for Fast and Accurate Web Searching. Searching the entire Internet, or a company web site, has become a vital task for modern organisations. While there has been significant research into improving search engines through using web pages themselves, very little attention has been paid to improving web search by exploiting the vast numbers of queries that users submit to search engines each day. This project will use state of the art compression and algorithmic techniques to imp ....Using Past Queries for Fast and Accurate Web Searching. Searching the entire Internet, or a company web site, has become a vital task for modern organisations. While there has been significant research into improving search engines through using web pages themselves, very little attention has been paid to improving web search by exploiting the vast numbers of queries that users submit to search engines each day. This project will use state of the art compression and algorithmic techniques to improve the speed and accuracy of web search using data gleaned from millions of Internet queries (provided under agreement by Microsoft). Improving search engines will have a direct benefit to many Australian industries, and support the government's priority area of "smart information use".Read moreRead less
XML Views of Relational Databases: Semantics and Update Problems. XML is the standard for representing, publishing and exchanging data over the Internet and relational database is the dominant technology for data management. Updating XML views over relational data is fundamental to bring these two technologies together to serve Internet-based applications. Australia has been a leading country in both developing and applying internet technologies. The theoretic outcomes of this project will contr ....XML Views of Relational Databases: Semantics and Update Problems. XML is the standard for representing, publishing and exchanging data over the Internet and relational database is the dominant technology for data management. Updating XML views over relational data is fundamental to bring these two technologies together to serve Internet-based applications. Australia has been a leading country in both developing and applying internet technologies. The theoretic outcomes of this project will contribute to the advance in database and web research communities and establish us as an internationally leading group in this research area. The technological outcomes will help organisations in Australia effectively and efficiently conduct e-Business on the Internet. Read moreRead less
Comparative analysis and exploration of collections of data clusterings. Data clustering is an important technique for extracting knowledge from complex datasets. It is widely used by Australian science, government and industry, in areas such as genomics, proteomics, crime analysis, marketing and customer profiling. This project will develop new techniques that will allow users to explore and analyse collections of data clusterings. This will improve the current generation of clustering softw ....Comparative analysis and exploration of collections of data clusterings. Data clustering is an important technique for extracting knowledge from complex datasets. It is widely used by Australian science, government and industry, in areas such as genomics, proteomics, crime analysis, marketing and customer profiling. This project will develop new techniques that will allow users to explore and analyse collections of data clusterings. This will improve the current generation of clustering software and allow deeper investigation of challenging and complex data.
Read moreRead less
Linkage Infrastructure, Equipment And Facilities - Grant ID: LE0561231
Funder
Australian Research Council
Funding Amount
$671,715.00
Summary
MRI GRID Computing Facility: Design, Optimisation and Image Processing. The MRI Grid Computing Facility provides the IT infrastructure to achieve effective e-research in the area of magnetic resonance (MR) imaging, a field of neuroscience research that revolutionizes the way brain diseases are identified and treated. The facility consists of a dedicated high performance grid compute engine, distributed visualisation workstations, and distributed data warehouse facilities. Software tools acc ....MRI GRID Computing Facility: Design, Optimisation and Image Processing. The MRI Grid Computing Facility provides the IT infrastructure to achieve effective e-research in the area of magnetic resonance (MR) imaging, a field of neuroscience research that revolutionizes the way brain diseases are identified and treated. The facility consists of a dedicated high performance grid compute engine, distributed visualisation workstations, and distributed data warehouse facilities. Software tools accessible through the Internet will enable researchers to archive, retrieve and exchange data and software; access distributed MR image databases and the latest MR image analysis tools; schedule analysis tasks on the grid compute engine, the outcomes of which will be visualized by the visualization workstations.Read moreRead less
eResearch in the Neurosciences: Building collaborations in Asia. The proposed Australasian collaboration on eResearch in Neuroscience will promote and maintain the good health of Australians by 'improving critical mass through collaboration and information sharing' through increased access to advanced imaging technology in Korea and analysis techniques in Japan. The collaboration will also promote frontier technologies for building and transforming Australian industries by developing a creative ....eResearch in the Neurosciences: Building collaborations in Asia. The proposed Australasian collaboration on eResearch in Neuroscience will promote and maintain the good health of Australians by 'improving critical mass through collaboration and information sharing' through increased access to advanced imaging technology in Korea and analysis techniques in Japan. The collaboration will also promote frontier technologies for building and transforming Australian industries by developing a creative and innovative research environment and enhancing Australian scientists' participation in breakthrough science. Great national benefit can be derived from international research collaboration, due to the contribution frontier technology can make to science and health. Read moreRead less
Efficient Synchronisation of Large Repositories. Accuracy and maintenance of vast quantities of data are essential for any modern society. The economy, health institutes and industries, and our defence and legal systems rely on having data being distributed widely and securely, and on queries being answered accurately and quickly. Complete synchronisation of databases is often impossible due to the limitations of internet bandwidth. Better compression techniques have the potential to allow crit ....Efficient Synchronisation of Large Repositories. Accuracy and maintenance of vast quantities of data are essential for any modern society. The economy, health institutes and industries, and our defence and legal systems rely on having data being distributed widely and securely, and on queries being answered accurately and quickly. Complete synchronisation of databases is often impossible due to the limitations of internet bandwidth. Better compression techniques have the potential to allow critical data to be distributed much more efficiently; we anticipate in some applications that the size of a compressed file could be reduced tenfold or more compared to previous best methods, leading to dramatic savings.Read moreRead less
A new erasure resilient technique for encoding internet packets. Efficient internet communication tolerates losing some packets sent across the web by sending a bit more information than is required. Any holes in the transmission can be repaired using the redundant data. We propose a new transmission protocol that is much simpler to encode and repairs broken messages faster. This new approach, based on sending data plus summed versions of itself, has generic applicability across all packet switc ....A new erasure resilient technique for encoding internet packets. Efficient internet communication tolerates losing some packets sent across the web by sending a bit more information than is required. Any holes in the transmission can be repaired using the redundant data. We propose a new transmission protocol that is much simpler to encode and repairs broken messages faster. This new approach, based on sending data plus summed versions of itself, has generic applicability across all packet switched information networks.Read moreRead less
SeqSeeker: a search engine for large numbers of very long sequences. Large sets of very long sequences arise in many important domains. Well known examples are time series sequences in financial markets and meteorology and DNA and protein sequences in biology. This project will develop a search system, SeqSeeker, that can perform search on massive databases of such sequences. This will allow experts from many domains to get more value from their data and to investigate datasets which are cu ....SeqSeeker: a search engine for large numbers of very long sequences. Large sets of very long sequences arise in many important domains. Well known examples are time series sequences in financial markets and meteorology and DNA and protein sequences in biology. This project will develop a search system, SeqSeeker, that can perform search on massive databases of such sequences. This will allow experts from many domains to get more value from their data and to investigate datasets which are currently beyond the reach of today's technology.Read moreRead less
Indexes Allowing Fast and Efficient Text Search. Since the arrival of search engines such as Google, it has become an expectation that we can find a few words in a very large amount of text very quickly. This is also true of biologists, who expect to be able to submit protein sequences for matching against a massive database and retrieve and answer in seconds. If successful, this project will invent fundamental software that will allow the discovery of words in text much more quickly, and using ....Indexes Allowing Fast and Efficient Text Search. Since the arrival of search engines such as Google, it has become an expectation that we can find a few words in a very large amount of text very quickly. This is also true of biologists, who expect to be able to submit protein sequences for matching against a massive database and retrieve and answer in seconds. If successful, this project will invent fundamental software that will allow the discovery of words in text much more quickly, and using less computing resources than current methods. The benefits of this technology to the searching public, scientists, and industry will be immediate as productivity will be improved and costs reduced.Read moreRead less