Efficient data manipulation in document classification. Document Classification has an enormous relevance in an era where large amounts of textual information is available. Document Classification is based on statistical and machine learning techniques that model documents represented as points in a multidimensional space. The Computer Engineering Laboratory (CEL) has ongoing projects using neural networks and other techniques for document classification. We are developing a development environm ....Efficient data manipulation in document classification. Document Classification has an enormous relevance in an era where large amounts of textual information is available. Document Classification is based on statistical and machine learning techniques that model documents represented as points in a multidimensional space. The Computer Engineering Laboratory (CEL) has ongoing projects using neural networks and other techniques for document classification. We are developing a development environment for large classification tasks, and Prof. Lee¡¯s work will focus in managing large amounts of data for them. Using his experience in data compression, databases and web applications, he will produce a set of tools for handling Gigabytes of textual data in our classification environment.Read moreRead less
Special Research Initiatives - Grant ID: SR0564829
Funder
Australian Research Council
Funding Amount
$80,000.00
Summary
Development of e-Research Tools for an MRI Grid Computing Facility. The proposed middleware tools will provide a resource management system for the national MRI grid computing facility. The main functions of the middleware tools are resource discovery and allocation, job scheduling and monitoring, workflow management and data management. The middleware tools will allow system developers and maintainers to simplify and optimize the development and deployment of MRI grid applications. The complete ....Development of e-Research Tools for an MRI Grid Computing Facility. The proposed middleware tools will provide a resource management system for the national MRI grid computing facility. The main functions of the middleware tools are resource discovery and allocation, job scheduling and monitoring, workflow management and data management. The middleware tools will allow system developers and maintainers to simplify and optimize the development and deployment of MRI grid applications. The complete system incorporating the middleware tools will provide a set of web and application-based user interfaces that allow secure, seamless and uniform access to resources in a heterogeneous grid environment.Read moreRead less
Data Management Technologies for the Magnetic Resonance Imaging e-Research Grid. Howard Florey Institute researchers will collaborate with SGI's file-systems engineering team. Substantial benefits are expected from the development of techniques to support centralized and distributed processing medical image datasets. Issues requiring research include file space allocation algorithms and caching strategies. The proposed rapid database access technologies aim at solving these problems in the medic ....Data Management Technologies for the Magnetic Resonance Imaging e-Research Grid. Howard Florey Institute researchers will collaborate with SGI's file-systems engineering team. Substantial benefits are expected from the development of techniques to support centralized and distributed processing medical image datasets. Issues requiring research include file space allocation algorithms and caching strategies. The proposed rapid database access technologies aim at solving these problems in the medical imaging research context. The project attempts to 'improve data management for existing and new business applications'. This enhanced sharing of information will improve critical mass therefore fostering national and international collaboration. Read moreRead less
Exposing the anonymous attacker: detecting identity crimes using real-time entity resolution on large dynamic databases. Given the increasingly large costs of identity crimes in Australia, developing improved electronic identity verification techniques is highly significant in reducing losses from such crimes, making the Australian economy more competitive, and increasing consumer confidence in Australian financial institutions. Veda Advantage is widely used for identity verification by Australi ....Exposing the anonymous attacker: detecting identity crimes using real-time entity resolution on large dynamic databases. Given the increasingly large costs of identity crimes in Australia, developing improved electronic identity verification techniques is highly significant in reducing losses from such crimes, making the Australian economy more competitive, and increasing consumer confidence in Australian financial institutions. Veda Advantage is widely used for identity verification by Australian financial service providers, so the benefits of the techniques developed in this project will automatically flow through to the Australian community. These techniques will be sufficiently generic to be of use for real-time identity verification in a broad range of applications, including e-Government portals, electronic banking, online stores, or national security systems.Read moreRead less
Extensions to the page scoring algorithm in internet search engine studies. This project proposes to study two extensions to the Page rank equation which is one of the theoretical underpinning of Google's web page scoring engine. In particular, we wish to explore ways to combine page connectivity and page characteristics in the scoring of web pages. This will be the first time a rational way is proposed for combining these two factors. The expected outcome will be a deeper understanding on how t ....Extensions to the page scoring algorithm in internet search engine studies. This project proposes to study two extensions to the Page rank equation which is one of the theoretical underpinning of Google's web page scoring engine. In particular, we wish to explore ways to combine page connectivity and page characteristics in the scoring of web pages. This will be the first time a rational way is proposed for combining these two factors. The expected outcome will be a deeper understanding on how these two factors affect the scores of a web page in a search engine, and hence how they affect the visibility of the page in response to a query.
Read moreRead less
Data structures which change with time, a machine learning approach. Visibility of web pages, based on page importance, on the Internet controls their accessibility by users which is critical for e-Commerce applications. The page importance depends on its contents and its link structure to other web pages, both of which can be time varying. This project proposes a novel model in which time varying aspects of the changes to contents and their link structures are captured, thus allowing us a bette ....Data structures which change with time, a machine learning approach. Visibility of web pages, based on page importance, on the Internet controls their accessibility by users which is critical for e-Commerce applications. The page importance depends on its contents and its link structure to other web pages, both of which can be time varying. This project proposes a novel model in which time varying aspects of the changes to contents and their link structures are captured, thus allowing us a better understanding of how these influence the page importance over time. It will also allow us insight on how to improve the visibility of web pages.Read moreRead less
Investigations in Learning Algorithms for Web Page Scoring Systems. Modification of web page scores to satisfy requirements, e.g., one page should have a higher page score than another, a home page should have higher score than any other pages in the same site, using modifications of the forcing function, and the link connectivity matrix respectively of the PageRank equation will be studied. By clustering web pages either by ranks or by scores will help overcome issues of scale and complexity wh ....Investigations in Learning Algorithms for Web Page Scoring Systems. Modification of web page scores to satisfy requirements, e.g., one page should have a higher page score than another, a home page should have higher score than any other pages in the same site, using modifications of the forcing function, and the link connectivity matrix respectively of the PageRank equation will be studied. By clustering web pages either by ranks or by scores will help overcome issues of scale and complexity which are required for the live world wide web. Outcomes will provide a rational basis together with practical methods for modifying web page scores by a web site administrator.Read moreRead less
Linkage Infrastructure, Equipment And Facilities - Grant ID: LE0561231
Funder
Australian Research Council
Funding Amount
$671,715.00
Summary
MRI GRID Computing Facility: Design, Optimisation and Image Processing. The MRI Grid Computing Facility provides the IT infrastructure to achieve effective e-research in the area of magnetic resonance (MR) imaging, a field of neuroscience research that revolutionizes the way brain diseases are identified and treated. The facility consists of a dedicated high performance grid compute engine, distributed visualisation workstations, and distributed data warehouse facilities. Software tools acc ....MRI GRID Computing Facility: Design, Optimisation and Image Processing. The MRI Grid Computing Facility provides the IT infrastructure to achieve effective e-research in the area of magnetic resonance (MR) imaging, a field of neuroscience research that revolutionizes the way brain diseases are identified and treated. The facility consists of a dedicated high performance grid compute engine, distributed visualisation workstations, and distributed data warehouse facilities. Software tools accessible through the Internet will enable researchers to archive, retrieve and exchange data and software; access distributed MR image databases and the latest MR image analysis tools; schedule analysis tasks on the grid compute engine, the outcomes of which will be visualized by the visualization workstations.Read moreRead less
eResearch in the Neurosciences: Building collaborations in Asia. The proposed Australasian collaboration on eResearch in Neuroscience will promote and maintain the good health of Australians by 'improving critical mass through collaboration and information sharing' through increased access to advanced imaging technology in Korea and analysis techniques in Japan. The collaboration will also promote frontier technologies for building and transforming Australian industries by developing a creative ....eResearch in the Neurosciences: Building collaborations in Asia. The proposed Australasian collaboration on eResearch in Neuroscience will promote and maintain the good health of Australians by 'improving critical mass through collaboration and information sharing' through increased access to advanced imaging technology in Korea and analysis techniques in Japan. The collaboration will also promote frontier technologies for building and transforming Australian industries by developing a creative and innovative research environment and enhancing Australian scientists' participation in breakthrough science. Great national benefit can be derived from international research collaboration, due to the contribution frontier technology can make to science and health. Read moreRead less
Mining Distributed, High-Speed, Time-Variant Data Streams. With the high-speed and large volume of data generation, the data mining research community is facing an unprecedented challenge to provide instant data mining outcomes for prompt usage. Getting access to derived information from multiple, dynamically changing data is vital for many business, science and security services. Extended networks of sensors and other devices assist many environments with data collection that should be correlat ....Mining Distributed, High-Speed, Time-Variant Data Streams. With the high-speed and large volume of data generation, the data mining research community is facing an unprecedented challenge to provide instant data mining outcomes for prompt usage. Getting access to derived information from multiple, dynamically changing data is vital for many business, science and security services. Extended networks of sensors and other devices assist many environments with data collection that should be correlated and processed towards discovery of dependencies, regularities and patterns. Data mining tools, especially of this new generation, are capable of dealing with data streams, and they offer great benefits for users from many industry sectors; defence, health management, security, commerce and science.Read moreRead less