Data Mining by Clustering in Very Large Relational Databases. Many commercial and governmental entities possess very large relational data that cannot be feasibly analyzed by today's computers, e.g., gene expression data, product usage databases and telecommunication call records. The clustering tools developed in this project will have a significant benefit on many business processes that involve clustering this type of data, such as fraud detection and market segmentation.
Handling unreliable, uncertain and inadequate data for Intelligence led Investigation. Intelligence led investigation has been successful recently in drug and people smuggling, preparation or instigation of acts of terrorism, and can benefit profoundly from the techniques we will develop, in the timely management and inference from many sources and kinds of uncertain information. This work will assist in making Australia a safer and more secure country.
E.g., Australian Bureau of Statistics ....Handling unreliable, uncertain and inadequate data for Intelligence led Investigation. Intelligence led investigation has been successful recently in drug and people smuggling, preparation or instigation of acts of terrorism, and can benefit profoundly from the techniques we will develop, in the timely management and inference from many sources and kinds of uncertain information. This work will assist in making Australia a safer and more secure country.
E.g., Australian Bureau of Statistics figures show that for 2004, investigations of some 35% of murders, 63% of kidnappings, and 80% of robberies are incomplete at 30 days. Terrorism investigations are harder in that usually there is no initial crime trigger for an investigation. Any assistance our tools can provide in will be of significant benefit to Australia.Read moreRead less
Extensions to the page scoring algorithm in internet search engine studies. This project proposes to study two extensions to the Page rank equation which is one of the theoretical underpinning of Google's web page scoring engine. In particular, we wish to explore ways to combine page connectivity and page characteristics in the scoring of web pages. This will be the first time a rational way is proposed for combining these two factors. The expected outcome will be a deeper understanding on how t ....Extensions to the page scoring algorithm in internet search engine studies. This project proposes to study two extensions to the Page rank equation which is one of the theoretical underpinning of Google's web page scoring engine. In particular, we wish to explore ways to combine page connectivity and page characteristics in the scoring of web pages. This will be the first time a rational way is proposed for combining these two factors. The expected outcome will be a deeper understanding on how these two factors affect the scores of a web page in a search engine, and hence how they affect the visibility of the page in response to a query.
Read moreRead less
Data structures which change with time, a machine learning approach. Visibility of web pages, based on page importance, on the Internet controls their accessibility by users which is critical for e-Commerce applications. The page importance depends on its contents and its link structure to other web pages, both of which can be time varying. This project proposes a novel model in which time varying aspects of the changes to contents and their link structures are captured, thus allowing us a bette ....Data structures which change with time, a machine learning approach. Visibility of web pages, based on page importance, on the Internet controls their accessibility by users which is critical for e-Commerce applications. The page importance depends on its contents and its link structure to other web pages, both of which can be time varying. This project proposes a novel model in which time varying aspects of the changes to contents and their link structures are captured, thus allowing us a better understanding of how these influence the page importance over time. It will also allow us insight on how to improve the visibility of web pages.Read moreRead less
Investigations in Learning Algorithms for Web Page Scoring Systems. Modification of web page scores to satisfy requirements, e.g., one page should have a higher page score than another, a home page should have higher score than any other pages in the same site, using modifications of the forcing function, and the link connectivity matrix respectively of the PageRank equation will be studied. By clustering web pages either by ranks or by scores will help overcome issues of scale and complexity wh ....Investigations in Learning Algorithms for Web Page Scoring Systems. Modification of web page scores to satisfy requirements, e.g., one page should have a higher page score than another, a home page should have higher score than any other pages in the same site, using modifications of the forcing function, and the link connectivity matrix respectively of the PageRank equation will be studied. By clustering web pages either by ranks or by scores will help overcome issues of scale and complexity which are required for the live world wide web. Outcomes will provide a rational basis together with practical methods for modifying web page scores by a web site administrator.Read moreRead less
Uncertain Information Processing for Situation Awareness and Dynamic Decision-Making in Emergency Management. The Australian national counter-terrorism committee indicates that Australia should have a strong intelligence-led prevention and preparedness to support Australia on risk management, emergency services and maintaining capabilities to manage various types of terrorist attacks. The proposed situation awareness support technique can be used to develop situation analysis software systems or ....Uncertain Information Processing for Situation Awareness and Dynamic Decision-Making in Emergency Management. The Australian national counter-terrorism committee indicates that Australia should have a strong intelligence-led prevention and preparedness to support Australia on risk management, emergency services and maintaining capabilities to manage various types of terrorist attacks. The proposed situation awareness support technique can be used to develop situation analysis software systems or directly support Australia government agencies and industries to correctly assess a situation, increase awareness for crisis problems, and therefore improve emergency management and decision-making effectiveness, in particular, for avoiding disaster problems in the first place and preparing plans for those that undoubtedly will occur. Read moreRead less
Efficient data manipulation in document classification. Document Classification has an enormous relevance in an era where large amounts of textual information is available. Document Classification is based on statistical and machine learning techniques that model documents represented as points in a multidimensional space. The Computer Engineering Laboratory (CEL) has ongoing projects using neural networks and other techniques for document classification. We are developing a development environm ....Efficient data manipulation in document classification. Document Classification has an enormous relevance in an era where large amounts of textual information is available. Document Classification is based on statistical and machine learning techniques that model documents represented as points in a multidimensional space. The Computer Engineering Laboratory (CEL) has ongoing projects using neural networks and other techniques for document classification. We are developing a development environment for large classification tasks, and Prof. Lee¡¯s work will focus in managing large amounts of data for them. Using his experience in data compression, databases and web applications, he will produce a set of tools for handling Gigabytes of textual data in our classification environment.Read moreRead less
Concept-Based Multilingual Web Content Mining. Towards smart use of Web information, this pioneer project will develop an innovative concept-based approach for discovering global knowledge embedded within multilingual Web documents. Departing from the traditional bilingual term-to-term machine translation techniques, the approach overcomes the notorious vocabulary mismatch problem by enabling synchronised lexical mapping of multiple languages. A series of intelligent concept-based techniques usi ....Concept-Based Multilingual Web Content Mining. Towards smart use of Web information, this pioneer project will develop an innovative concept-based approach for discovering global knowledge embedded within multilingual Web documents. Departing from the traditional bilingual term-to-term machine translation techniques, the approach overcomes the notorious vocabulary mismatch problem by enabling synchronised lexical mapping of multiple languages. A series of intelligent concept-based techniques using fuzzy logic and neural networks will be investigated to support smart Web information browsing and exploration. This project will provide valuable new insights into developing state-of-the-art multilingual Web mining applications for enhancing business intelligence in Australia's knowledge driven industries.Read moreRead less
Investigations into Distributed Information Processing of the World Wide Web: Addressing Major Bottlenecks in Search Engine Design. The Internet is a global medium used increasingly for commercial purposes. Nationally provided commercial services and products, as well as general types of information are made available globally via the Internet. Web search engines are the only method by which a common user can find a relevant service or information on the Internet. The sheer size and the dynamics ....Investigations into Distributed Information Processing of the World Wide Web: Addressing Major Bottlenecks in Search Engine Design. The Internet is a global medium used increasingly for commercial purposes. Nationally provided commercial services and products, as well as general types of information are made available globally via the Internet. Web search engines are the only method by which a common user can find a relevant service or information on the Internet. The sheer size and the dynamics of the Internet pose a significant challenge to search engines. This project proposes to address some major bottlenecks in search engine design (viz. the page rank computation). This may help future search engines to maintain a good level of Web penetration and, consequently will help to ensure a suitable coverage of nationally available services and information to the world.
Read moreRead less
A Comprehensive Platform for Dynamic Decision Support in Warning Systems through Better Management of Uncertain Information. Public and individual warning systems are installed widely in Australia for emergency situations such as fire, terrorist attack, tsunami, and financial risk. The developed platform with its uncertain information management and dynamic decision support software will directly assist Australian government agencies, industries, and professional officers responsible for public ....A Comprehensive Platform for Dynamic Decision Support in Warning Systems through Better Management of Uncertain Information. Public and individual warning systems are installed widely in Australia for emergency situations such as fire, terrorist attack, tsunami, and financial risk. The developed platform with its uncertain information management and dynamic decision support software will directly assist Australian government agencies, industries, and professional officers responsible for public warning systems by improving the reliability of generated warnings, effective design of any new warning system, and accurate decision making in responding threats. It will also greatly contribute to the processing of uncertain information in other organizational information systems, and enhance training exercise facilities in complex emergency environments.Read moreRead less