Development of a three dimensional audio-visual next generation speech recognition system. To overcome the disadvantages of current Audio-Visual Speech Recognition Systems, we propose a set of robust algorithms in three dimensional computer vision and speech processing. The proposed system will have far-reaching implications in various areas, for example, human-machine interaction for speech recognition in automated dialog systems and voice-to-text conversions.
Linkage Infrastructure, Equipment And Facilities - Grant ID: LE100100235
Funder
Australian Research Council
Funding Amount
$280,000.00
Summary
Accelerating Australia's large scale video surveillance research programmes. The research to be conducted using this infrastructure will bring immense benefits to Australia in terms of increased levels of public safety and in the protection of critical facilities from terrorism and other crimes, by developing better surveillance systems. This will provide both increases in measurable research outputs and opportunities for Australian business to commercialise these systems. The infrastructure wil ....Accelerating Australia's large scale video surveillance research programmes. The research to be conducted using this infrastructure will bring immense benefits to Australia in terms of increased levels of public safety and in the protection of critical facilities from terrorism and other crimes, by developing better surveillance systems. This will provide both increases in measurable research outputs and opportunities for Australian business to commercialise these systems. The infrastructure will accelerate the pace of surveillance research and development in Australia, enhancing the competitiveness of both Australia's researchers and the businesses that will commercialise these researchers' discoveries.Read moreRead less
Detecting, Locating and Tracking Human Faces using Skin Colour. With growing concerns for national security and public safety, government agencies in Australia and around the world are taking strong measures to introduce biometric-enhanced official identification documents such as passports, visas, and ID cards. The proposed face detection and tracking system will play a key role in personal identification and human activity monitoring. The developed system will have a huge potential in surveill ....Detecting, Locating and Tracking Human Faces using Skin Colour. With growing concerns for national security and public safety, government agencies in Australia and around the world are taking strong measures to introduce biometric-enhanced official identification documents such as passports, visas, and ID cards. The proposed face detection and tracking system will play a key role in personal identification and human activity monitoring. The developed system will have a huge potential in surveillance, security, law enforcement, and ICT. This project will contribute to building a knowledge economy in Australia and help safeguard and protect Australia from terrorism and crime. Furthermore, its outcomes will enhance the reputation of Australia as a leader in frontier technologies and smart information use.Read moreRead less
Visual Solutions for Automated Translation Between Spoken and Signed Languages. We propose to build a robust visual speech recognition system that analyzes images of spoken language and achieves a recognition of the utterances with at least human expert recognition rates. This visual speech recognition system will then be integrated with our existing gesture recognition system to improve performance, just as humans combine visual and audio data for language understanding. The result will be a sy ....Visual Solutions for Automated Translation Between Spoken and Signed Languages. We propose to build a robust visual speech recognition system that analyzes images of spoken language and achieves a recognition of the utterances with at least human expert recognition rates. This visual speech recognition system will then be integrated with our existing gesture recognition system to improve performance, just as humans combine visual and audio data for language understanding. The result will be a system providing translation between English and the Australian sign language Auslan in a practical application domain. Significantly, our work will provide insights into the cognitive models of neural activity linking language and gesture.Read moreRead less
Taming media for the masses: Computational frameworks for intelligent digital media capture, management, and sharing. The core issues tackled in this project are learning, recognition and application of semantics in multimedia data and the context of its creation and use - a foundational issue in pattern recognition with many applications. The project is part of the Institute for Multi-sensor Processing and Content Analysis whose aim is to tackle technical issues in large scale pattern recogniti ....Taming media for the masses: Computational frameworks for intelligent digital media capture, management, and sharing. The core issues tackled in this project are learning, recognition and application of semantics in multimedia data and the context of its creation and use - a foundational issue in pattern recognition with many applications. The project is part of the Institute for Multi-sensor Processing and Content Analysis whose aim is to tackle technical issues in large scale pattern recognition. By developing scalable and robust techniques to extract information from large scale multi-modal data, the applications include large scale surveillance systems from multi-modal data (e.g. airport security, smart homes for the aged), context-aware devices, and the next generation of media creation and repurposing tools - a fast-growing sector of the economy.Read moreRead less
Integration of Spatiotemporal Video Data for Realtime Smart Proactive Surveillance. This project will have a great impact on the national security by helping the law enforcement agencies to stop crime before it happens. It will automatically detect and tag criminal activities in surveillance videos. It will detect, authenticate, track and profile individuals in sensitive installations. At airports, it will match faces to electronic images embedded in passports. The system will use existing surve ....Integration of Spatiotemporal Video Data for Realtime Smart Proactive Surveillance. This project will have a great impact on the national security by helping the law enforcement agencies to stop crime before it happens. It will automatically detect and tag criminal activities in surveillance videos. It will detect, authenticate, track and profile individuals in sensitive installations. At airports, it will match faces to electronic images embedded in passports. The system will use existing surveillance infrastructures for locating lost people and will also ensure privacy protection of public. On the commercial side, this project can recognize old customers for better and customized services. It can count the number of people present in each floor of a building for rescue operations and for designing future buildings.Read moreRead less
Bridging the semantic gap for building effective content management systems: Computational media aesthetics. This project focuses on video abstraction and aims to bridge the semantic gap between the simplicity of available visual features and the richness of user descriptions. We examine how visual and aural techniques are brought together to influence the engagement of audience in a story portrayal. The major outcome will be a computational framework for extracting the semantics associated wi ....Bridging the semantic gap for building effective content management systems: Computational media aesthetics. This project focuses on video abstraction and aims to bridge the semantic gap between the simplicity of available visual features and the richness of user descriptions. We examine how visual and aural techniques are brought together to influence the engagement of audience in a story portrayal. The major outcome will be a computational framework for extracting the semantics associated with audiovisual elements in television/film, and scalable software tools that can rapidly and consistently analyse media along various aesthetic dimensions. It will allow for high-level annotation of media and the building of more effective content management systems with enhanced user querying capabilities.Read moreRead less
Automated Determination of the Pose of a Human from Visual Information - Markerless 3D Pose Recovery of Humans from Videos. The development of 3D human pose recovery has been sought by computer vision researchers for many years. Our results will, firstly, have benefit for Australia's standing in the international computer vision community. Over time, the research outcomes will be developed into a software product for rehabilitation analysis by recognizing discrepancies between the walking pat ....Automated Determination of the Pose of a Human from Visual Information - Markerless 3D Pose Recovery of Humans from Videos. The development of 3D human pose recovery has been sought by computer vision researchers for many years. Our results will, firstly, have benefit for Australia's standing in the international computer vision community. Over time, the research outcomes will be developed into a software product for rehabilitation analysis by recognizing discrepancies between the walking patterns of healthy individuals and those with abnormalities as a result of accidents or diseases. The Australian economy will benefit by the reduction in the lifetime cost of injuries. This software will also provide benefits to the movie animation, computer games industry, and the training of athletes.Read moreRead less
Space-based space surveillance with robust computer vision algorithms. Space-based space surveillance with robust computer vision algorithms. This project aims to develop computer vision algorithms to detect man-made objects in space. These algorithms function on nanosatellite platforms, enabling space-based space surveillance. This technology is expected to provide always-on monitoring of the Earth's orbit to enhance existing defence infrastructure and protect vital space assets, including comm ....Space-based space surveillance with robust computer vision algorithms. Space-based space surveillance with robust computer vision algorithms. This project aims to develop computer vision algorithms to detect man-made objects in space. These algorithms function on nanosatellite platforms, enabling space-based space surveillance. This technology is expected to provide always-on monitoring of the Earth's orbit to enhance existing defence infrastructure and protect vital space assets, including communications and navigational satellites, in Earth’s orbit from collisions and covert sabotage. Increased space use by government and civilian agencies opens up opportunities for the space industry. This project is expected to develop Australia’s space surveillance capabilities, protect space assets and capture a growing market.Read moreRead less
An automated 3D model-based object recognition system. A novel, practical 3D vision system is proposed as a platform for fundamental applied research in 3D data acquisition, object modelling and object recognition. The significance of the vision system lies in the advancement of knowledge in three key areas of computer vision, registration, recognition and error propagation. The result is a system capable of sensing, modelling and identifying arbitrarily shaped free-form objects in a scene, an a ....An automated 3D model-based object recognition system. A novel, practical 3D vision system is proposed as a platform for fundamental applied research in 3D data acquisition, object modelling and object recognition. The significance of the vision system lies in the advancement of knowledge in three key areas of computer vision, registration, recognition and error propagation. The result is a system capable of sensing, modelling and identifying arbitrarily shaped free-form objects in a scene, an attribute lacking in current systems. Such a system can provide substantial economic benefits to industrial procedures such as grasp planning and quality control.Read moreRead less