Robust speech recognition in realistic hostile environments. Australia leads the world in the adoption of speech recognition technology but sadly lags in the development of the fundamental advances in the area. This research will help propel Australia to the forefront of new innovations in speech recognition technology and contributions to fundamental science. Our project will provide an excellent training ground for graduate students and researchers, with the real possibility of significant com ....Robust speech recognition in realistic hostile environments. Australia leads the world in the adoption of speech recognition technology but sadly lags in the development of the fundamental advances in the area. This research will help propel Australia to the forefront of new innovations in speech recognition technology and contributions to fundamental science. Our project will provide an excellent training ground for graduate students and researchers, with the real possibility of significant commercial benefit to the nation. The deployment of our system in the community will greatly enhance the defence and police forces ability for surveillance and security, and will provide new assistive aids to improve the quality of life and safety for the elderly and disabled.Read moreRead less
Enhanced Multilingual Speaker Recognition through the Incorporation of High-Level Features, Late Fusion and Discriminative Classification Methods. The development of robust multilingual speaker recognition systems will benefit the community through the elimination of fraud incurred by financial institutions and customers by enabling several person authentication applications such as: voice based signatures and document issuance; credit card verification by voice and secure over-the-phone financi ....Enhanced Multilingual Speaker Recognition through the Incorporation of High-Level Features, Late Fusion and Discriminative Classification Methods. The development of robust multilingual speaker recognition systems will benefit the community through the elimination of fraud incurred by financial institutions and customers by enabling several person authentication applications such as: voice based signatures and document issuance; credit card verification by voice and secure over-the-phone financial transactions. The technology will also assist in the protection of the community and safeguard Australia by enabling the implementation of the following: suspect identification using voice print; national security measures for combating terrorism by using voice to locate and track terrorists; preemptive criminal activity counter-measures; surveillance and secure building access by voice.Read moreRead less
Robust speaker recognition with reduced utterance duration and intersession variability. The development of robust and accurate speaker recognition systems will enable secure person authentication in over-the-phone financial transactions and benefit the community through the elimination of identity fraud incurred by customers and financial institutions. The technology will also assist in safeguarding Australia by enabling the implementation of suspect identification using voice and security meas ....Robust speaker recognition with reduced utterance duration and intersession variability. The development of robust and accurate speaker recognition systems will enable secure person authentication in over-the-phone financial transactions and benefit the community through the elimination of identity fraud incurred by customers and financial institutions. The technology will also assist in safeguarding Australia by enabling the implementation of suspect identification using voice and security measures for combating terrorism by using voice to locate and track terrorists. Our research at QUT Speech Research Lab is at the forefront of development in this field and will provide Australia with a technological advantage in the rapidly evolving global market for speaker recognition technology for person authentication applications.Read moreRead less
Robust Automatic Speaker Diarisation of Audio Documents by Exploiting Prior Sources of Information. Speaker Diarisation, the task of determining who spoke when, is a technology fundamental in deriving intelligent information from audio and multimedia resources. The requirement for efficient and accurate Speaker Diarisation systems, portable across different domains is heightened by the explosive growth of audio and multimedia archives online and throughout the world. This research will provide t ....Robust Automatic Speaker Diarisation of Audio Documents by Exploiting Prior Sources of Information. Speaker Diarisation, the task of determining who spoke when, is a technology fundamental in deriving intelligent information from audio and multimedia resources. The requirement for efficient and accurate Speaker Diarisation systems, portable across different domains is heightened by the explosive growth of audio and multimedia archives online and throughout the world. This research will provide the foundation for a commercial service of automatic Speaker Diarisation to be developed, growing Australia's impact on the information and communications technology (ICT) sector. The outcome of this research will also assist in the tracking of terrorist and unlawful activity by enabling effective intelligence gathering from different audio sources.Read moreRead less
Visual Solutions for Automated Translation Between Spoken and Signed Languages. We propose to build a robust visual speech recognition system that analyzes images of spoken language and achieves a recognition of the utterances with at least human expert recognition rates. This visual speech recognition system will then be integrated with our existing gesture recognition system to improve performance, just as humans combine visual and audio data for language understanding. The result will be a sy ....Visual Solutions for Automated Translation Between Spoken and Signed Languages. We propose to build a robust visual speech recognition system that analyzes images of spoken language and achieves a recognition of the utterances with at least human expert recognition rates. This visual speech recognition system will then be integrated with our existing gesture recognition system to improve performance, just as humans combine visual and audio data for language understanding. The result will be a system providing translation between English and the Australian sign language Auslan in a practical application domain. Significantly, our work will provide insights into the cognitive models of neural activity linking language and gesture.Read moreRead less
In the normal process of hearing, the brain actively selects sounds of interest from competing background sounds. This normal auditory function is indispensible for children and adults to cope in non-optimal listening environments, however the mechanisms by which such performance is achieved are poorly understood. This project will investigate the nerve circuits that enable this to occur and will also investigate how these circuits malfunction in various types of partial deafness. The results wi ....In the normal process of hearing, the brain actively selects sounds of interest from competing background sounds. This normal auditory function is indispensible for children and adults to cope in non-optimal listening environments, however the mechanisms by which such performance is achieved are poorly understood. This project will investigate the nerve circuits that enable this to occur and will also investigate how these circuits malfunction in various types of partial deafness. The results will improve our understanding of how we detect sounds and the impact of hearing pathologies on this process.Read moreRead less
Development Of The Listening In Spatialized Noise - Tonal Test (or LiSN-T)
Funder
National Health and Medical Research Council
Funding Amount
$227,136.00
Summary
In this project a novel listening test software will be developed for diagnosing spatial processing disorder in children. These children often have difficulties in understanding teachers in classrooms, which can significantly impact their ability to learn. The developed software will be specifically designed for diagnosing 5-year old children, before they enter primary school, and in contrast to existing tests will be independent of their language background.
Progressive Transmission of Street Directory Assistance and Business Pages over 3G and 4G mobile networks. Multimedia on-demand and live services over 3G and 4G mobiles will be enhanced. New methods for low volume, high information transfer multimedia transactions will be developed. This will create new jobs in the Information and Communication Technologies (ICT) sector. Progressive transmission of street directory assistance and business pages information to mobile handsets will enable citize ....Progressive Transmission of Street Directory Assistance and Business Pages over 3G and 4G mobile networks. Multimedia on-demand and live services over 3G and 4G mobiles will be enhanced. New methods for low volume, high information transfer multimedia transactions will be developed. This will create new jobs in the Information and Communication Technologies (ICT) sector. Progressive transmission of street directory assistance and business pages information to mobile handsets will enable citizens to make efficient use of their time and improve productivity. The 3G and 4G cellular telephone network, extended with 'mobile' base stations and satellite links, are especially attractive to a large country like Australia. Interactive information retrieval will become more universal and not limited through wired Internet connections.
Read moreRead less