Robust speech recognition in realistic hostile environments. Australia leads the world in the adoption of speech recognition technology but sadly lags in the development of fundamental advances in the area. This research will help propel Australia to the forefront of new innovations in speech recognition technology and contributions to fundamental science. Our project will provide an excellent training ground for graduate students and researchers, with the real possibility of significant commercial benefit to the nation. The deployment of our system in the community will greatly enhance the defence and police forces' ability for surveillance and security, and will provide new assistive aids to improve the quality of life and safety for the elderly and disabled.
Enhanced Multilingual Speaker Recognition through the Incorporation of High-Level Features, Late Fusion and Discriminative Classification Methods. The development of robust multilingual speaker recognition systems will benefit the community through the elimination of fraud incurred by financial institutions and customers by enabling several person authentication applications, such as voice-based signatures and document issuance; credit card verification by voice; and secure over-the-phone financial transactions. The technology will also assist in the protection of the community and safeguard Australia by enabling the implementation of the following: suspect identification using voice print; national security measures for combating terrorism by using voice to locate and track terrorists; preemptive criminal activity counter-measures; surveillance and secure building access by voice.
Robust speaker recognition with reduced utterance duration and intersession variability. The development of robust and accurate speaker recognition systems will enable secure person authentication in over-the-phone financial transactions and benefit the community through the elimination of identity fraud incurred by customers and financial institutions. The technology will also assist in safeguarding Australia by enabling the implementation of suspect identification using voice and security measures for combating terrorism by using voice to locate and track terrorists. Our research at QUT Speech Research Lab is at the forefront of development in this field and will provide Australia with a technological advantage in the rapidly evolving global market for speaker recognition technology for person authentication applications.
Speech recognition adaptation for low resource populations. Automatic speech recognition is an essential attribute of mobile devices and consumer electronics. Unfortunately, as these systems are trained with adult speech, they perform poorly when used by children and people with speaking difficulties. The lack of available training speech from these groups makes developing models for them difficult. We will investigate efficient model adaptation methods that use minimal training data to adapt existing adult speech recognition models for use with children and people with speaking difficulties. The intended outcomes will improve access to automatic speech recognition systems for Australians whose communication with speech-controlled environmental and educational devices is currently restricted.
Neural Activity Shaping for Retinal and Cochlear Implants. This project aims to develop methods to control and optimise the spatial patterns of neural activity evoked by neural prostheses in order to improve the resolution of neuroprostheses. A major problem for neural prostheses is that the electrical current used to stimulate neurons causes a diffuse spread of activity in the neural tissue, which limits the resolution of the device. For patients this translates into limitations in sound quality, in the case of cochlear implants, or visual acuity, for retinal implants. The outcome of the project will be algorithms that optimally choose the currents on each electrode so as to shape neural activity at the finer resolution of electrode spacing rather than the coarser resolution of current spread.
A lossy compression paradigm for sensory neural coding. By applying new interdisciplinary theoretical results, this research aims to enhance our understanding of how the ear turns sounds into electrical signals in the presence of high levels of random noise. Socio-economic benefits to Australia include: (i) contributions to the knowledge base of theoretical neuroscience and communications systems, enhancing Australia's reputation for cutting-edge research; (ii) strengthening of European international collaborations; (iii) outcomes that will ultimately impact on improved designs for bionic ears and future biomedical prosthetics; and (iv) commercialisation and technology transfer opportunities, via the transfer of results to wireless artificial sensor networks.
Fixed and variable-length segment vocoders for very low bitrate speech coding. Reliable and secure voice communication is an important aspect of military and defence operations. In order to reduce the possibility of interception, low power transmitters are normally used for radio communications, where the bandwidth is often very low. Military voice communication, therefore, requires the coding of speech at very low bitrates. Our research proposal aims to develop speech coders that can operate at lower bitrates and reproduce speech of high quality and intelligibility. This is highly beneficial to the defence forces of Australia as it will permit the use of high-grade encryption technology to improve the security of transmission.
Building a Talking Head via Dynamic & 3D-Static, and Age- & Ethnically-Varied Databases: Perceptibility and Acceptability. This project will provide cutting-edge, realistic, perceptible talking head animation. Based on rich 3D face motion and static face databases, it will allow the study of the facial structure of specific groups of people, and the creation of a lasting cultural heritage of faces. Information in these databases will be useful for research in high-quality 3D face reconstruction, with applications as wide as multimodal biometric identification, finding lost children, and security systems. The novel methods in this project will also advance auditory-visual speech and emotion research with particular commercial applications in telecommunications, human-machine interfaces, foreign language teaching, humanoid development, animation, and film.
Non-contact Instrumentation for the Home Monitoring of Upper Airway Obstructions in Sleep. Over 800,000 Australians suffer from obstructive sleep apnoea, costing the nation billions of dollars annually. Obstructive sleep apnoea patients use twice the health resources of a normal person, and are 7 times more likely to cause traffic accidents. In NSW alone up to 43,000 accidents per year are due to obstructive sleep apnoea. Obstructive sleep apnoea is treatable and thus consequences such as stroke and heart attacks are preventable. At present over 90% of patients remain undiagnosed. Current diagnosis is expensive and requires hospitalization; no acceptable mass screening device exists. This project proposes an enabling technology for the population screening of obstructive sleep apnoea based on analysing snoring sounds. Outcomes of the project have the potential to revolutionize the diagnosis of obstructive sleep apnoea.
Computational neural modelling of bottom-up information and top-down attention in auditory perception. The aim of this project is to gain a better understanding of the ways in which our auditory cortex functions. This project will make a significant contribution to this important and fundamental aspect of brain science and brain-inspired computation. The outcome will be to build a computational model of the auditory cortex, through simulation of the detailed neuronal responses using spiking neurons. Applications will develop improved processing strategies for automatic speech recognition, hearing aids, bionic ears (cochlear implants), robotics and other machine processing systems.