Robust speech recognition in realistic hostile environments. Australia leads the world in the adoption of speech recognition technology but sadly lags in the development of the fundamental advances in the area. This research will help propel Australia to the forefront of new innovations in speech recognition technology and contributions to fundamental science. Our project will provide an excellent training ground for graduate students and researchers, with the real possibility of significant com ....Robust speech recognition in realistic hostile environments. Australia leads the world in the adoption of speech recognition technology but sadly lags in the development of the fundamental advances in the area. This research will help propel Australia to the forefront of new innovations in speech recognition technology and contributions to fundamental science. Our project will provide an excellent training ground for graduate students and researchers, with the real possibility of significant commercial benefit to the nation. The deployment of our system in the community will greatly enhance the defence and police forces ability for surveillance and security, and will provide new assistive aids to improve the quality of life and safety for the elderly and disabled.Read moreRead less
Enhanced Multilingual Speaker Recognition through the Incorporation of High-Level Features, Late Fusion and Discriminative Classification Methods. The development of robust multilingual speaker recognition systems will benefit the community through the elimination of fraud incurred by financial institutions and customers by enabling several person authentication applications such as: voice based signatures and document issuance; credit card verification by voice and secure over-the-phone financi ....Enhanced Multilingual Speaker Recognition through the Incorporation of High-Level Features, Late Fusion and Discriminative Classification Methods. The development of robust multilingual speaker recognition systems will benefit the community through the elimination of fraud incurred by financial institutions and customers by enabling several person authentication applications such as: voice based signatures and document issuance; credit card verification by voice and secure over-the-phone financial transactions. The technology will also assist in the protection of the community and safeguard Australia by enabling the implementation of the following: suspect identification using voice print; national security measures for combating terrorism by using voice to locate and track terrorists; preemptive criminal activity counter-measures; surveillance and secure building access by voice.Read moreRead less
Robust speaker recognition with reduced utterance duration and intersession variability. The development of robust and accurate speaker recognition systems will enable secure person authentication in over-the-phone financial transactions and benefit the community through the elimination of identity fraud incurred by customers and financial institutions. The technology will also assist in safeguarding Australia by enabling the implementation of suspect identification using voice and security meas ....Robust speaker recognition with reduced utterance duration and intersession variability. The development of robust and accurate speaker recognition systems will enable secure person authentication in over-the-phone financial transactions and benefit the community through the elimination of identity fraud incurred by customers and financial institutions. The technology will also assist in safeguarding Australia by enabling the implementation of suspect identification using voice and security measures for combating terrorism by using voice to locate and track terrorists. Our research at QUT Speech Research Lab is at the forefront of development in this field and will provide Australia with a technological advantage in the rapidly evolving global market for speaker recognition technology for person authentication applications.Read moreRead less
Robust Automatic Speaker Diarisation of Audio Documents by Exploiting Prior Sources of Information. Speaker Diarisation, the task of determining who spoke when, is a technology fundamental in deriving intelligent information from audio and multimedia resources. The requirement for efficient and accurate Speaker Diarisation systems, portable across different domains is heightened by the explosive growth of audio and multimedia archives online and throughout the world. This research will provide t ....Robust Automatic Speaker Diarisation of Audio Documents by Exploiting Prior Sources of Information. Speaker Diarisation, the task of determining who spoke when, is a technology fundamental in deriving intelligent information from audio and multimedia resources. The requirement for efficient and accurate Speaker Diarisation systems, portable across different domains is heightened by the explosive growth of audio and multimedia archives online and throughout the world. This research will provide the foundation for a commercial service of automatic Speaker Diarisation to be developed, growing Australia's impact on the information and communications technology (ICT) sector. The outcome of this research will also assist in the tracking of terrorist and unlawful activity by enabling effective intelligence gathering from different audio sources.Read moreRead less
Audio Visual Speech Recognition. Even though significant advances have been made in automatic speech recognition using acoustic information, the recognition accuracies are still poor in noisy and hostile environments such as in crowds, traffic, factory floors etc. In many of these applications visual information is or can easily be made available in addition to the audio. The aim of this project is to achieve an order of magnitude improvement in speech recognition accuracies in adverse environme ....Audio Visual Speech Recognition. Even though significant advances have been made in automatic speech recognition using acoustic information, the recognition accuracies are still poor in noisy and hostile environments such as in crowds, traffic, factory floors etc. In many of these applications visual information is or can easily be made available in addition to the audio. The aim of this project is to achieve an order of magnitude improvement in speech recognition accuracies in adverse environments by joint processing and modelling of the acoustic modality with visual information in the form of lip shapes and movements. The outcomes will be useful in human computer interaction in adverse environments as well as in the transcription and mining of multimedia data.
Read moreRead less
Automatic speech-based assessment of mental state via mobile device. This project aims to create the first mobile, device-based automatic assessment of mental state from acoustic speech. Focusing on novel approaches for eliciting speech, for regression-based scoring of mental state and for longitudinal modelling of speech, the project takes speech processing out of the laboratory and into realistic environments. The project is significant because elicitation approach and longitudinal modelling h ....Automatic speech-based assessment of mental state via mobile device. This project aims to create the first mobile, device-based automatic assessment of mental state from acoustic speech. Focusing on novel approaches for eliciting speech, for regression-based scoring of mental state and for longitudinal modelling of speech, the project takes speech processing out of the laboratory and into realistic environments. The project is significant because elicitation approach and longitudinal modelling have been acknowledged by the research community as challenges that are valuable to investigate, and because conventional regression methods are sub-optimal on ordinal mental state scales. This is significant commercially because mobile devices allow individually tailored, frequent and low-cost mental state assessment. Expected outcomes will include commercial-ready technology, trialled on Australians, accessible to everyone with a mobile device and concentration of Australian research and development capability in a rapidly growing application area.Read moreRead less
High-performance computational data-mining techniques for feature detection in complex time series from large-scale, networked plasma experiments. Terabytes of data are gathered from large experimental facilities as complex time-series. Analysis of these data is daunting, especially when they involve high-dimensional spectral or image arrays. We will develop high-performance computational techniques for dimension reduction, efficient data-mining, and experimental control, using as an initial ta ....High-performance computational data-mining techniques for feature detection in complex time series from large-scale, networked plasma experiments. Terabytes of data are gathered from large experimental facilities as complex time-series. Analysis of these data is daunting, especially when they involve high-dimensional spectral or image arrays. We will develop high-performance computational techniques for dimension reduction, efficient data-mining, and experimental control, using as an initial target the H-1NF plasma fusion MNRF at the ANU and its >100 GB/year data stream. The techniques will immediately provide Australian researchers with unique tools for collaboration in international research to develop fusion as a low-emissions source of electricity, and will be applicable to complex time-series analysis in other areas of science, medicine, and defence.Read moreRead less
Dynamic Scheduling and Stochastic Control for Sensor Networks. Sensor networks are rapidly becoming important in applications from environmental monitoring, navigation to border surveillance. However, due to bandwidth constraints, even very simple networks have proven to be very complex to properly control. It is now necessary to efficiently allocate the 'limited available bandwidth' to sensors in order to share the most valuable data over the network. Therefore, this project proposes new techn ....Dynamic Scheduling and Stochastic Control for Sensor Networks. Sensor networks are rapidly becoming important in applications from environmental monitoring, navigation to border surveillance. However, due to bandwidth constraints, even very simple networks have proven to be very complex to properly control. It is now necessary to efficiently allocate the 'limited available bandwidth' to sensors in order to share the most valuable data over the network. Therefore, this project proposes new techniques using concepts of dynamic sensor scheduling and stochastic control to provide computationally feasible and near optimal solutions to the limited and varying bandwidth problem. This work will greatly enhance the operational performance of distributed sensor networks.Read moreRead less
Advanced Processing for Through-the-Wall Radar Imaging. There is currently considerable interest around the world in imaging systems that can 'see' through-the-walls. This project is at the cutting edge of through-the-wall radar imaging research and technology. It will benefit both the defence and security industries, and position Australia among the leading countries in this technology, as only a handful of research teams around the world have the necessary background and expertise in this area ....Advanced Processing for Through-the-Wall Radar Imaging. There is currently considerable interest around the world in imaging systems that can 'see' through-the-walls. This project is at the cutting edge of through-the-wall radar imaging research and technology. It will benefit both the defence and security industries, and position Australia among the leading countries in this technology, as only a handful of research teams around the world have the necessary background and expertise in this area. The research outcomes will enhance the capability of the military, law enforcement, counter-terrorism, and search and rescue personnel. It will play a vital role in protecting and safeguarding Australia from terrorism and crime.Read moreRead less
The next generation speaker recognition system. The next generation of speaker recognition technologies developed through this project will enable secure person authentication by voice in financial transactions and benefit the community through the elimination of identity fraud. This project will safeguard Australia by identifying criminal suspects using their voice and combat terrorism by using voice to locate and track terrorists.