Enhanced Multilingual Speaker Recognition through the Incorporation of High-Level Features, Late Fusion and Discriminative Classification Methods. The development of robust multilingual speaker recognition systems will benefit the community through the elimination of fraud incurred by financial institutions and customers by enabling several person authentication applications such as: voice-based signatures and document issuance; credit card verification by voice; and secure over-the-phone financial transactions. The technology will also assist in the protection of the community and safeguard Australia by enabling the implementation of the following: suspect identification using voice print; national security measures for combating terrorism by using voice to locate and track terrorists; preemptive criminal activity counter-measures; surveillance and secure building access by voice.
Robust speaker recognition with reduced utterance duration and intersession variability. The development of robust and accurate speaker recognition systems will enable secure person authentication in over-the-phone financial transactions and benefit the community through the elimination of identity fraud incurred by customers and financial institutions. The technology will also assist in safeguarding Australia by enabling the implementation of suspect identification using voice and security measures for combating terrorism by using voice to locate and track terrorists. Our research at QUT Speech Research Lab is at the forefront of development in this field and will provide Australia with a technological advantage in the rapidly evolving global market for speaker recognition technology for person authentication applications.
Robust Automatic Speaker Diarisation of Audio Documents by Exploiting Prior Sources of Information. Speaker Diarisation, the task of determining who spoke when, is a technology fundamental to deriving intelligent information from audio and multimedia resources. The requirement for efficient and accurate Speaker Diarisation systems, portable across different domains, is heightened by the explosive growth of audio and multimedia archives online and throughout the world. This research will provide the foundation for a commercial service of automatic Speaker Diarisation to be developed, growing Australia's impact on the information and communications technology (ICT) sector. The outcome of this research will also assist in the tracking of terrorist and unlawful activity by enabling effective intelligence gathering from different audio sources.
Detection and Quantification of General Fetal Movements from Accelerometer Measurements using Nonstationary Signal Processing Techniques. There are approximately 1,750 fetal deaths per year in Australia, with about one-third occurring late in gestation and without an apparent cause. The development of an automated system capable of long-term monitoring of fetal health will result in accurate diagnoses and prediction of future outcomes. This will, in turn, allow early intervention by the clinician to reduce fetal deaths and enhance the chances of good outcomes, with resultant savings in social and financial costs to the community. The development of such equipment would spawn future research into intervention treatments and contribute to Australia's position as a world leader in computerised health monitoring systems.
Multi-Channel Time-Frequency Analysis for EEG Neonatal Seizure Characterization. This project researches new signal processing methodologies for a multi-channel characterization of seizures for use in diagnosing newborn brain dysfunctions. The outcomes will result in important immediate clinical benefits for sick newborn babies and will fundamentally facilitate research progress in the development of neuroprotectants and anticonvulsants. The success of this project will contribute to minimizing social and financial costs by diagnosing brain disorders in the initial stage of life and preventing further damage. This has the potential to result in standard diagnostic equipment for neonatal intensive care units and medical research centres.
Design of Neonatal Seizure Diagnosis Methods Using Time-Frequency Signal Processing Techniques. Seizures occur in approximately 0.5% of all newborns. They are often the only indicator of an early dysfunction in the central nervous system (CNS). Their occurrence raises concerns about the underlying cause, its effect on the brain, and the appropriate treatment. Newborn seizures are mostly sub-clinical and only detected through the electroencephalogram (EEG). For an efficient diagnosis, seizure classification systems were proposed based on visual observations. This project proposes developing a novel approach to automate the classification process using time-frequency (TF) signal processing techniques based on the multi-channel characteristics of the seizure, namely: (A) TF signature, (B) origin, and (C) propagation behaviour.
The next generation speaker recognition system. The next generation of speaker recognition technologies developed through this project will enable secure person authentication by voice in financial transactions and benefit the community through the elimination of identity fraud. This project will safeguard Australia by identifying criminal suspects using their voice and combat terrorism by using voice to locate and track terrorists.
ARC Research Network on Intelligent Sensors, Sensor Networks and Information Processing. Sensor networks, collections of diverse sensors interconnected via an ad-hoc communication network, are identified as one of the key technologies that over the next two decades will change the way we live. This research network brings together an interdisciplinary team of outstanding Australian researchers representing all the key disciplines required to successfully deploy sensor networks and links this team with the foremost international authorities and leading industry players in the area of sensor networks. This research network will guide collaborative research that will ensure Australia plays a world-leading role in sensor network development and implementation.
Audio Visual Speech Recognition. Even though significant advances have been made in automatic speech recognition using acoustic information, recognition accuracies are still poor in noisy and hostile environments such as crowds, traffic and factory floors. In many of these applications visual information is, or can easily be made, available in addition to the audio. The aim of this project is to achieve an order of magnitude improvement in speech recognition accuracies in adverse environments by joint processing and modelling of the acoustic modality with visual information in the form of lip shapes and movements. The outcomes will be useful in human-computer interaction in adverse environments as well as in the transcription and mining of multimedia data.
Accurate Location Service for 3G Cellular Networks. The project aims to improve the accuracy of automatic location identification of GPS-enabled handsets in third-generation cellular networks via a modified Differential GPS method. The proposed sub-metre location ability will be a value-added service of the network providers and will appeal to a wide sector of users. The challenge is to develop a cost-effective solution by utilising existing network infrastructure. We propose to use the GPS timing units in 3G networks for deriving local GPS position corrections, and to transfer these corrections to the users via the 3G data bursts of the beacons or other user data broadcasts.
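The correction idea in the entry above can be illustrated with a minimal position-domain sketch. It is not the project's method: it assumes a reference receiver at a surveyed location, local east/north/up coordinates in metres, and an error shared by the reference and the nearby handset. The names `Position`, `derive_correction` and `apply_correction` are hypothetical, and practical Differential GPS systems usually correct per-satellite pseudoranges rather than final positions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Position:
    """Local east/north/up coordinates in metres (illustrative frame)."""
    east_m: float
    north_m: float
    up_m: float


def derive_correction(surveyed: Position, measured: Position) -> Position:
    """Offset that maps the reference receiver's GPS fix onto its surveyed position."""
    return Position(
        surveyed.east_m - measured.east_m,
        surveyed.north_m - measured.north_m,
        surveyed.up_m - measured.up_m,
    )


def apply_correction(fix: Position, correction: Position) -> Position:
    """Shift a handset's raw GPS fix by the broadcast correction."""
    return Position(
        fix.east_m + correction.east_m,
        fix.north_m + correction.north_m,
        fix.up_m + correction.up_m,
    )


# Hypothetical numbers: the reference receiver sits at the local origin,
# but its GPS fix is displaced by a locally shared error.
reference_surveyed = Position(0.0, 0.0, 0.0)
reference_measured = Position(1.5, -2.0, 0.5)
correction = derive_correction(reference_surveyed, reference_measured)

# A nearby handset is assumed to see the same error in its raw fix.
handset_raw = Position(101.5, 48.0, 10.5)
handset_corrected = apply_correction(handset_raw, correction)
```

The sketch only works to the extent that the handset and the reference receiver experience the same error, which is why the corrections are described as local.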