Analysing Iterative Machine Learning Algorithms with Information Geometric Methods. Online machine learning problems arise from situations where data is provided a point at a time. There are many classical algorithms for solving such problems based on the principle of stochastic gradient descent. Recent research by the CIs and others have thrown up interesting but diverse geometric connections that offer new insights. The proposed research aims to integrate the understanding of these algori ....Analysing Iterative Machine Learning Algorithms with Information Geometric Methods. Online machine learning problems arise from situations where data is provided a point at a time. There are many classical algorithms for solving such problems based on the principle of stochastic gradient descent. Recent research by the CIs and others have thrown up interesting but diverse geometric connections that offer new insights. The proposed research aims to integrate the understanding of these algorithms with the aim of designing algorithms better able to exploit prior knowledge, and to extend existing algorithms to new problem domains thus offering well principled and well understood algorithms for solving a variety of novel online problems.Read moreRead less
Robust speech recognition in realistic hostile environments. Australia leads the world in the adoption of speech recognition technology but sadly lags in the development of the fundamental advances in the area. This research will help propel Australia to the forefront of new innovations in speech recognition technology and contributions to fundamental science. Our project will provide an excellent training ground for graduate students and researchers, with the real possibility of significant com ....Robust speech recognition in realistic hostile environments. Australia leads the world in the adoption of speech recognition technology but sadly lags in the development of the fundamental advances in the area. This research will help propel Australia to the forefront of new innovations in speech recognition technology and contributions to fundamental science. Our project will provide an excellent training ground for graduate students and researchers, with the real possibility of significant commercial benefit to the nation. The deployment of our system in the community will greatly enhance the defence and police forces ability for surveillance and security, and will provide new assistive aids to improve the quality of life and safety for the elderly and disabled.Read moreRead less
Enhanced Multilingual Speaker Recognition through the Incorporation of High-Level Features, Late Fusion and Discriminative Classification Methods. The development of robust multilingual speaker recognition systems will benefit the community through the elimination of fraud incurred by financial institutions and customers by enabling several person authentication applications such as: voice based signatures and document issuance; credit card verification by voice and secure over-the-phone financi ....Enhanced Multilingual Speaker Recognition through the Incorporation of High-Level Features, Late Fusion and Discriminative Classification Methods. The development of robust multilingual speaker recognition systems will benefit the community through the elimination of fraud incurred by financial institutions and customers by enabling several person authentication applications such as: voice based signatures and document issuance; credit card verification by voice and secure over-the-phone financial transactions. The technology will also assist in the protection of the community and safeguard Australia by enabling the implementation of the following: suspect identification using voice print; national security measures for combating terrorism by using voice to locate and track terrorists; preemptive criminal activity counter-measures; surveillance and secure building access by voice.Read moreRead less
Robust speaker recognition with reduced utterance duration and intersession variability. The development of robust and accurate speaker recognition systems will enable secure person authentication in over-the-phone financial transactions and benefit the community through the elimination of identity fraud incurred by customers and financial institutions. The technology will also assist in safeguarding Australia by enabling the implementation of suspect identification using voice and security meas ....Robust speaker recognition with reduced utterance duration and intersession variability. The development of robust and accurate speaker recognition systems will enable secure person authentication in over-the-phone financial transactions and benefit the community through the elimination of identity fraud incurred by customers and financial institutions. The technology will also assist in safeguarding Australia by enabling the implementation of suspect identification using voice and security measures for combating terrorism by using voice to locate and track terrorists. Our research at QUT Speech Research Lab is at the forefront of development in this field and will provide Australia with a technological advantage in the rapidly evolving global market for speaker recognition technology for person authentication applications.Read moreRead less
Robust Automatic Speaker Diarisation of Audio Documents by Exploiting Prior Sources of Information. Speaker Diarisation, the task of determining who spoke when, is a technology fundamental in deriving intelligent information from audio and multimedia resources. The requirement for efficient and accurate Speaker Diarisation systems, portable across different domains is heightened by the explosive growth of audio and multimedia archives online and throughout the world. This research will provide t ....Robust Automatic Speaker Diarisation of Audio Documents by Exploiting Prior Sources of Information. Speaker Diarisation, the task of determining who spoke when, is a technology fundamental in deriving intelligent information from audio and multimedia resources. The requirement for efficient and accurate Speaker Diarisation systems, portable across different domains is heightened by the explosive growth of audio and multimedia archives online and throughout the world. This research will provide the foundation for a commercial service of automatic Speaker Diarisation to be developed, growing Australia's impact on the information and communications technology (ICT) sector. The outcome of this research will also assist in the tracking of terrorist and unlawful activity by enabling effective intelligence gathering from different audio sources.Read moreRead less
Breathing and snoring sound analysis in sleep apnea. About 800,000 Australians suffer from the disease sleep Apnoea (OSA) which has snoring as its earliest symptom. We develop electronics and snore processing algorithms to classify snorers into OSA-positive and OSA-negative classes, based on advanced technology derived from speech recognition systems.
Audio Visual Speech Recognition. Even though significant advances have been made in automatic speech recognition using acoustic information, the recognition accuracies are still poor in noisy and hostile environments such as in crowds, traffic, factory floors etc. In many of these applications visual information is or can easily be made available in addition to the audio. The aim of this project is to achieve an order of magnitude improvement in speech recognition accuracies in adverse environme ....Audio Visual Speech Recognition. Even though significant advances have been made in automatic speech recognition using acoustic information, the recognition accuracies are still poor in noisy and hostile environments such as in crowds, traffic, factory floors etc. In many of these applications visual information is or can easily be made available in addition to the audio. The aim of this project is to achieve an order of magnitude improvement in speech recognition accuracies in adverse environments by joint processing and modelling of the acoustic modality with visual information in the form of lip shapes and movements. The outcomes will be useful in human computer interaction in adverse environments as well as in the transcription and mining of multimedia data.
Read moreRead less
Bayesian inference for complex regression models using mixtures. The project will use mixtures to flexibly model complex regression functions and will develop Bayesian methods for carrying out statistical inference on these models. The models will deal with both Gaussian and non-Gaussian data. Multiple explanatory variables are dealt with by mixing simple additives to produce flexible high dimensional function estimates. Variable selection and model averaging will be used to identify important v ....Bayesian inference for complex regression models using mixtures. The project will use mixtures to flexibly model complex regression functions and will develop Bayesian methods for carrying out statistical inference on these models. The models will deal with both Gaussian and non-Gaussian data. Multiple explanatory variables are dealt with by mixing simple additives to produce flexible high dimensional function estimates. Variable selection and model averaging will be used to identify important variables and thus make the estimation more efficient. The methods will be extended to multivariate responses where account will taken be taken of the structure of the dependence between responses.Read moreRead less
Multiscale and multimodal modelling of brain dynamics. This project aims to understand dynamics of how several brain regions work together to process information. This project will generate new knowledge in brain sciences by using state of the art computational modelling and neuroimaging methods like functional and diffusion magnetic resonance imaging and electromagnetic measurements. This project will develop technologies to compute multiscale, multimodal and directed connectivity in the brain. ....Multiscale and multimodal modelling of brain dynamics. This project aims to understand dynamics of how several brain regions work together to process information. This project will generate new knowledge in brain sciences by using state of the art computational modelling and neuroimaging methods like functional and diffusion magnetic resonance imaging and electromagnetic measurements. This project will develop technologies to compute multiscale, multimodal and directed connectivity in the brain. Expected outcomes of this project will enhance our understanding of the brain’s functional organization and dynamics. The benefits of this project will include breakthroughs in development of new neuro-technologies like brain-machine interfaces and neuroscience inspired artificial intelligence. Read moreRead less
Discovery Early Career Researcher Award - Grant ID: DE170100128
Funder
Australian Research Council
Funding Amount
$395,000.00
Summary
Information processing in the brain. This project aims to understand the brain's functional organisation by developing non-invasive methods to characterise connectivity between interacting brain regions. No model-based methods to compute directional coupling between brain regions can be applied to large scale networks for resting state functional MRI data. This capability would be a major breakthrough in neuroimaging, given uninformative (non-directional) network connectivity analysis restricts ....Information processing in the brain. This project aims to understand the brain's functional organisation by developing non-invasive methods to characterise connectivity between interacting brain regions. No model-based methods to compute directional coupling between brain regions can be applied to large scale networks for resting state functional MRI data. This capability would be a major breakthrough in neuroimaging, given uninformative (non-directional) network connectivity analysis restricts research. This project is expected to advance our understanding of information processing in the brain by providing a mechanistic approach to functional integration.Read moreRead less