Robust Automatic Speaker Diarisation of Audio Documents by Exploiting Prior Sources of Information. Speaker Diarisation, the task of determining who spoke when, is a technology fundamental to deriving intelligent information from audio and multimedia resources. The requirement for efficient and accurate Speaker Diarisation systems, portable across different domains, is heightened by the explosive growth of audio and multimedia archives online and throughout the world. This research will provide the foundation for a commercial service of automatic Speaker Diarisation to be developed, growing Australia's impact on the information and communications technology (ICT) sector. The outcome of this research will also assist in the tracking of terrorist and unlawful activity by enabling effective intelligence gathering from different audio sources.
Progressive Transmission of Street Directory Assistance and Business Pages over 3G and 4G mobile networks. Multimedia on-demand and live services over 3G and 4G mobiles will be enhanced. New methods for low volume, high information transfer multimedia transactions will be developed. This will create new jobs in the Information and Communication Technologies (ICT) sector. Progressive transmission of street directory assistance and business pages information to mobile handsets will enable citizens to make efficient use of their time and improve productivity. The 3G and 4G cellular telephone networks, extended with 'mobile' base stations and satellite links, are especially attractive to a large country like Australia. Interactive information retrieval will become more universal and will no longer be limited to wired Internet connections.
Audio Visual Speech Recognition. Even though significant advances have been made in automatic speech recognition using acoustic information, the recognition accuracies are still poor in noisy and hostile environments such as in crowds, traffic, factory floors etc. In many of these applications visual information is or can easily be made available in addition to the audio. The aim of this project is to achieve an order of magnitude improvement in speech recognition accuracies in adverse environments by joint processing and modelling of the acoustic modality with visual information in the form of lip shapes and movements. The outcomes will be useful in human computer interaction in adverse environments as well as in the transcription and mining of multimedia data.
Automatic audio segmentation, classification, identification, search and retrieval. The research aims to develop generic tools for automated audio segmentation, classification, identification and search, with lowest possible computational complexity and highest accuracy and speed. The tools will be applicable to audio archive management, search of audio material over WWW and personal archives of music and audio-assisted video analysis. The industry will use the tools for automated broadcast verification and identification for copyright surveillance and calculation of royalty payments, aiming to penetrate both Australian and overseas markets. The area of real-time audio scene analysis is in its infancy and the research aims to make significant contributions to this area.
Physiologically accurate audio processing in cochlear implants. This project proposes to use a physiologically motivated computational model of the cochlea, which along with newly developed cochlear-implant electrode technology will produce the next quantum improvement in speech intelligibility and quality of hearing for implant recipients.
Making demonstrably reliable forensic voice comparison a practical everyday reality in Australia. To assist Australian law-enforcement agencies and courts in the process of the conviction of the guilty and the exoneration of the innocent, this project will develop and test a practical and demonstrably reliable forensic voice comparison system for use with Australian voices. This will allow forensic scientists to produce reliable strength of evidence statements for presentation in court using the same evaluative framework as used with DNA. In addition, application of the system during criminal investigations may lead to the refocussing of investigations on other suspects, or may help leverage guilty pleas, thus saving substantial time and money.
Individualized cochlear implant sound coding: Optimized algorithms for better hearing. One in six Australians is affected by hearing loss. Hearing loss impacts on a person's educational and employment opportunities, resulting in a significant economic impact upon Australia. Over 10% of people with hearing impairment have a severe or profound hearing loss and may be candidates for a cochlear implant. Current cochlear implant sound processing only offers limited benefit to users. This project represents a truly innovative pathway forward in the development of cochlear implant sound coding that could substantially increase the speech perception of users, enabling these people to become and remain active and productive members of our community.
Development Of An Artificial Vision System For Motion Detection. This work will have a broader impact beyond the advancement of a particular area of neurophysiology since it aims to produce a simple, robust velocity estimator in computer models and subsequently in silicon chips. We have already demonstrated the feasibility of transferring our basic preliminary models onto VLSI chips. Consistent with National Research Priorities 3 (Frontier Technologies), 4 (Defence Technologies) & 2 (Promoting and Maintaining Good Health) our work has applications in the area of miniature autonomous systems in defence and civilian roles, including miniature unmanned aerial vehicles and collision avoidance detectors, while in the longer term it will serve as the basis for designing implantable artificial eyes.
Tooth-mic Devices for Monitoring the Efficacy of Home-based Continuous Positive Airway Pressure (CPAP) Technology. Over 800,000 Australians suffer from Obstructive Sleep Apnea (OSA). OSA patients use twice the health resources compared to healthy people. They are 7 times more likely to cause traffic accidents; in NSW up to 43,000 accidents/year are due to OSA. OSA is treatable & consequences such as strokes, diabetes & heart attacks are preventable. The standard OSA treatment is home-based Continuous Positive Airway Pressure Therapy. Unfortunately, no effective technique exists to measure the efficacy of the treatment. We propose enabling solutions to this problem by developing technology centered on breathing sound analysis. The project proposes joint work with a US company, facilitating access to advanced technology highly beneficial to Australia.
Vocal empowerment: Researching the effect of actor voice training on young adults with cochlear implants and hearing aids. The research offers significant public health outcomes for Australians, in the areas of preventative healthcare, through improving the social integration and quality of life of young adults with a hearing impairment and through improvements in habilitation.