Enhanced Multilingual Speaker Recognition through the Incorporation of High-Level Features, Late Fusion and Discriminative Classification Methods. The development of robust multilingual speaker recognition systems will benefit the community through the elimination of fraud incurred by financial institutions and customers by enabling several person authentication applications such as: voice based signatures and document issuance; credit card verification by voice and secure over-the-phone financi ....Enhanced Multilingual Speaker Recognition through the Incorporation of High-Level Features, Late Fusion and Discriminative Classification Methods. The development of robust multilingual speaker recognition systems will benefit the community through the elimination of fraud incurred by financial institutions and customers by enabling several person authentication applications such as: voice based signatures and document issuance; credit card verification by voice and secure over-the-phone financial transactions. The technology will also assist in the protection of the community and safeguard Australia by enabling the implementation of the following: suspect identification using voice print; national security measures for combating terrorism by using voice to locate and track terrorists; preemptive criminal activity counter-measures; surveillance and secure building access by voice.Read moreRead less
ARC Research Network for Enabling Human Communication. The Human Communication Network promotes interdisciplinary research in speech, language, and sound by and between humans and machines. The network connects leading and emerging researchers across disciplines, exploits previously unrecognised intersections, supports interdisciplinary graduate training and exchanges, provides database storage infrastructure, and consults with industry and government to set, not follow, research agendas. By ge ....ARC Research Network for Enabling Human Communication. The Human Communication Network promotes interdisciplinary research in speech, language, and sound by and between humans and machines. The network connects leading and emerging researchers across disciplines, exploits previously unrecognised intersections, supports interdisciplinary graduate training and exchanges, provides database storage infrastructure, and consults with industry and government to set, not follow, research agendas. By generating an explosion of new approaches and knowledge, the network will build Australia's reputation as a leader in communication science and technology via advances in automatic speech recognition, distress call monitoring, hearing prostheses, web interfaces, and data retrieval and data mining systems.Read moreRead less
ARC Communications Research Network. Building on a strong platform of existing research excellence, the Aim of the Network is to facilitate nation-wide collaborative research, promoting four intersecting research Themes: Mobile and Wireless Communications, Rural Communications, Broadband and Optical Networks, and Fundamentals of Emerging Media. Each Theme is formulated to drive multidisciplinary, innovative research as well as inspire new collaborative initiatives. Four Programs encapsulate the ....ARC Communications Research Network. Building on a strong platform of existing research excellence, the Aim of the Network is to facilitate nation-wide collaborative research, promoting four intersecting research Themes: Mobile and Wireless Communications, Rural Communications, Broadband and Optical Networks, and Fundamentals of Emerging Media. Each Theme is formulated to drive multidisciplinary, innovative research as well as inspire new collaborative initiatives. Four Programs encapsulate the core activities of the Network: Researcher Mobility, Workshops and Conferences, Postgraduate Education, and Knowledge Management Systems. The Network is expected to add significant value to pre-existing investments and raise the profile of Australian telecommunications research.Read moreRead less
Frequency-related features derived from phase spectrum for robust speech recognition. Though the currently available speech recognizers work reasonably well in noise-free environments, their performance deteriorates drastically even in the presence of a small amount of noise. In order to overcome this problem, new frequency-related features are proposed in this project for speech recognition. These features are derived from the phase spectrum of the speech signal, and are expected to be robust t ....Frequency-related features derived from phase spectrum for robust speech recognition. Though the currently available speech recognizers work reasonably well in noise-free environments, their performance deteriorates drastically even in the presence of a small amount of noise. In order to overcome this problem, new frequency-related features are proposed in this project for speech recognition. These features are derived from the phase spectrum of the speech signal, and are expected to be robust to the additive noise distortion. These features will make the speech recognizer less sensitive to noise and will enhance its utility in a number of applications in the telecommunication and business world.Read moreRead less
Fixed and variable-length segment vocoders for very low bitrate speech coding. Reliable and secure voice communication is an important aspect of military and defence operations. In order to reduce the possibility of interception, low power transmitters are normally used for radio communications, where the bandwidth is often very low. Military voice communication, therefore, requires the coding of speech at very low bitrates. Our research proposal aims to develop speech coders that can operate ....Fixed and variable-length segment vocoders for very low bitrate speech coding. Reliable and secure voice communication is an important aspect of military and defence operations. In order to reduce the possibility of interception, low power transmitters are normally used for radio communications, where the bandwidth is often very low. Military voice communication, therefore, requires the coding of speech at very low bitrates. Our research proposal aims to develop speech coders that can operate at lower bitrates and reproduce speech of high quality and intelligibility. This is highly beneficial to the defence forces of Australia as it will permit the use of high-grade encryption technology to improve the security of transmission.Read moreRead less
Tooth-mic Devices for Monitoring the Efficacy of Home-based Continuous Positive Airway Pressure (CPAP) Technology. Over 800,000 Australians suffer from Obstructive Sleep Apnea (OSA). OSA patients use twice the health resources compared to healthy people. They are 7 times more likely to cause traffic accidents; in NSW up to 43000 accidents/year are due to OSA. OSA is treatable & consequences such as strokes, diabetes & heart attacks are preventable. The standard OSA treatment is home-based Contin ....Tooth-mic Devices for Monitoring the Efficacy of Home-based Continuous Positive Airway Pressure (CPAP) Technology. Over 800,000 Australians suffer from Obstructive Sleep Apnea (OSA). OSA patients use twice the health resources compared to healthy people. They are 7 times more likely to cause traffic accidents; in NSW up to 43000 accidents/year are due to OSA. OSA is treatable & consequences such as strokes, diabetes & heart attacks are preventable. The standard OSA treatment is home-based Continuous Positive Airway Pressure Therapy. Unfortunately, no effective technique exists to measure the efficacy of the treatment. We propose enabling solutions to this problem via developing technology centered on breathing sound analysis. The project proposes joint work with a US-company facilitating access to advanced technology highly beneficial to Australia.Read moreRead less
Progressive Transmission of Street Directory Assistance and Business Pages over 3G and 4G mobile networks. Multimedia on-demand and live services over 3G and 4G mobiles will be enhanced. New methods for low volume, high information transfer multimedia transactions will be developed. This will create new jobs in the Information and Communication Technologies (ICT) sector. Progressive transmission of street directory assistance and business pages information to mobile handsets will enable citize ....Progressive Transmission of Street Directory Assistance and Business Pages over 3G and 4G mobile networks. Multimedia on-demand and live services over 3G and 4G mobiles will be enhanced. New methods for low volume, high information transfer multimedia transactions will be developed. This will create new jobs in the Information and Communication Technologies (ICT) sector. Progressive transmission of street directory assistance and business pages information to mobile handsets will enable citizens to make efficient use of their time and improve productivity. The 3G and 4G cellular telephone network, extended with 'mobile' base stations and satellite links, are especially attractive to a large country like Australia. Interactive information retrieval will become more universal and not limited through wired Internet connections.
Read moreRead less
Audio Visual Speech Recognition. Even though significant advances have been made in automatic speech recognition using acoustic information, the recognition accuracies are still poor in noisy and hostile environments such as in crowds, traffic, factory floors etc. In many of these applications visual information is or can easily be made available in addition to the audio. The aim of this project is to achieve an order of magnitude improvement in speech recognition accuracies in adverse environme ....Audio Visual Speech Recognition. Even though significant advances have been made in automatic speech recognition using acoustic information, the recognition accuracies are still poor in noisy and hostile environments such as in crowds, traffic, factory floors etc. In many of these applications visual information is or can easily be made available in addition to the audio. The aim of this project is to achieve an order of magnitude improvement in speech recognition accuracies in adverse environments by joint processing and modelling of the acoustic modality with visual information in the form of lip shapes and movements. The outcomes will be useful in human computer interaction in adverse environments as well as in the transcription and mining of multimedia data.
Read moreRead less
Robust feature extraction for automatic speech recognition. Speech is perhaps the most natural and efficient mode of communication for humans. Therefore, it has always been a dream for many people to communicate with machines via speech. Significant advances have been made in the last five decades in the area of automatic speech recognition. Though the currently available speech recognisers work reasonably well in noise-free office environments, their performance deteriorates drastically when th ....Robust feature extraction for automatic speech recognition. Speech is perhaps the most natural and efficient mode of communication for humans. Therefore, it has always been a dream for many people to communicate with machines via speech. Significant advances have been made in the last five decades in the area of automatic speech recognition. Though the currently available speech recognisers work reasonably well in noise-free office environments, their performance deteriorates drastically when they are deployed in real-life situations due to the presence of background noise and other distortions. The problem of robust speech recognition will be researched in this project. Read moreRead less
Automatic audio segmentation, classification, identification, search and retrieval. The research aims to develop generic tools for automated audio segmentation, classification, identification and search, with lowest possible computational complexity and highest accuracy and speed. The tools will be applicable to audio archive management, search of audio material over WWW and personal archives of music and audio-assisted video analysis. The industry will use the tools for automated broadcast ve ....Automatic audio segmentation, classification, identification, search and retrieval. The research aims to develop generic tools for automated audio segmentation, classification, identification and search, with lowest possible computational complexity and highest accuracy and speed. The tools will be applicable to audio archive management, search of audio material over WWW and personal archives of music and audio-assisted video analysis. The industry will use the tools for automated broadcast verification and identification for copyright surveillance and calculation of royalty payments, aiming to penetrate both Australian and overseas markets. The area of real-time audio scene analysis is in its infancy and the research aims to make significant contributions to this area.Read moreRead less