Robust speech recognition in realistic hostile environments. Australia leads the world in the adoption of speech recognition technology but sadly lags in the development of the fundamental advances in the area. This research will help propel Australia to the forefront of new innovations in speech recognition technology and contributions to fundamental science. Our project will provide an excellent training ground for graduate students and researchers, with the real possibility of significant com ....Robust speech recognition in realistic hostile environments. Australia leads the world in the adoption of speech recognition technology but sadly lags in the development of the fundamental advances in the area. This research will help propel Australia to the forefront of new innovations in speech recognition technology and contributions to fundamental science. Our project will provide an excellent training ground for graduate students and researchers, with the real possibility of significant commercial benefit to the nation. The deployment of our system in the community will greatly enhance the defence and police forces ability for surveillance and security, and will provide new assistive aids to improve the quality of life and safety for the elderly and disabled.Read moreRead less
Visual Solutions for Automated Translation Between Spoken and Signed Languages. We propose to build a robust visual speech recognition system that analyzes images of spoken language and achieves a recognition of the utterances with at least human expert recognition rates. This visual speech recognition system will then be integrated with our existing gesture recognition system to improve performance, just as humans combine visual and audio data for language understanding. The result will be a sy ....Visual Solutions for Automated Translation Between Spoken and Signed Languages. We propose to build a robust visual speech recognition system that analyzes images of spoken language and achieves a recognition of the utterances with at least human expert recognition rates. This visual speech recognition system will then be integrated with our existing gesture recognition system to improve performance, just as humans combine visual and audio data for language understanding. The result will be a system providing translation between English and the Australian sign language Auslan in a practical application domain. Significantly, our work will provide insights into the cognitive models of neural activity linking language and gesture.Read moreRead less
ARC Research Network for Enabling Human Communication. The Human Communication Network promotes interdisciplinary research in speech, language, and sound by and between humans and machines. The network connects leading and emerging researchers across disciplines, exploits previously unrecognised intersections, supports interdisciplinary graduate training and exchanges, provides database storage infrastructure, and consults with industry and government to set, not follow, research agendas. By ge ....ARC Research Network for Enabling Human Communication. The Human Communication Network promotes interdisciplinary research in speech, language, and sound by and between humans and machines. The network connects leading and emerging researchers across disciplines, exploits previously unrecognised intersections, supports interdisciplinary graduate training and exchanges, provides database storage infrastructure, and consults with industry and government to set, not follow, research agendas. By generating an explosion of new approaches and knowledge, the network will build Australia's reputation as a leader in communication science and technology via advances in automatic speech recognition, distress call monitoring, hearing prostheses, web interfaces, and data retrieval and data mining systems.Read moreRead less
Missing Voices: Communication Difficulties After Stroke And Traumatic Brain Injury In Indigenous Australians
Funder
National Health and Medical Research Council
Funding Amount
$655,310.00
Summary
Acquired communication disorder (ACD) is a common result of stroke and traumatic brain injury (TBI) and has a devastating impact on victims’ everyday lives. Stroke and TBI occur more than twice as frequently in Indigenous as in non-Indigenous populations, but current uptake of communication rehabilitation services is low and long term outcomes for the individuals are unknown. This Australian first study will examine the extent and impact of ACD in urban and rural Indigenous Australians.
Does word similarity across languages help or hinder bilingual speakers? This project aims to understand in more detail how bilinguals can accurately speak in both their languages. Speaking is a complex skill, particularly if you have two languages to choose from, which will be true for over half of Australia’s population by 2025. This project aims to investigate the factors that influence speech production in both monolinguals and bilinguals including those with language impairment, and develop ....Does word similarity across languages help or hinder bilingual speakers? This project aims to understand in more detail how bilinguals can accurately speak in both their languages. Speaking is a complex skill, particularly if you have two languages to choose from, which will be true for over half of Australia’s population by 2025. This project aims to investigate the factors that influence speech production in both monolinguals and bilinguals including those with language impairment, and develop a better bilingual theory. The benefit of this new theory will be to provide a clear basis for diagnosis and treatment for children in bilingual households who have problems learning to speak, and for bilingual people with language problems after a stroke or dementia.Read moreRead less
Advanced Capture, Analysis and Compression of Facial Images. Facial image processing is an area of research that holds an important key to future advances in intelligent human-to-computer and human-to-human systems. This project will investigate and develop superior approaches to image capturing of human faces for subsequent analysis and compression. It aims to develop innovative techniques to detect, extract and recognise faces, as well as more efficient ways to compress facial image data. This ....Advanced Capture, Analysis and Compression of Facial Images. Facial image processing is an area of research that holds an important key to future advances in intelligent human-to-computer and human-to-human systems. This project will investigate and develop superior approaches to image capturing of human faces for subsequent analysis and compression. It aims to develop innovative techniques to detect, extract and recognise faces, as well as more efficient ways to compress facial image data. This project will provide advanced Australian technology with applications in some of the world's fastest growing markets, including crowd surveillance, computer user interface, videoconferencing, and multimedia systems.Read moreRead less
Linkage Infrastructure, Equipment And Facilities - Grant ID: LE100100211
Funder
Australian Research Council
Funding Amount
$650,000.00
Summary
The Big Australian Speech Corpus: An audio-visual speech corpus of Australian English. Contemporary speech science and technology are driven by the availability of large speech corpora. While audio databases exist for languages spoken in America, Europe and Japan, there is currently no large auditory-visual database of spoken language, and certainly not one for Australian English. Here we will establish the Big Australian Speech Corpus, which will support a speech science research and developmen ....The Big Australian Speech Corpus: An audio-visual speech corpus of Australian English. Contemporary speech science and technology are driven by the availability of large speech corpora. While audio databases exist for languages spoken in America, Europe and Japan, there is currently no large auditory-visual database of spoken language, and certainly not one for Australian English. Here we will establish the Big Australian Speech Corpus, which will support a speech science research and development using Australian English and facilitate the development of Australian speech technology applications from automatic speech recognition and text-to-speech synthesis used in taxi and other ordering services, to hearing prostheses and talking head aids for learning-impaired children, and a range of security and forensic applications.Read moreRead less
A Skin Detection Micro-Sensor for Face Identification using Color and Stereo Information. The objective of this research is to develop a micro-sensor for face identification, using color and stereo information. The micro-sensor chip performs a real-time search of the scene to locate human skin for subsequent face detection. This micro-sensor could also be used for gesture recognition, lip reading, monitoring driver's hypo-vigilance or tracking a person in a crowd. The chip image-recognition capa ....A Skin Detection Micro-Sensor for Face Identification using Color and Stereo Information. The objective of this research is to develop a micro-sensor for face identification, using color and stereo information. The micro-sensor chip performs a real-time search of the scene to locate human skin for subsequent face detection. This micro-sensor could also be used for gesture recognition, lip reading, monitoring driver's hypo-vigilance or tracking a person in a crowd. The chip image-recognition capabilities will spur the development of a new generation of consumer products with "intelligent eyes".
Read moreRead less
Linkage Infrastructure, Equipment And Facilities - Grant ID: LE100100235
Funder
Australian Research Council
Funding Amount
$280,000.00
Summary
Accelerating Australia's large scale video surveillance research programmes. The research to be conducted using this infrastructure will bring immense benefits to Australia in terms of increased levels of public safety and in the protection of critical facilities from terrorism and other crimes, by developing better surveillance systems. This will provide both increases in measurable research outputs and opportunities for Australian business to commercialise these systems. The infrastructure wil ....Accelerating Australia's large scale video surveillance research programmes. The research to be conducted using this infrastructure will bring immense benefits to Australia in terms of increased levels of public safety and in the protection of critical facilities from terrorism and other crimes, by developing better surveillance systems. This will provide both increases in measurable research outputs and opportunities for Australian business to commercialise these systems. The infrastructure will accelerate the pace of surveillance research and development in Australia, enhancing the competitiveness of both Australia's researchers and the businesses that will commercialise these researchers' discoveries.Read moreRead less
Multi-Modal Dictionary Learning for Smart City Operation and Management. This Project aims to provide new digital asset management tools for city councils to improve city services by utilising new sensing and automated learning technologies for recognising, tracking and auditing of assets. Currently, there are no digital tools available to handle these services. This project proposes new multi-modal sensing and mapping of city asset techniques by building new multi-modal dictionary learning proc ....Multi-Modal Dictionary Learning for Smart City Operation and Management. This Project aims to provide new digital asset management tools for city councils to improve city services by utilising new sensing and automated learning technologies for recognising, tracking and auditing of assets. Currently, there are no digital tools available to handle these services. This project proposes new multi-modal sensing and mapping of city asset techniques by building new multi-modal dictionary learning procedures. The new framework will recognise different conditions of city assets in real-time to make decisions. Expected outcomes of this Project include integration and easy access of assets with unique digital identities to help city councils, governments, and navigation services for real-time asset monitoring.Read moreRead less