How, What and Who in Human Communication: Movement of Face and Voice. The aim of this project is to identify the essential characteristics of tone, affect, and identity from face and voice using a combination of signal processing, biological, and behavioural techniques in order to develop a comprehensive model of auditory-visual speech processing and communication. This research will significantly improve understanding of the basis of auditory-visual perception and production in tonal languages ....How, What and Who in Human Communication: Movement of Face and Voice. The aim of this project is to identify the essential characteristics of tone, affect, and identity from face and voice using a combination of signal processing, biological, and behavioural techniques in order to develop a comprehensive model of auditory-visual speech processing and communication. This research will significantly improve understanding of the basis of auditory-visual perception and production in tonal languages and in affective communication, facilitate links between neurophysiological processes and auditory-visual speech processing; and contribute to applications in automatic person recognition, automatic speech recognition, text-to-speech systems, and talking head aids for the hearing impaired.Read moreRead less
Special Research Initiatives - Grant ID: SR0354596
Funder
Australian Research Council
Funding Amount
$20,000.00
Summary
Perception and Action in Auditory Scenes (PAAS): Neural, Behavioural, Computational and Mechanical Systems. Auditory scenes are temporal and ephemeral yet pervasively influence human life. How humans negotiate such scenes has not been solved, a fact highlighted by attempts to build machines to respond to speech, warnings etc., in real-world situations with room reverberation, different talkers, and background noise. No one discipline can solve such problems. In this network outstanding researche ....Perception and Action in Auditory Scenes (PAAS): Neural, Behavioural, Computational and Mechanical Systems. Auditory scenes are temporal and ephemeral yet pervasively influence human life. How humans negotiate such scenes has not been solved, a fact highlighted by attempts to build machines to respond to speech, warnings etc., in real-world situations with room reverberation, different talkers, and background noise. No one discipline can solve such problems. In this network outstanding researchers from physical, medical, human, and social sciences with interests in speech, music and audition will provide insights into how humans and machines localize, recognize, interpret and produce auditory events, and advance frontier technologies, e.g., automatic speech recognition, hearing prostheses, auditory monitoring/warning systems.Read moreRead less
Filters reveal what flicker conceals: temporal processing in the human visual system. I have recently discovered a new form of camouflage using 10Hz luminance flicker. This project will quantify this effect and examine the extent to which it generalises across colour and spatial dimensions and to video sequences depicting natural scenes. This information is expected to provide foundational information to technologies relating to national security that rely on visual concealment. This research wi ....Filters reveal what flicker conceals: temporal processing in the human visual system. I have recently discovered a new form of camouflage using 10Hz luminance flicker. This project will quantify this effect and examine the extent to which it generalises across colour and spatial dimensions and to video sequences depicting natural scenes. This information is expected to provide foundational information to technologies relating to national security that rely on visual concealment. This research will examine the extent to which filtering out these camouflaging frequencies enhances our sensitivity to low temporal frequency information. This decamouflaging aspect of my research is expected to improve the clarity of digital video-based technologies including ultrasound, educational, info-tainment and defence applicationsRead moreRead less
Broadcasting 3D Audio: Recording, Transmission, and Playback. With the current state of the art, a performance at the Sydney Opera House cannot be recorded and broadcast such that you can listen to it as if you are in the best seat of the house. The goal of our project is to develop the ultimate form of multi-channel audio broadcasting to create this experience. We will develop and implement effective systems for recording, broadcasting and playback of 3D audio in three different scenarios: indi ....Broadcasting 3D Audio: Recording, Transmission, and Playback. With the current state of the art, a performance at the Sydney Opera House cannot be recorded and broadcast such that you can listen to it as if you are in the best seat of the house. The goal of our project is to develop the ultimate form of multi-channel audio broadcasting to create this experience. We will develop and implement effective systems for recording, broadcasting and playback of 3D audio in three different scenarios: individual headphone reproduction; small loudspeaker array reproduction; and large loudspeaker array reproduction. We will create optimal recording techniques and broadcasting software for each of these playback techniques.Read moreRead less
Temporal segmentation, leadership and cognition in musical improvisation and creativity. Improvisation is core to conversation and to creative and social emergence. This project investigates musical improvisation, in order to reveal constituent processes, using computational and cognitive approaches. Mechanisms for generating transitions in the temporal stream, and for asserting social power or position in it are assessed. Improvised material can be explored, modified, and developed in the creat ....Temporal segmentation, leadership and cognition in musical improvisation and creativity. Improvisation is core to conversation and to creative and social emergence. This project investigates musical improvisation, in order to reveal constituent processes, using computational and cognitive approaches. Mechanisms for generating transitions in the temporal stream, and for asserting social power or position in it are assessed. Improvised material can be explored, modified, and developed in the creative process, and the project investigates how this occurs and whether computers can facilitate the process. Such contributions can be critical to the development of innovation in research and cultural arenas in Australia. Read moreRead less
Computational Intelligence for Complex Structured Data. This project aims to use computational intelligence techniques to reliably learn adaptive natural human pointing and gestures to control an interface on a pseudo-3D display. Highly complex data with interconnections between elements is hard to visualise on screens. Most current tools are operated using point/click/drag on 2D screens. The physical technology to capture appropriate human behaviours exists already, but not the adaptive learnin ....Computational Intelligence for Complex Structured Data. This project aims to use computational intelligence techniques to reliably learn adaptive natural human pointing and gestures to control an interface on a pseudo-3D display. Highly complex data with interconnections between elements is hard to visualise on screens. Most current tools are operated using point/click/drag on 2D screens. The physical technology to capture appropriate human behaviours exists already, but not the adaptive learning of the syntax and semantics of individual gestures and actions, nor the multi-gesture information fusion required for understanding, which could significantly enhance efficiency, for example, in sorting through named entities in an investigation. All of this is done naturally by most human beings, using biological neural networks.Read moreRead less
Scalable Visual Analytics for Uncertain Dynamic Networks. Technological advances have provided a data deluge over the past few years, and have led to many large uncertain and dynamic network models. This includes terrorist networks, marketing networks, facebook networks, various biological networks, and software engineering structures. Human understanding of such networks is difficult. This project aims to provide new methods for visual analysis of large uncertain dynamic networks such as these. ....Scalable Visual Analytics for Uncertain Dynamic Networks. Technological advances have provided a data deluge over the past few years, and have led to many large uncertain and dynamic network models. This includes terrorist networks, marketing networks, facebook networks, various biological networks, and software engineering structures. Human understanding of such networks is difficult. This project aims to provide new methods for visual analysis of large uncertain dynamic networks such as these. The algorithms developed in the project will help security analysts to monitor illegal behaviour such as money laundering and terrorist activities, help biologists understand key biological systems, and help engineers to understand large software systems.Read moreRead less
Discovery Early Career Researcher Award - Grant ID: DE180101416
Funder
Australian Research Council
Funding Amount
$338,446.00
Summary
Broadening horizons: using curiosity to diversify behaviour. This project aims to explore how interactive systems can encourage their users to try new things. This is made possible by recent developments in artificial intelligence that can estimate what will make users curious. This project expects to generate new knowledge about how interactive technology can encourage diverse behaviour by stimulating curiosity. Expected outcomes include a framework for how to design interactive systems that en ....Broadening horizons: using curiosity to diversify behaviour. This project aims to explore how interactive systems can encourage their users to try new things. This is made possible by recent developments in artificial intelligence that can estimate what will make users curious. This project expects to generate new knowledge about how interactive technology can encourage diverse behaviour by stimulating curiosity. Expected outcomes include a framework for how to design interactive systems that encourage users to try new things, and a greater theoretical understanding of how to diversify user behaviour.Read moreRead less
Using 3D printing to improve access to graphics by vision-impaired people. This project aims to investigate the possible benefits of 3D printing for production of accessible materials for vision-impaired people. Currently tactile graphics are used to provide severely vision-impaired adults and children with access to graphical content used in education and in orientation and mobility training. This project is expected to clarify the kinds of graphics for which 3D prints are better suited than ta ....Using 3D printing to improve access to graphics by vision-impaired people. This project aims to investigate the possible benefits of 3D printing for production of accessible materials for vision-impaired people. Currently tactile graphics are used to provide severely vision-impaired adults and children with access to graphical content used in education and in orientation and mobility training. This project is expected to clarify the kinds of graphics for which 3D prints are better suited than tactile graphics, and to build capacity within the national accessible format provision sector for the production and use of 3D prints. Benefits will include increased educational opportunities and quality of life for Australians with severe vision impairment, through improved access to graphic materials used in education and orientation and mobility training.
Read moreRead less