Automatic video annotation by learning from web data. This project aims to study next-generation video annotation technologies to automatically tag raw videos using a huge set of semantic concepts. The project will study new domain adaptation schemes and frameworks in order to substantially improve video annotation performance. The resulting prototype system can be directly used by ordinary users worldwide to search their personal videos using textual queries. The system is also applicable to vi ....Automatic video annotation by learning from web data. This project aims to study next-generation video annotation technologies to automatically tag raw videos using a huge set of semantic concepts. The project will study new domain adaptation schemes and frameworks in order to substantially improve video annotation performance. The resulting prototype system can be directly used by ordinary users worldwide to search their personal videos using textual queries. The system is also applicable to video surveillance applications, which can enhance Australia’s homeland security.Read moreRead less
Australian Laureate Fellowships - Grant ID: FL170100117
Funder
Australian Research Council
Funding Amount
$3,208,192.00
Summary
On snapping up semantics of dynamic pixels from moving cameras. The project aims to develop a suite of original models and algorithms for processing and understanding videos captured by moving cameras, and to establish the mathematical foundations for deep learning-based computer vision to provide theoretical underpinnings. The project expects to generate new knowledge that will transform moving-camera computer vision with step-changes in visual quality enhancement, compression and acceleration ....On snapping up semantics of dynamic pixels from moving cameras. The project aims to develop a suite of original models and algorithms for processing and understanding videos captured by moving cameras, and to establish the mathematical foundations for deep learning-based computer vision to provide theoretical underpinnings. The project expects to generate new knowledge that will transform moving-camera computer vision with step-changes in visual quality enhancement, compression and acceleration technologies, and solutions for fundamental computer vision tasks. A new concept of feature complexity for measuring the discriminant and learnable abilities of features from deep models will also be defined. The outcomes of the project will be critical for enabling autonomous machines to perceive and interact with the environment.Read moreRead less
Assistive micro-navigation for vision impaired people. This project aims to develop novel algorithms to transform a simple camera into a smart sensor, that can enable a vision-impaired person to navigate freely and without additional aids in a crowded area. Such a smart sensor will be endowed with the capability to detect and locate obstacles, identify the walking path, recognise objects and traffic signs and convey step-by-step instructions to the user. The project outcomes are expected to impr ....Assistive micro-navigation for vision impaired people. This project aims to develop novel algorithms to transform a simple camera into a smart sensor, that can enable a vision-impaired person to navigate freely and without additional aids in a crowded area. Such a smart sensor will be endowed with the capability to detect and locate obstacles, identify the walking path, recognise objects and traffic signs and convey step-by-step instructions to the user. The project outcomes are expected to improve the well-being and accessibility to public areas for vision-impaired people and reduce physical access disparities for this disadvantaged and vulnerable group. Furthermore, technologies developed in this project can potentially be adapted for use in related special navigation applications such as road safety, self-driving vehicles, and autonomous robots.Read moreRead less
Towards in-vehicle situation awareness using visual and audio sensors. This project aims to characterise driver awareness, activity and interactions with other vehicle occupants using visual and audio cues from internally mounted sensors. Road accidents cost Australia an estimated $30 billion per year and tragic loss of thousands of lives, yet the vast majority of severe vehicle crashes are linked to driver fatigue or distraction. The expected project outcomes include advanced artificial intelli ....Towards in-vehicle situation awareness using visual and audio sensors. This project aims to characterise driver awareness, activity and interactions with other vehicle occupants using visual and audio cues from internally mounted sensors. Road accidents cost Australia an estimated $30 billion per year and tragic loss of thousands of lives, yet the vast majority of severe vehicle crashes are linked to driver fatigue or distraction. The expected project outcomes include advanced artificial intelligence to infer and predict dangerous driver and passenger behaviour. This has the potential to significantly benefit society by advancing autonomous driving capabilities and reducing driver-induced accidents and fatalities, ensuring that every driver, passenger and pedestrian arrives home safely at the end of each day.Read moreRead less
Deep visual understanding: learning to see in an unruly world. Deep Learning has achieved incredible success at an astonishing variety of Computer Vision tasks recently. This project aims to convey this success into the challenging domain of high-level image-based reasoning. It will extend deep learning to achieve flexible semantic reasoning about the content of images based on information gleaned from the huge volumes of data available on the Internet. The project expects to overcome one of the ....Deep visual understanding: learning to see in an unruly world. Deep Learning has achieved incredible success at an astonishing variety of Computer Vision tasks recently. This project aims to convey this success into the challenging domain of high-level image-based reasoning. It will extend deep learning to achieve flexible semantic reasoning about the content of images based on information gleaned from the huge volumes of data available on the Internet. The project expects to overcome one of the primary limitations of deep learning and will greatly increase its practical application to a range of industrial, cultural or health settings.Read moreRead less
Discovery Early Career Researcher Award - Grant ID: DE190100539
Funder
Australian Research Council
Funding Amount
$408,000.00
Summary
Towards conversational vision-based Artificial Intelligence. This project aims to develop a novel learning framework, Vision-Ask-Answer-Act (V3A). This framework will allow a machine to perform a sequence of actions via a conversation with human users, based on intricate processing of not just visual input, but human-computer verbal exchanges. Artificial intelligence has great potential as a tool for economic productivity and daily tasks. Applications in cars and assistant robots, still in their ....Towards conversational vision-based Artificial Intelligence. This project aims to develop a novel learning framework, Vision-Ask-Answer-Act (V3A). This framework will allow a machine to perform a sequence of actions via a conversation with human users, based on intricate processing of not just visual input, but human-computer verbal exchanges. Artificial intelligence has great potential as a tool for economic productivity and daily tasks. Applications in cars and assistant robots, still in their early days, typically require significant expertise to use effectively. The outcomes of this project will push the boundary of vision-language research to produce a conversational intelligent agent that can be easily used in common situations across industry, transport, the medical sector, and at home.Read moreRead less
Discovery Early Career Researcher Award - Grant ID: DE210101624
Funder
Australian Research Council
Funding Amount
$410,775.00
Summary
Causal Discovery from Unstructured Data. This Project aims to enable machines to discover causal relations from various kinds of unstructured data, such as images, text files, and sensor data. The project expects to promote causal revolution of data-centric intelligence and science – construct machines that can communicate in the language of cause and effect and answer ‘why’ questions by inferring from unstructured data. Expected outcomes of this project include theoretical foundations for causa ....Causal Discovery from Unstructured Data. This Project aims to enable machines to discover causal relations from various kinds of unstructured data, such as images, text files, and sensor data. The project expects to promote causal revolution of data-centric intelligence and science – construct machines that can communicate in the language of cause and effect and answer ‘why’ questions by inferring from unstructured data. Expected outcomes of this project include theoretical foundations for causal discovery from unstructured data and practical algorithms that drive intelligent machines to make rational decisions in real-world scenarios. This should benefit society and the economy nationally and internationally through the applications of artificial intelligence and data science. Read moreRead less
Transfer Learning Handling Causally Bilateral Shift . Transfer learning is a core step for machines to transfer knowledge. This Project aims to equip machines with the ability to harness complex causal structures for transfer learning. The Project expects to produce the next great step for artificial intelligence – the potential to explore and exploit complex causal information to better understand, reason, and trust transfer learning. Expected outcomes of this Project include theoretical founda ....Transfer Learning Handling Causally Bilateral Shift . Transfer learning is a core step for machines to transfer knowledge. This Project aims to equip machines with the ability to harness complex causal structures for transfer learning. The Project expects to produce the next great step for artificial intelligence – the potential to explore and exploit complex causal information to better understand, reason, and trust transfer learning. Expected outcomes of this Project include theoretical foundations for transfer learning utilising causality and the next generation of intelligent systems to accommodate data with complex causal structures. This should benefit science, society, and the economy nationally and internationally through the applications to analysing their corresponding complex data.Read moreRead less
Discovery Early Career Researcher Award - Grant ID: DE200101283
Funder
Australian Research Council
Funding Amount
$400,998.00
Summary
Data synthesis to quantitatively understand and improve vision systems. This project aims to build high-fidelity synthetic data, to understand how a machine vision system reacts to environmental factors and consequently improve the ability of the system to generalise in the real world. This project expects to generate new knowledge in the area of computer vision using innovative techniques of data synthesis, analysis, and domain adaptation. The expected outcomes include new scientific discoverie ....Data synthesis to quantitatively understand and improve vision systems. This project aims to build high-fidelity synthetic data, to understand how a machine vision system reacts to environmental factors and consequently improve the ability of the system to generalise in the real world. This project expects to generate new knowledge in the area of computer vision using innovative techniques of data synthesis, analysis, and domain adaptation. The expected outcomes include new scientific discoveries and domain adaptation algorithms derived from synthetic data for real-world applications. The benefits are expected to be widespread across sectors such as transportation, security, and manufacturing, including safer robotic navigation, defect detection, and smart video surveillance to improve community safety.Read moreRead less
Discovery Early Career Researcher Award - Grant ID: DE220101390
Funder
Australian Research Council
Funding Amount
$402,900.00
Summary
Towards Human-like Machine Perception for Embodied AI. This project aims to investigate human-like visual perception, whereby AI machines can see and interpret the world like a human. The expected outputs will empower AI machines with the abilities of human-centered visual recognition and annotation-efficient learning through a set of deep learning techniques, and the ability to actively gather visual information through a reinforcement learning methodology (for decision support). This research ....Towards Human-like Machine Perception for Embodied AI. This project aims to investigate human-like visual perception, whereby AI machines can see and interpret the world like a human. The expected outputs will empower AI machines with the abilities of human-centered visual recognition and annotation-efficient learning through a set of deep learning techniques, and the ability to actively gather visual information through a reinforcement learning methodology (for decision support). This research is fundamental to the creation of embodied AI machines, which are expected to provide assistance to humans in industry, education and health. It thus will indicate immediate applications embracing autonomous vehicles and domestic robotics, providing scientific, social and economic benefits for Australia.Read moreRead less