Target detection in visual clutter. The interdisciplinary nature of the project will offer a stimulating environment for training a postdoctoral worker in the hot topic of computational neuroscience. While computationally expensive solutions to moving target detection in clutter have been implemented using conventional engineering, this project will offer insight into the efficiency of the biological brain (with benefit of millions of years of evolution towards compact, economical and optimal so ....Target detection in visual clutter. The interdisciplinary nature of the project will offer a stimulating environment for training a postdoctoral worker in the hot topic of computational neuroscience. While computationally expensive solutions to moving target detection in clutter have been implemented using conventional engineering, this project will offer insight into the efficiency of the biological brain (with benefit of millions of years of evolution towards compact, economical and optimal solutions). The results will assist development of efficient artificial intelligence. It will also assist our ongoing collaborations with defence partners to develop and apply algorithms in artificial vision systems. Read moreRead less
Learning to see in 3D. The project aims to endow machine vision with an ability we, as humans, use almost constantly: to judge 3D properties from a 2D image. This extremely useful ability will be applied to digital images to obtain 3D measurements and aid in automating tasks such as mining, surveying, medical diagnosis, and visual effects in movies.
Recognising and reconstructing objects in real time from a moving camera. This project will use a moving camera to estimate the three-dimensional shape and identity of objects and surfaces it can see. This ability, which we humans use all the time, has wide application in automation including driver assistance, exploring hazardous environments, robotics, remote collaboration, and the creation of three-dimensional models for entertainment.
Optimal Robust Fitting under the Framework of LP-Type Problems. The project aims to develop algorithms to support the development of robust and accurate computer vision systems. Real-world visual data (images, videos) is inherently noisy and outlier prone. To build computer vision systems that work reliably in the real world, it is necessary to ensure that the underlying algorithms are robust and efficient. The project aims to devise novel algorithms that can compute the best possible result giv ....Optimal Robust Fitting under the Framework of LP-Type Problems. The project aims to develop algorithms to support the development of robust and accurate computer vision systems. Real-world visual data (images, videos) is inherently noisy and outlier prone. To build computer vision systems that work reliably in the real world, it is necessary to ensure that the underlying algorithms are robust and efficient. The project aims to devise novel algorithms that can compute the best possible result given the input data in a short amount of time. The expected outcomes would support the construction of reliable and accurate computer vision-based systems, such as large-scale 3-D reconstruction from photo collections, self-driving cars and domestic robots.Read moreRead less
Australian Laureate Fellowships - Grant ID: FL130100102
Funder
Australian Research Council
Funding Amount
$3,179,946.00
Summary
Lifelong computer vision systems. This project will create a computer vision system that can produce a detailed environmental map in real time, turning standard video cameras into sensors that 'understand' a scene with basic semantic tools. This high-level sensing will unlock a wide range of applications for autonomous systems.
Whole image understanding by convolutions on graphs. This project seeks to develop technologies that will help computer vision interpret the whole visible scene, rather than just some of the objects therein. Existing automated methods for understanding images perform well at recognising specific objects in canonical poses, but the problem of whole image interpretation is far more challenging. Convolutional neural networks (CNN) have underpinned recent progress in object recognition, but whole-im ....Whole image understanding by convolutions on graphs. This project seeks to develop technologies that will help computer vision interpret the whole visible scene, rather than just some of the objects therein. Existing automated methods for understanding images perform well at recognising specific objects in canonical poses, but the problem of whole image interpretation is far more challenging. Convolutional neural networks (CNN) have underpinned recent progress in object recognition, but whole-image understanding cannot be tackled similarly because the number of possible combinations of objects is too large. The project thus proposes a graph-based generalisation of the CNN approach which allows scene structure to be learned explicitly. This would represent an important step towards providing computers with robust vision, allowing them to interact with their environment.Read moreRead less
Discovery Early Career Researcher Award - Grant ID: DE170101259
Funder
Australian Research Council
Funding Amount
$360,000.00
Summary
Zero-shot and few-shot learning with deep knowledge transfer. This project aims to develop few-shot and zero-shot learning, visual recognition techniques that can learn a visual concept with few or no visual examples. Visual recognition is a major component in Artificial Intelligence and used in cybernetic security, robotic vision and medical image analysis. This project will use deep learning to enable the zero/few-shot learning to use and model previously unexplored information, making zero/fe ....Zero-shot and few-shot learning with deep knowledge transfer. This project aims to develop few-shot and zero-shot learning, visual recognition techniques that can learn a visual concept with few or no visual examples. Visual recognition is a major component in Artificial Intelligence and used in cybernetic security, robotic vision and medical image analysis. This project will use deep learning to enable the zero/few-shot learning to use and model previously unexplored information, making zero/few-shot learning more practical, scalable and flexible. The project is expected to advance the applicability of visual recognition in many challenging scenarios and provide effective tools to analyse the online visual data for supporting Australia’s cybernetic security.Read moreRead less
Autocalibration without decimation. The insertion of computer generated characters into real footage, the removal of objects from video, and the recovery of 3-dimensional architectural or topographic models from photographs are amongst a growing number of processes used in industry which require highly accurate camera calibration. Autocalibration is thus a prerequisite for these and many other emerging image-based technologies. By developing expertise in this area, and particularly by enabling ....Autocalibration without decimation. The insertion of computer generated characters into real footage, the removal of objects from video, and the recovery of 3-dimensional architectural or topographic models from photographs are amongst a growing number of processes used in industry which require highly accurate camera calibration. Autocalibration is thus a prerequisite for these and many other emerging image-based technologies. By developing expertise in this area, and particularly by enabling more flexible and efficient means of autocalibration, we expect to provide Australian industry with a valuable improvement in the state of the art and a competitive edge in a number of important application areas.Read moreRead less
Combined shape and appearance descriptors for visual object recognition. The quantity of video generated each year is expanding rapidly. This increasing volume of visual information means that it is more likely that any particular event will be recorded, but that the footage will be harder to find. This applies to a collection of home videos as much as to television and movie footage. The object-recognition method to be developed has the potential to alleviate this situation, in which vast amou ....Combined shape and appearance descriptors for visual object recognition. The quantity of video generated each year is expanding rapidly. This increasing volume of visual information means that it is more likely that any particular event will be recorded, but that the footage will be harder to find. This applies to a collection of home videos as much as to television and movie footage. The object-recognition method to be developed has the potential to alleviate this situation, in which vast amounts of video data are available but have little value. Such an outcome would be a boon for Australian industry and offer a valuable export opportunity.Read moreRead less