Added depth: automated high level image interpretation. Humans are very good at understanding the world through imagery, but computers lack this fundamental capacity because they lack experience of what they might see. This project will provide this experience by combining the large volumes of imagery on the Internet with three dimensional information generated by humans for other purposes.
Hybrid optimisation for automatic large-scale video annotation. Optimization is the basis for solving many problems in Computer Vision, such as three-dimensional geometry recovery, image segmentation, scene labeling and object recognition. This project will develop new optimisation techniques and demonstrate their suitability for large-scale video annotation, which is key to visual data mining and scene understanding.
Solve it or Ignore it? The Challenge of Alignment Distortion and Creating Next Generation Automatic Facial Expression Detection. The last two decades have seen an escalating interest in automating the coding of facial expressions. Despite this keen interest, the promise of computer vision systems to accurately code facial expressions in natural circumstances remains elusive. Our interdisciplinary team will research a new paradigm to account for facial alignment distortion directly rather than ai ....Solve it or Ignore it? The Challenge of Alignment Distortion and Creating Next Generation Automatic Facial Expression Detection. The last two decades have seen an escalating interest in automating the coding of facial expressions. Despite this keen interest, the promise of computer vision systems to accurately code facial expressions in natural circumstances remains elusive. Our interdisciplinary team will research a new paradigm to account for facial alignment distortion directly rather than aiming to achieve invariance to it. The project will also research new data agnostic feature compaction capabilities to enable scalable learning on the world’s largest and challenging expression dataset available to us through international collaboration. Tackling these two major open problems will make accurate coding of facial expressions in natural environments achievable.Read moreRead less
Generic Content-based News Picture Retrieval with Local Invariant Features. Image Retrieval searches for images from large databases whose visual content meets the requirements submitted by users. Besides directly benefiting the Partner Organization, this project will enable more efficient access to large picture repositories in news agencies and publishers, digital libraries and film archives. It will make public use of visual information much more convenient and economical. It will help securi ....Generic Content-based News Picture Retrieval with Local Invariant Features. Image Retrieval searches for images from large databases whose visual content meets the requirements submitted by users. Besides directly benefiting the Partner Organization, this project will enable more efficient access to large picture repositories in news agencies and publishers, digital libraries and film archives. It will make public use of visual information much more convenient and economical. It will help security officers to effortlessly and accurately find particular scenes from the images generated by a large closed-circuit TV networks. Also, the developed technology can be applied to tele-education and e-commerce. New algorithms developed in this project will benefit the Australian and world scientific communities.Read moreRead less
Leveraging 3D computer vision for camera-based precise geo-localisation. This project aims to develop advanced 3D computer vision and image processing technology that can turn regular cameras into high-precision location-sensing devices. Spatial Location is a fundamental type of information of our physical world. Determining the precise location of people, vehicle, and mobile devices is essential for many critical applications. Outcomes of the project will enable a wide range of novel applicatio ....Leveraging 3D computer vision for camera-based precise geo-localisation. This project aims to develop advanced 3D computer vision and image processing technology that can turn regular cameras into high-precision location-sensing devices. Spatial Location is a fundamental type of information of our physical world. Determining the precise location of people, vehicle, and mobile devices is essential for many critical applications. Outcomes of the project will enable a wide range of novel applications of significant social, environmental and economic value, such as Location-Aware Service, Environment Monitoring, Augmented Reality, Autonomous Vehicle, and Rapid Emergency Response. The project will enhance Australia's international competitive advantage in forefront of ICT research and technology innovation.Read moreRead less
Structure-without-motion: large-scale 3D reconstruction from distributed and unorganised images. Vision-based 3D reconstruction is a frontier technology for a wide range of applications. This project will lead to novel 3D reconstruction methods and systems that are more efficient, more cost-effective and more accessible to ordinary user. The outcomes will directly contribute to National Research Priority Goal of smart information use.
Semantic Vectorisation: From Bitmaps to Intelligent Representations. The objective of this innovative project is to provide a solution to the open question of representing natural images by semantically rich vector graphics. The challenges are to identify key visual and temporal elements for images and videos, and efficiently decompose the visual data into semantic vector representations that are faithful to original data, compact and editable. The project aims to investigate new bitmap-to-vecto ....Semantic Vectorisation: From Bitmaps to Intelligent Representations. The objective of this innovative project is to provide a solution to the open question of representing natural images by semantically rich vector graphics. The challenges are to identify key visual and temporal elements for images and videos, and efficiently decompose the visual data into semantic vector representations that are faithful to original data, compact and editable. The project aims to investigate new bitmap-to-vector conversion methods. It is expected to develop a framework where semantic labels and hyperlinks can be embedded in visual data automatically. It hopes to pioneer the creation of a web of images where the links are on image/video regions. New image simplification, stylisation, and non-photorealistic rendering methods are expected to be provided.Read moreRead less
Omniscient face recognition for uncooperative subjects. The outcomes of this project will enable effective video surveillance technology to be developed for use by law enforcement and national security agencies. It will lead to reliable identification of humans at a distance by automatically detecting and recognising faces, for use in counter-terrorism surveillance and commercial robot-human interfaces.
Pattern Recognition and Scene Analysis via Machine Learning. We plan to use kernel methods, a novel machine learning technique, for computer vision problems, such as scene analysis and real time object recognition. Such capabilities are relevant for the design of intelligent and adaptive systems, suitable for complex real world environments. Expected outcomes are the design of efficient statistical tools which take the special nature of visual data into account (structure, decomposition, prior ....Pattern Recognition and Scene Analysis via Machine Learning. We plan to use kernel methods, a novel machine learning technique, for computer vision problems, such as scene analysis and real time object recognition. Such capabilities are relevant for the design of intelligent and adaptive systems, suitable for complex real world environments. Expected outcomes are the design of efficient statistical tools which take the special nature of visual data into account (structure, decomposition, prior knowledge of physical environments, etc.) and combine the advantages of feature based high-level vision methods with low-level machine learning techniques.
This proposal is part of a joint IST project with partners from the European Union.Read moreRead less
Computer Vision Optimization Problems Using Machine Learning. Computer Vision concerns itself with understanding the world through the analysis of images obtained by a video or still camera. An important application is tracking of people in video and modelling their movements. This has evident applications in security, sport and entertainment. By enabling the computer to capture the motion of a subject in a video, we may detect suspicious activity in security, analyze the motion (golf-swing, ....Computer Vision Optimization Problems Using Machine Learning. Computer Vision concerns itself with understanding the world through the analysis of images obtained by a video or still camera. An important application is tracking of people in video and modelling their movements. This has evident applications in security, sport and entertainment. By enabling the computer to capture the motion of a subject in a video, we may detect suspicious activity in security, analyze the motion (golf-swing, diving style) of a sports-person, or capture the motion of an actor for animation or game applications. Development of a reliable technology requires new optimization techniques, which will place Australia at the forefront of the application of such research, commercially and for the public benefit.Read moreRead less