Taming media for the masses: Computational frameworks for intelligent digital media capture, management, and sharing. The core issues tackled in this project are learning, recognition and application of semantics in multimedia data and the context of its creation and use - a foundational issue in pattern recognition with many applications. The project is part of the Institute for Multi-sensor Processing and Content Analysis whose aim is to tackle technical issues in large scale pattern recogniti ....Taming media for the masses: Computational frameworks for intelligent digital media capture, management, and sharing. The core issues tackled in this project are learning, recognition and application of semantics in multimedia data and the context of its creation and use - a foundational issue in pattern recognition with many applications. The project is part of the Institute for Multi-sensor Processing and Content Analysis whose aim is to tackle technical issues in large scale pattern recognition. By developing scalable and robust techniques to extract information from large scale multi-modal data, the applications include large scale surveillance systems from multi-modal data (e.g. airport security, smart homes for the aged), context-aware devices, and the next generation of media creation and repurposing tools - a fast-growing sector of the economy.Read moreRead less
Image Based Visual Servo Control of Dynamic Under-Actuated Systems. The project builds on earlier work on visual servo control of under-actuated rigid body dynamics to develop and implement sophisticated and robust image based visual servo control for a wide class of under-actuated and fully actuated dynamic systems. The scope of the project extends far beyond basic testing of preliminary results to address key technical issues facing visual servo control algorithms at this time. The project i ....Image Based Visual Servo Control of Dynamic Under-Actuated Systems. The project builds on earlier work on visual servo control of under-actuated rigid body dynamics to develop and implement sophisticated and robust image based visual servo control for a wide class of under-actuated and fully actuated dynamic systems. The scope of the project extends far beyond basic testing of preliminary results to address key technical issues facing visual servo control algorithms at this time. The project is strongly motivated by the host of emerging applications for visual servo control of unmanned aerial vehicles. The experimental program within the project is based on control of a four rotor VTOL `hoverbot'.Read moreRead less
Solve it or Ignore it? The Challenge of Alignment Distortion and Creating Next Generation Automatic Facial Expression Detection. The last two decades have seen an escalating interest in automating the coding of facial expressions. Despite this keen interest, the promise of computer vision systems to accurately code facial expressions in natural circumstances remains elusive. Our interdisciplinary team will research a new paradigm to account for facial alignment distortion directly rather than ai ....Solve it or Ignore it? The Challenge of Alignment Distortion and Creating Next Generation Automatic Facial Expression Detection. The last two decades have seen an escalating interest in automating the coding of facial expressions. Despite this keen interest, the promise of computer vision systems to accurately code facial expressions in natural circumstances remains elusive. Our interdisciplinary team will research a new paradigm to account for facial alignment distortion directly rather than aiming to achieve invariance to it. The project will also research new data agnostic feature compaction capabilities to enable scalable learning on the world’s largest and challenging expression dataset available to us through international collaboration. Tackling these two major open problems will make accurate coding of facial expressions in natural environments achievable.Read moreRead less
Autocalibration without decimation. The insertion of computer generated characters into real footage, the removal of objects from video, and the recovery of 3-dimensional architectural or topographic models from photographs are amongst a growing number of processes used in industry which require highly accurate camera calibration. Autocalibration is thus a prerequisite for these and many other emerging image-based technologies. By developing expertise in this area, and particularly by enabling ....Autocalibration without decimation. The insertion of computer generated characters into real footage, the removal of objects from video, and the recovery of 3-dimensional architectural or topographic models from photographs are amongst a growing number of processes used in industry which require highly accurate camera calibration. Autocalibration is thus a prerequisite for these and many other emerging image-based technologies. By developing expertise in this area, and particularly by enabling more flexible and efficient means of autocalibration, we expect to provide Australian industry with a valuable improvement in the state of the art and a competitive edge in a number of important application areas.Read moreRead less
Human Cues for Robot Navigation. The world has many navigational cues for the benefit of humans: sign posts, maps and the wealth of information on the internet. Yet, to date, robotic navigation has made little use of this abundant symbolic information as a resource. This project will develop a robot navigation system that can navigate using information beyond the robot's range sensors by incorporating knowledge gained by reading room labels, following human route directions or interpreting maps ....Human Cues for Robot Navigation. The world has many navigational cues for the benefit of humans: sign posts, maps and the wealth of information on the internet. Yet, to date, robotic navigation has made little use of this abundant symbolic information as a resource. This project will develop a robot navigation system that can navigate using information beyond the robot's range sensors by incorporating knowledge gained by reading room labels, following human route directions or interpreting maps found on the web. This project will demonstrate the robot's navigation ability by comparing its performance with a human as it learns to find its way around campus by asking for directions, reading signs and maps, and searching the internet for clues.Read moreRead less
Omniscient face recognition for uncooperative subjects. The outcomes of this project will enable effective video surveillance technology to be developed for use by law enforcement and national security agencies. It will lead to reliable identification of humans at a distance by automatically detecting and recognising faces, for use in counter-terrorism surveillance and commercial robot-human interfaces.
Lifelong robotic navigation using visual perception. Service robots are becoming a major part of our working and personal environments, in much the same way as personal computers already have. This project will develop new methods of practical and useful robot navigation that will enable Australia's industries and services to remain internationally competitive.
Active multispectral computer vision for defence and security. This project will develop new techniques to extract intelligent information from multispectral images in the visible and near infra-red spectrum. It will enable computers to automatically recognise objects, faces and human actions with unprecedented accuracy.
One shot three-dimensional reconstruction of human anatomy and motion. This project aims to accurately estimate the three-dimensional (3D) structure of non-rigid human anatomy. Although computer vision has advanced the area of structure from motion, current approaches cannot accurately and densely reconstruct people. This project will create dense 3D reconstruction techniques which can manage non-rigid human anatomy using only two-dimensional images from medical imaging devices (X-rays and video ....One shot three-dimensional reconstruction of human anatomy and motion. This project aims to accurately estimate the three-dimensional (3D) structure of non-rigid human anatomy. Although computer vision has advanced the area of structure from motion, current approaches cannot accurately and densely reconstruct people. This project will create dense 3D reconstruction techniques which can manage non-rigid human anatomy using only two-dimensional images from medical imaging devices (X-rays and video sequences) in one shot – from a single image. This approach is expected to be used for the 3D visualisation of x-rays such as in clinical practice, human pose estimation, and 3D planning for orthopaedic minimally invasive surgery.Read moreRead less
A Novel System for Surveillance of Moving Objects. Surveillance of moving objects is critical in numerous applications such as detection and recognition of motor vehicles. It is important for detection to be fast and accurate with low cost. In this project, we aim to implement a surveillance system consisting of an efficient algorithm on a PC network with a camera. Our detection algorithm will be achieved with an advanced and computationally powerful image representation for fast computation. It ....A Novel System for Surveillance of Moving Objects. Surveillance of moving objects is critical in numerous applications such as detection and recognition of motor vehicles. It is important for detection to be fast and accurate with low cost. In this project, we aim to implement a surveillance system consisting of an efficient algorithm on a PC network with a camera. Our detection algorithm will be achieved with an advanced and computationally powerful image representation for fast computation. Its accuracy will be enhanced by adapting a well recognized theory for fast removal of image noise. Our implementation on the PC network will provide a flexible and extensible platform for parallel computing to further reduce detection time while keeping costs low.Read moreRead less