ORCID Profile
0000-0002-3923-3499
Current Organisation
University of Sydney
The information on this page has been harvested from data sources that may not be up to date.
In Research Link Australia (RLA), "Research Topics" refer to ANZSRC FOR and SEO codes. These topics are either sourced from ANZSRC FOR and SEO codes listed in researchers' related grants or generated by a large language model (LLM) based on their publications.
Pattern Recognition and Data Mining | Computer Hardware | Computational neuroscience (incl. mathematical neuroscience and theoretical neuroscience) | Artificial Intelligence and Image Processing | Neural networks | Microelectronics | Electronics sensors and digital hardware | Distributed and Grid Systems | Processor Architectures
Market-Based Mechanisms | Industry Costs and Structure | Technological and Organisational Innovation | Expanding Knowledge in the Information and Computing Sciences
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 04-2018
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 2022
Publisher: IEEE
Date: 12-2010
Publisher: IEEE
Date: 2008
Publisher: Springer International Publishing
Date: 2016
Publisher: The Optical Society
Date: 31-01-2017
DOI: 10.1364/AO.56.001113
Publisher: IEEE
Date: 12-2009
Publisher: IEEE
Date: 2005
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 09-2007
Publisher: IEEE
Date: 05-2017
Publisher: IEEE
Date: 04-2008
DOI: 10.1109/FCCM.2008.19
Publisher: IEEE
Date: 07-2016
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 06-2006
DOI: 10.1109/TC.2006.81
Publisher: IEEE
Date: 09-2011
DOI: 10.1109/FPL.2011.62
Publisher: Association for Computing Machinery (ACM)
Date: 22-03-2017
DOI: 10.1145/2996468
Abstract: A summary of contributions made by significant papers from the first 25 years of the Field-Programmable Logic and Applications conference (FPL) is presented. The 27 papers chosen represent those which have most strongly influenced theory and practice in the field.
Publisher: IEEE
Date: 04-2006
DOI: 10.1109/FCCM.2006.71
Publisher: IEEE
Date: 10-2019
Publisher: IEEE
Date: 2006
Publisher: IEEE
Date: 2006
Publisher: IEEE
Date: 12-2010
Publisher: IEEE
Date: 2007
Publisher: OSA
Date: 2018
Publisher: ACM
Date: 25-01-2017
Publisher: IEEE
Date: 2010
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 11-2022
Publisher: Foundation for Open Access Statistics
Date: 2005
Publisher: Elsevier BV
Date: 04-2016
DOI: 10.1016/J.COMPMEDIMAG.2016.01.001
Abstract: The automatic annotation of medical images is a prerequisite for building comprehensive semantic archives that can be used to enhance evidence-based diagnosis, physician education, and biomedical research. Annotation also has important applications in the automatic generation of structured radiology reports. Much of the prior research work has focused on annotating images with properties such as the modality of the image, or the biological system or body region being imaged. However, many challenges remain for the annotation of high-level semantic content in medical images (e.g., presence of calcification, vessel obstruction, etc.) due to the difficulty in discovering relationships and associations between low-level image features and high-level semantic concepts. This difficulty is further compounded by the lack of labelled training data. In this paper, we present a method for the automatic semantic annotation of medical images that leverages techniques from content-based image retrieval (CBIR). CBIR is a well-established image search technology that uses quantifiable low-level image features to represent the high-level semantic content depicted in those images. Our method extends CBIR techniques to identify or retrieve a collection of labelled images that have similar low-level features and then uses this collection to determine the best high-level semantic annotations. We demonstrate our annotation method using weighted nearest-neighbour retrieval and multi-class classification to show that our approach is viable regardless of the underlying retrieval strategy. We experimentally compared our method with several well-established baseline techniques (classification and regression) and showed that our method achieved the highest accuracy in the annotation of liver computed tomography (CT) images.
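As a toy illustration of the annotation-by-retrieval idea (not the paper's implementation), labels can be assigned by a distance-weighted vote over the most similar labelled images; the feature vectors, labels, and `annotate` helper below are all invented for the example:

```python
import numpy as np

def annotate(query, train_feats, train_labels, k=3):
    """Sketch of CBIR-style annotation: retrieve the k labelled images
    with the most similar low-level features and take a distance-weighted
    vote over their high-level labels."""
    d = np.linalg.norm(train_feats - query, axis=1)   # feature-space distances
    idx = np.argsort(d)[:k]                           # k nearest labelled images
    w = 1.0 / (d[idx] + 1e-9)                         # closer neighbours vote more
    votes = {}
    for i, wi in zip(idx, w):
        votes[train_labels[i]] = votes.get(train_labels[i], 0.0) + wi
    return max(votes, key=votes.get)

feats = np.array([[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]])
labels = ["normal", "normal", "calcification", "calcification"]
print(annotate(np.array([0.05, 0.02]), feats, labels))
```

The vote makes the approach robust to a single mislabelled neighbour, which is the property the paper exploits to cope with scarce labelled training data.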
Publisher: Springer Berlin Heidelberg
Date: 2003
Publisher: IEEE
Date: 2006
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 1995
DOI: 10.1109/72.471380
Abstract: The design, implementation, and operation of a low-power multilayer perceptron chip (Kakadu) in the framework of a cardiac arrhythmia classification system is presented in this paper. This classifier, called MATIC, makes timing decisions using a decision tree, and a neural network is used to identify heartbeats with abnormal morphologies. This classifier was designed to be suitable for use in implantable devices and a VLSI (very large scale integration) neural-network chip (Kakadu) was designed so that the computationally expensive neural-network algorithm can be implemented with low power consumption. Kakadu implements a (10,6,4) perceptron and has a typical power consumption of tens of microwatts. When used with the arrhythmia classification system, the chip can operate with an average power consumption of less than 25 nW.
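For readers unfamiliar with the notation, a (10,6,4) perceptron has 10 inputs, 6 hidden units, and 4 outputs. A minimal software forward-pass sketch with placeholder random weights (the Kakadu chip itself is analogue VLSI hardware, and its trained weights are not reproduced here):

```python
import numpy as np

rng = np.random.default_rng(0)

def mlp_forward(x, w1, b1, w2, b2):
    """Forward pass of a (10, 6, 4) multilayer perceptron of the shape
    described for the Kakadu chip; weights are random placeholders,
    not the trained arrhythmia classifier."""
    h = np.tanh(x @ w1 + b1)        # 6 hidden units
    return np.tanh(h @ w2 + b2)     # 4 outputs, e.g. one per beat class

w1, b1 = rng.normal(size=(10, 6)), np.zeros(6)
w2, b2 = rng.normal(size=(6, 4)), np.zeros(4)
y = mlp_forward(rng.normal(size=10), w1, b1, w2, b2)
print(y.shape)   # (4,)
```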
Publisher: IEEE
Date: 2005
Publisher: IEEE
Date: 05-2020
Publisher: IEEE
Date: 08-2010
DOI: 10.1109/FPL.2010.89
Publisher: IEEE
Date: 05-2014
DOI: 10.1109/FCCM.2014.46
Publisher: IEEE
Date: 05-2015
DOI: 10.1109/FCCM.2015.11
Publisher: Imperial College Press
Date: 26-02-2015
Publisher: IEEE
Date: 09-2015
Publisher: Wiley
Date: 09-1992
DOI: 10.1111/J.1540-8159.1992.TB03142.X
Abstract: The use of an additional atrial sensing electrode together with a morphology recognition algorithm provides a significant improvement in classification performance over the current rate based algorithms used in implantable cardioverter defibrillator (ICD) devices. The classification system, called morphology and timing intracardiac classifier (MATIC), follows a classification process similar to that used by cardiologists. Timing between the atrial and ventricular channels is examined using a decision tree and forms the primary criterion for arrhythmia classification. A neural network based morphology classifier is used for cases such as ventricular tachycardia with 1:1 retrograde conduction where timing alone cannot make a reliable decision. MATIC achieves 99.6% correct classification on a database of intracardiac electrogram (ICEG) signals containing 12,483 QRS complexes recorded from 67 patients during electrophysiological studies. Arrhythmias in this database include sinus tachycardia, normal sinus rhythm, normal sinus rhythm with bundle branch block, sinus tachycardia with bundle branch block, atrial fibrillation (AF), various supraventricular tachycardias, ventricular tachycardia, ventricular tachycardia with 1:1 retrograde conduction, and ventricular fibrillation. Within these arrhythmias, there were numerous ventricular ectopic beats, fusion beats, noise, and other artifacts. MATIC addresses the classification problem from start to finish, inputs being raw intracardiac electrogram signals and the outputs being the recommended ICD therapy. Results achieved with MATIC were compared with a classifier used in the Telectronics Guardian ATP 4210, which achieved 75.9% correct classification on the same database. MATIC is simple and efficient, making it suitable for use in a low power implantable device.
Publisher: IEEE
Date: 2008
Publisher: IEEE
Date: 08-2009
Publisher: Walter de Gruyter GmbH
Date: 2000
Publisher: Association for Computing Machinery (ACM)
Date: 22-06-2023
DOI: 10.1145/3567429
Abstract: The spectral correlation density (SCD) is an important tool in cyclostationary signal detection and classification. Even using efficient techniques based on the fast Fourier transform (FFT), real-time implementations are challenging because of the high computational complexity. A key dimension for computational optimization lies in minimizing the wordlength employed. In this article, we analyze the relationship between wordlength and the signal-to-quantization-noise ratio (SQNR) in fixed-point implementations of the SCD function. A canonical SCD estimation algorithm, the FFT accumulation method (FAM) using fixed-point arithmetic, is studied. We derive closed-form expressions for SQNR and compare them at wordlengths ranging from 14 to 26 bits. The differences between the calculated SQNR and bit-exact simulations are less than 1 dB. Furthermore, an HLS-based FPGA design is implemented on a Xilinx Zynq UltraScale+ XCZU28DR-2FFVG1517E RFSoC. Using less than 25% of the logic fabric on the device, it consumes 7.7 W total on-chip power and has a power efficiency of 12.4 GOPS/W, which is an order of magnitude improvement over an Nvidia Tesla K40 graphics processing unit (GPU) implementation. In terms of throughput, it achieves 50 MS/sec, which is a speedup of 1.6 over a recent optimized FPGA implementation.
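The wordlength/SQNR trade-off the abstract studies can be sketched numerically: quantize a full-scale signal to a fixed-point grid and measure the resulting SQNR, which grows by roughly 6 dB per extra bit. This is a generic illustration, not the paper's closed-form analysis of the FAM datapath:

```python
import numpy as np

def sqnr_db(x, bits):
    """Quantize x (assumed to lie in [-1, 1)) to a signed fixed-point
    grid with `bits` total bits and return the signal-to-quantization-
    noise ratio in dB."""
    scale = 2.0 ** (bits - 1)
    xq = np.round(x * scale) / scale     # round-to-nearest quantization
    noise = x - xq
    return 10 * np.log10(np.mean(x ** 2) / np.mean(noise ** 2))

rng = np.random.default_rng(0)
x = rng.uniform(-1, 1, 100_000)
for b in (14, 20, 26):                   # the wordlength range studied
    print(b, round(sqnr_db(x, b), 1))    # roughly 6 dB per extra bit
```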
Publisher: IEEE
Date: 12-2013
Publisher: Institution of Engineering and Technology (IET)
Date: 1999
DOI: 10.1049/EL:19991132
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 12-2011
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 10-2002
Publisher: Wiley
Date: 04-01-2017
DOI: 10.1111/COIN.12083
Publisher: IEEE
Date: 10-2006
Publisher: Association for Computing Machinery (ACM)
Date: 21-06-2023
DOI: 10.1145/3568992
Abstract: Machine learning ensembles combine multiple base models to produce a more accurate output. They can be applied to a range of machine learning problems, including anomaly detection. In this article, we investigate how to maximize the composability and scalability of an FPGA-based streaming ensemble anomaly detector (fSEAD). To achieve this, we propose a flexible computing architecture consisting of multiple partially reconfigurable regions, pblocks, which each implement anomaly detectors. Our proof-of-concept design supports three state-of-the-art anomaly detection algorithms: Loda, RS-Hash, and xStream. Each algorithm is scalable, meaning multiple instances can be placed within a pblock to improve performance. Moreover, fSEAD is implemented using High-level synthesis (HLS), meaning further custom anomaly detectors can be supported. Pblocks are interconnected via an AXI-switch, enabling them to be composed in an arbitrary fashion before combining and merging results at runtime to create an ensemble that maximizes the use of FPGA resources and accuracy. Through utilizing reconfigurable Dynamic Function eXchange (DFX), the detector can be modified at runtime to adapt to changing environmental conditions. We compare fSEAD to an equivalent central processing unit (CPU) implementation using four standard datasets, with speedups ranging from 3× to 8×.
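Independently of the FPGA architecture, the score-merging idea behind an anomaly-detection ensemble can be illustrated in software; the min-max normalisation and the two toy detectors below are invented for the example and are not drawn from fSEAD:

```python
import numpy as np

def ensemble_scores(detectors, X):
    """Average the min-max-normalised anomaly scores of several base
    detectors -- a software stand-in for merging per-detector results
    into one ensemble score."""
    combined = []
    for score_fn in detectors:
        s = np.array([score_fn(x) for x in X], dtype=float)
        s = (s - s.min()) / (s.max() - s.min() + 1e-12)  # common [0, 1] scale
        combined.append(s)
    return np.mean(combined, axis=0)

X = np.array([0.1, 0.2, 0.15, 5.0])        # last point is the anomaly
det_mean = lambda x: abs(x - X.mean())     # distance from the sample mean
det_mag = lambda x: abs(x)                 # raw magnitude
scores = ensemble_scores([det_mean, det_mag], X)
print(scores.argmax())                     # index of the outlier, 3
```

Normalising before averaging matters because base detectors produce scores on incompatible scales, which is also why the hardware design merges results only after each pblock has finished scoring.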
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 10-2017
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 08-2017
Publisher: Springer Science and Business Media LLC
Date: 08-11-2008
Publisher: Institution of Engineering and Technology (IET)
Date: 2007
Publisher: Springer Berlin Heidelberg
Date: 1999
Publisher: IEEE
Date: 2005
Publisher: IEEE
Date: 2005
Publisher: Springer Berlin Heidelberg
Date: 2004
Publisher: IEEE
Date: 04-2015
Publisher: IEEE
Date: 2008
Publisher: IEEE
Date: 2008
Publisher: IEEE
Date: 12-2013
Publisher: Springer Berlin Heidelberg
Date: 2002
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 2007
DOI: 10.1109/MPRV.2007.4
Publisher: ACM
Date: 15-02-2018
Publisher: Springer Singapore
Date: 2015
Publisher: IEEE
Date: 09-2011
DOI: 10.1109/FPL.2011.105
Publisher: SAGE Publications
Date: 2012
DOI: 10.1255/JNIRS.975
Abstract: Optical coherence tomography (OCT) is a technique that is able to provide cross section views of tissue layers. This fast and non-invasive method is widely used in clinical applications for the diagnosis and treatment of certain diseases. Although conventional OCT is derived from the theory of interferometric imaging, emerging developments, including spectroscopic OCT and related techniques such as dual-band OCT and Raman spectroscopy–OCT, have resulted in significantly improved clinical capabilities for observing the tissue layers through enhanced tissue definition, image resolution, image contrast and scanning speed. This paper reviews the state-of-the-art developments of OCT. It starts with a general introduction of conventional interferometric OCT imaging methods including the time-domain and frequency-domain techniques. The second section explores the advances introduced from spectroscopy techniques in OCT, especially with spectroscopic OCT, dual-band OCT and Raman spectroscopy combined OCT. The final section discusses the current challenges in the application of approaches based on computer-aided diagnosis (CAD) for retinal imaging, for example automated segmentation of tissue layers and tracking disease progression. This task is currently limited by the quality of the recorded data from OCT systems but will be improved by adopting spectroscopic techniques. Finally, we analyse and discuss the improvements that are expected in retinal CAD from the adoption of newly emerging near infrared spectroscopy OCT at multiple wavelengths.
Publisher: ACM
Date: 15-02-2018
Publisher: IEEE Comput. Soc. Press
Date: 1993
Publisher: Acoustical Society of America (ASA)
Date: 2000
DOI: 10.1121/1.428350
Abstract: A computational model of auditory localization resulting in performance similar to humans is reported. The model incorporates both the monaural and binaural cues available to a human for sound localization. Essential elements used in the simulation of the processes of auditory cue generation and encoding by the nervous system include measured head-related transfer functions (HRTFs), minimum audible field (MAF), and the Patterson–Holdsworth cochlear model. A two-layer feed-forward back-propagation artificial neural network (ANN) was trained to transform the localization cues to a two-dimensional map that gives the direction of the sound source. The model results were compared with (i) the localization performance of the human listener who provided the HRTFs for the model and (ii) the localization performance of a group of 19 other human listeners. The localization accuracy and front–back confusion error rates exhibited by the model were similar to both the single listener and the group results. This suggests that the simulation of the cue generation and extraction processes as well as the model parameters were reasonable approximations to the overall biological processes. The amplitude resolution of the monaural spectral cues was varied and the influence on the model’s performance was determined. The model with 128 cochlear channels required an amplitude resolution of approximately 20 discrete levels for encoding the spectral cue to deliver similar localization performance to the group of human listeners.
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 02-2001
DOI: 10.1109/92.920833
Publisher: IEEE Comput. Soc
Date: 2000
Publisher: Springer Science and Business Media LLC
Date: 21-03-2016
DOI: 10.1038/NCOMMS10853
Abstract: It is a fundamental challenge in quantum optics to deterministically generate indistinguishable single photons through non-deterministic nonlinear optical processes, due to the intrinsic coupling of single- and multi-photon-generation probabilities in these processes. Actively multiplexing photons generated in many temporal modes can decouple these probabilities, but key issues are to minimize resource requirements to allow scalability, and to ensure indistinguishability of the generated photons. Here we demonstrate the multiplexing of photons from four temporal modes solely using fibre-integrated optics and off-the-shelf electronic components. We show a 100% enhancement to the single-photon output probability without introducing additional multi-photon noise. Photon indistinguishability is confirmed by a fourfold Hong–Ou–Mandel quantum interference with a 91±16% visibility after subtracting multi-photon noise due to high pump power. Our demonstration paves the way for scalable multiplexing of many non-deterministic photon sources to a single near-deterministic source, which will be of benefit to future quantum photonic technologies.
Publisher: IEEE
Date: 2006
Publisher: IEEE Comput. Soc
Date: 2000
Publisher: IEEE
Date: 04-2009
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 07-2017
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 2020
Publisher: IEEE
Date: 04-2015
Publisher: IEEE
Date: 12-2016
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 03-2013
Publisher: Association for Computing Machinery (ACM)
Date: 16-04-2015
DOI: 10.1145/2665073
Abstract: Runtime analysis provides an effective method for measuring the sensitivity of programs to rounding errors. To date, implementations have required significant changes to source code, detracting from their widespread application. In this work, we present an open source system that automates the quantitative analysis of floating point rounding errors through the use of C-based source-to-source compilation and a Monte Carlo arithmetic library. We demonstrate its application to the comparison of algorithms, detection of catastrophic cancellation, and determination of whether single precision floating point provides sufficient accuracy for a given application. Methods for obtaining quantifiable measurements of sensitivity to rounding error are also detailed.
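The core idea the abstract describes can be caricatured in a few lines: evaluate a computation many times under small random relative perturbations and compare the spread of the results. This is a hedged plain-Python sketch, not the paper's C-based source-to-source system; the `noisy` helper and the epsilon value are illustrative choices:

```python
import random
import statistics

random.seed(0)

def noisy(x, eps=2 ** -24):
    """Return x with a random relative perturbation of order eps,
    emulating the random rounding of Monte Carlo arithmetic."""
    return x * (1 + random.uniform(-eps, eps))

def relative_spread(f, trials=2000):
    """Evaluate f many times under random perturbation; a large spread
    relative to the mean flags sensitivity to rounding error."""
    samples = [f(noisy) for _ in range(trials)]
    m = statistics.mean(samples)
    return statistics.stdev(samples) / abs(m)

stable = lambda n: n(1e-8)                      # benign computation
cancel = lambda n: (n(1.0) + n(1e-8)) - n(1.0)  # catastrophic cancellation
print(relative_spread(stable) < relative_spread(cancel))
```

The cancelling computation loses almost all significant digits to the perturbation, so its relative spread is many orders of magnitude larger, which is exactly the signal the runtime analysis looks for.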
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 10-2005
DOI: 10.1109/MC.2005.337
Publisher: IEEE
Date: 2004
DOI: 10.1109/FCCM.2004.14
Publisher: IEEE Comput. Soc
Date: 2002
Publisher: IOP Publishing
Date: 24-04-2012
DOI: 10.1088/0967-3334/33/5/817
Abstract: Electrode contact impedance is a crucial factor in physiological measurements and can be an accuracy-limiting factor when performing electroencephalography and electrical impedance tomography. In this work, standard flat electrodes and micromachined multipoint spiked electrodes are characterized with a finite-element method electromagnetic solver and the dependence of the contact impedance on geometrical factors is explored. It is found that flat electrodes are sensitive to changes in the outer skin layer properties related to hydration and thickness, while spike electrodes are not. The impedance as a function of the effective contact area, number of spikes and penetration depth has also been studied and characterized.
Publisher: Association for Computing Machinery (ACM)
Date: 10-05-2022
DOI: 10.1145/3503465
Abstract: Recent years have seen an explosion of machine learning applications implemented on Field-Programmable Gate Arrays (FPGAs). FPGA vendors and researchers have responded by updating their fabrics to more efficiently implement machine learning accelerators, including innovations such as enhanced Digital Signal Processing (DSP) blocks and hardened systolic arrays. Evaluating architectural proposals is difficult, however, due to the lack of publicly available benchmark circuits. This paper addresses this problem by presenting an open-source benchmark circuit generator that creates realistic DNN-oriented circuits for use in FPGA architecture studies. Unlike previous generators, which create circuits that are agnostic of the underlying FPGA, our circuits explicitly instantiate embedded blocks, allowing for meaningful comparison of recent architectural proposals without the need for a complete inference computer-aided design (CAD) flow. Our circuits are compatible with the VTR CAD suite, allowing for architecture studies that investigate routing congestion and other low-level architectural implications. In addition to addressing the lack of machine learning benchmark circuits, the architecture exploration flow that we propose allows for a more comprehensive evaluation of FPGA architectures than traditional static benchmark suites. We demonstrate this through three case studies which illustrate how realistic benchmark circuits can be generated to target different heterogeneous FPGAs.
Publisher: IEEE
Date: 03-2008
Publisher: Springer Science and Business Media LLC
Date: 18-06-2003
Publisher: IEEE Comput. Soc
Date: 2002
Publisher: Elsevier BV
Date: 02-2017
Publisher: IEEE
Date: 06-2018
Publisher: Association for Computing Machinery (ACM)
Date: 13-02-2020
DOI: 10.1145/3376924
Abstract: Kernel adaptive filters (KAFs) are non-linear filters which can adapt temporally and have the additional benefit of being computationally efficient through use of the “kernel trick”. In a number of real-world applications, such as channel equalisation, the non-linear mapping provides significant improvements over conventional linear techniques such as the least mean squares (LMS) and recursive least squares (RLS) algorithms. Prior works have focused mainly on the theory and accuracy of KAFs, with little research on their implementations. This article proposes several variants of algorithms based on the kernel normalised least mean squares (KNLMS) algorithm which utilise a delayed model update to minimise dependencies. Subsequently, this work proposes corresponding hardware architectures which utilise this delayed model update to achieve high sample rates and low latency while also providing high modelling accuracy. The resultant delayed KNLMS (DKNLMS) algorithms can achieve clock rates up to 12× higher than the standard KNLMS algorithm, with minimal impact on accuracy and stability. A system implementation achieves 250 GOps/s and a throughput of 187.4 MHz on an Ultra96 board with 1.8× higher throughput than previous state of the art.
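For readers unfamiliar with KNLMS, the underlying (non-delayed) algorithm can be sketched in software. This is a generic textbook-style KNLMS with a Gaussian kernel and coherence-based dictionary growth, not the paper's DKNLMS hardware design; all hyperparameters are illustrative:

```python
import numpy as np

def gauss(x, y, gamma=1.0):
    """Gaussian kernel between two sample vectors."""
    return np.exp(-gamma * np.sum((x - y) ** 2))

class KNLMS:
    """Minimal textbook KNLMS with coherence-based dictionary growth."""
    def __init__(self, eta=0.5, eps=1e-2, mu0=0.9):
        self.eta, self.eps, self.mu0 = eta, eps, mu0
        self.dict, self.alpha = [], []

    def step(self, x, d):
        """One online update; returns the a-priori prediction error."""
        if not self.dict:
            self.dict, self.alpha = [x], [0.0]
        k = np.array([gauss(x, c) for c in self.dict])
        if k.max() < self.mu0:          # novel input: grow the dictionary
            self.dict.append(x)
            self.alpha.append(0.0)
            k = np.append(k, 1.0)
        a = np.array(self.alpha)
        e = d - a @ k                   # prediction error
        self.alpha = list(a + self.eta * e * k / (self.eps + k @ k))
        return e

# learn y = sin(x) online from streaming samples
f = KNLMS()
rng = np.random.default_rng(1)
for x in rng.uniform(0, 3, 500):
    f.step(np.array([x]), np.sin(x))
print(len(f.dict), "dictionary atoms")
```

Note the loop-carried dependency: each `step` reads the weights the previous `step` wrote. That dependency is what limits a pipelined hardware implementation's clock rate, and it is exactly what the paper's delayed model update relaxes.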
Publisher: IEEE
Date: 12-2007
Publisher: IEEE
Date: 09-2017
Publisher: IEEE
Date: 05-2014
Publisher: IEEE
Date: 2006
Publisher: Springer Science and Business Media LLC
Date: 06-1993
DOI: 10.1007/BF01581960
Publisher: Institution of Engineering and Technology (IET)
Date: 1999
DOI: 10.1049/EL:19990998
Publisher: Springer International Publishing
Date: 2017
Publisher: IEEE
Date: 08-2018
Publisher: Institution of Engineering and Technology (IET)
Date: 2012
DOI: 10.1049/EL.2011.3651
Publisher: IEEE
Date: 12-2008
Publisher: IEEE Comput. Soc
Date: 2003
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 04-2017
Publisher: Elsevier BV
Date: 12-1997
DOI: 10.1016/S0378-5955(97)00161-5
Abstract: Measurement of localization performance will reflect errors that relate to the sensory processing of the cues to sound location and the errors associated with the method by which the subject indicates the perceived location. This study has measured the ability of human subjects to localize a short noise burst presented in the free field with the subject indicating the perceived location by pointing their nose towards the source. Subjects were first trained using a closed loop training paradigm which involved instantaneous feedback as to the accuracy of head pointing which resulted in the reduction of residual localization errors and a rapid acquisition of the task by the subjects. Once trained, 19 subjects localized between 4 and 6 blocks of 76 target locations. The data were pooled and the distribution of errors associated with each target location was examined using spherical methods. Errors in the localization estimates for about one third of the locations were rotationally symmetrical about their mean but the remaining locations were best described by an elliptical distribution (Kent distributed). For about one half of the latter locations the orientations of the directions of the greatest variance of the distributions were not aligned with the azimuth and elevation coordinates used for describing the spatial location of the targets. The accuracy (systematic errors) and the distribution of the errors (variance) in localization for our population of subjects were also examined for each test location. The size of the data set and the methods of analysis provide very reliable measures of important baseline parameters of human auditory localization.
Publisher: IEEE
Date: 04-2010
Publisher: IEEE
Date: 09-2015
Publisher: IEEE
Date: 12-2019
Publisher: ACM
Date: 12-02-2023
Publisher: IEEE
Date: 08-2018
Publisher: Springer Berlin Heidelberg
Date: 2001
Publisher: IEEE Comput. Soc. Press
Date: 1995
Publisher: IEEE
Date: 03-2008
Publisher: The Optical Society
Date: 11-10-2017
DOI: 10.1364/OE.25.026067
Publisher: IEEE
Date: 09-2015
Publisher: Foundation for Open Access Statistics
Date: 2016
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 02-2020
Publisher: Springer-Verlag
Date: 2005
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 05-2012
Publisher: IEEE
Date: 2005
Publisher: IEEE
Date: 12-2016
Publisher: IEEE
Date: 04-2019
Publisher: IEEE
Date: 08-2009
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 12-2009
Publisher: ACM
Date: 22-02-2012
Publisher: IEEE Comput. Soc
Date: 1998
Publisher: ACM
Date: 27-02-2011
Publisher: ACM
Date: 18-02-2007
Publisher: IEEE
Date: 2002
Publisher: Springer Singapore
Date: 2015
Publisher: Springer Singapore
Date: 2015
Publisher: IEEE
Date: 12-2019
Publisher: Springer Singapore
Date: 2015
Publisher: Springer Singapore
Date: 2015
Publisher: IEEE
Date: 08-2015
Publisher: Infopro Digital Services Limited
Date: 2016
Publisher: IEEE
Date: 03-2018
Publisher: IEEE
Date: 04-2013
Publisher: Association for Computing Machinery (ACM)
Date: 15-12-2017
DOI: 10.1145/3106744
Abstract: Kernel adaptive filters (KAFs) are online machine learning algorithms which are amenable to highly efficient streaming implementations. They require only a single pass through the data and can act as universal approximators, i.e. approximate any continuous function with arbitrary accuracy. KAFs are members of a family of kernel methods which apply an implicit non-linear mapping of input data to a high dimensional feature space, permitting learning algorithms to be expressed entirely as inner products. Such an approach avoids explicit projection into the feature space, enabling computational efficiency. In this paper, we propose the first fully pipelined implementation of the kernel normalised least mean squares algorithm for regression. Independent training tasks necessary for hyperparameter optimisation fill pipeline stages, so no stall cycles to resolve dependencies are required. Together with other optimisations to reduce resource utilisation and latency, our core achieves 161 GFLOPS on a Virtex 7 XC7VX485T FPGA for a floating point implementation and 211 GOPS for fixed point. Our PCI Express based floating-point system implementation achieves 80% of the core’s speed, this being a speedup of 10× over an optimised implementation on a desktop processor and 2.66× over a GPU.
Publisher: ACM
Date: 22-02-2009
Publisher: Association for Computing Machinery (ACM)
Date: 24-09-2016
DOI: 10.1145/2950061
Abstract: Kernel methods utilize linear methods in a nonlinear feature space and combine the advantages of both. Online kernel methods, such as kernel recursive least squares (KRLS) and kernel normalized least mean squares (KNLMS), perform nonlinear regression in a recursive manner, with similar computational requirements to linear techniques. In this article, an architecture for a microcoded kernel method accelerator is described, and high-performance implementations of sliding-window KRLS, fixed-budget KRLS, and KNLMS are presented. The architecture utilizes pipelining and vectorization for performance, and microcoding for reusability. The design can be scaled to allow tradeoffs between capacity, performance, and area. The design is compared with a central processing unit (CPU), digital signal processor (DSP), and Altera OpenCL implementations. In different configurations on an Altera Arria 10 device, our SW-KRLS implementation delivers floating-point throughput of approximately 16 GFLOPS, latency of 5.5 μs, and energy consumption of 10⁻⁴ J, these being improvements over a CPU by factors of 12, 17, and 24, respectively.
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 2020
Publisher: IEEE
Date: 1999
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 08-2023
Publisher: IEEE
Date: 04-2012
DOI: 10.1109/FCCM.2012.16
Publisher: IEEE
Date: 2006
Publisher: IEEE Comput. Soc
Date: 1998
Publisher: IEEE
Date: 2007
Publisher: IEEE
Date: 2002
Publisher: IEEE
Date: 05-2018
Publisher: IEEE
Date: 2005
Publisher: OSA
Date: 2017
Publisher: ACM
Date: 23-02-2020
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 2018
Publisher: IEEE
Date: 05-2011
DOI: 10.1109/FCCM.2011.51
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 12-2012
Publisher: IEEE Comput. Soc
Date: 2003
Publisher: Elsevier BV
Date: 04-1998
DOI: 10.1016/S0165-0270(97)00201-X
Abstract: A systematic analysis of the localization of objects in extra-personal space requires a three-dimensional method of documenting location. In auditory localization studies the location of a sound source is often reduced to a directional vector with constant magnitude with respect to the observer, data being plotted on a unit sphere with the observer at the origin. This is an attractive form of data representation as the relevant spherical statistical and graphical methods are well described. In this paper we collect together a set of spherical plotting and statistical procedures to visualize and summarize these data. We describe methods for visualizing auditory localization data without assuming that the principal components of the data are aligned with the coordinate system. As a means of comparing experimental techniques and having a common set of data for the verification of spherical statistics, the software (implemented in MATLAB) and database described in this paper have been placed in the public domain. Although originally intended for the visualization and summarization of auditory psychophysical data, these routines are sufficiently general to be applied in other situations involving spherical data.
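As a flavour of the spherical methods the abstract refers to (though not the MATLAB routines themselves), localization responses can be mapped to unit vectors and summarized by a mean direction and resultant length; the azimuth/elevation values below are invented:

```python
import numpy as np

def to_unit(az_deg, el_deg):
    """Convert azimuth/elevation (degrees) to unit vectors on the sphere."""
    az, el = np.radians(az_deg), np.radians(el_deg)
    return np.column_stack([np.cos(el) * np.cos(az),
                            np.cos(el) * np.sin(az),
                            np.sin(el)])

def mean_direction(vecs):
    """Spherical mean direction and resultant length R in [0, 1];
    R near 1 indicates a tight cluster of responses."""
    s = vecs.sum(axis=0)
    r = np.linalg.norm(s)
    return s / r, r / len(vecs)

# four responses clustered near azimuth 10 deg, elevation 0 deg
pts = to_unit([10, 12, 8, 11], [0, 2, -1, 1])
mu, R = mean_direction(pts)
print(np.round(mu, 3), round(R, 4))
```

Working on the unit sphere rather than on raw azimuth/elevation angles avoids the distortions near the poles that make planar statistics misleading for localization data.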
Publisher: IEEE
Date: 05-2011
DOI: 10.1109/FCCM.2011.57
Publisher: Optica Publishing Group
Date: 06-08-2015
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 08-2005
Publisher: World Scientific Pub Co Pte Ltd
Date: 12-1993
DOI: 10.1142/S0129065793000316
Abstract: An analogue neural network VLSI chip designed for low power operation is presented. This chip consists of 84 synapse elements arranged as arrays of size 10 × 6 and 6 × 4 and was fabricated using a standard 1.2 μm double metal single poly CMOS process. The synapses are digitally programmable and static weight storage is provided. The chip has a typical power consumption of tens of microwatts. It has been successfully trained and tested on a range of classification problems including 4-bit parity, character recognition and morphological-based classification of intracardiac electrogram signals.
Publisher: IEEE
Date: 08-2019
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 02-2003
Publisher: Association for Computing Machinery (ACM)
Date: 18-10-2019
DOI: 10.1145/3359983
Abstract: The computational complexity of neural networks for large-scale or real-time applications necessitates hardware acceleration. Most approaches assume that the network architecture and parameters are unknown at design time, permitting usage in a large number of applications. This article demonstrates, for the case where the neural network architecture and ternary weight values are known a priori, that extremely high throughput implementations of neural network inference can be made by customising the datapath and routing to remove unnecessary computations and data movement. This approach is ideally suited to FPGA implementations as a specialized implementation of a trained network improves efficiency while still retaining generality with the reconfigurability of an FPGA. A VGG-style network with ternary weights and fixed point activations is implemented for the CIFAR10 dataset on Amazon’s AWS F1 instance. This article demonstrates how to remove 90% of the operations in convolutional layers by exploiting sparsity and compile-time optimizations. The implementation in hardware achieves 90.9 ± 0.1% accuracy and 122k frames per second, with a latency of only 29 µs, which is the fastest CNN inference implementation reported so far on an FPGA.
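The operation-removal claim follows directly from weight sparsity: with weights fixed at compile time, every zero ternary weight eliminates a multiply-accumulate from the datapath entirely. A toy count with a made-up weight tensor at roughly 90% sparsity (the actual network's sparsity pattern comes from training, not from random sampling):

```python
import numpy as np

rng = np.random.default_rng(0)
# illustrative ternary weight tensor: values in {-1, 0, +1}, mostly zero
w = rng.choice([-1, 0, 1], size=(64, 64, 3, 3), p=[0.05, 0.9, 0.05])

# a zero weight contributes nothing, so its MAC can be removed at
# compile time; the remaining +/-1 weights need only an add or subtract
total_ops = w.size
kept_ops = np.count_nonzero(w)
print(f"{100 * (1 - kept_ops / total_ops):.0f}% of MACs removed")
```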
Publisher: IEEE
Date: 1999
Publisher: IEEE
Date: 12-2009
Publisher: IEEE
Date: 08-2014
Publisher: American Physiological Society
Date: 10-2017
DOI: 10.1152/JAPPLPHYSIOL.00726.2016
Abstract: The forced oscillation technique (FOT) can provide unique and clinically relevant lung function information while requiring little cooperation from subjects. However, FOT has higher variability than spirometry, possibly because strategies for quality control and reducing artifacts in FOT measurements have yet to be standardized or validated. Many quality control procedures rely on either simple statistical filters or subjective evaluation by a human operator. In this study, we propose an automated artifact removal approach based on the resistance against flow profile, applied to complete breaths. We report results obtained from data recorded from children and adults, with and without asthma. Our proposed method has 76% agreement with a human operator for the adult data set and 79% for the pediatric data set. Furthermore, we assessed the variability of respiratory resistance measured by FOT using within-session variation (wCV) and between-session variation (bCV). In the asthmatic adults test data set, our method was again similar to that of the manual operator for wCV (6.5 vs. 6.9%) and significantly improved bCV (8.2 vs. 8.9%). Our combined automated breath removal approach based on advanced feature extraction offers better or equivalent quality control of FOT measurements compared with an expert operator and computationally more intensive methods, in terms of accuracy and reducing intrasubject variability. NEW & NOTEWORTHY The forced oscillation technique (FOT) is gaining wider acceptance for clinical testing; however, strategies for quality control are still highly variable and require a high level of subjectivity. We propose an automated, complete breath approach for removal of respiratory artifacts from FOT measurements, using feature extraction and an interquartile range filter. Our approach offers better or equivalent performance compared with an expert operator, in terms of accuracy and reducing intrasubject variability.
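The abstract above mentions an interquartile range filter for rejecting artifactual breaths. A minimal sketch of that idea, assuming Tukey's standard k = 1.5 fence (the paper's exact threshold and breath features are not given here, so both the data and the constant are illustrative):

```python
import numpy as np

def iqr_outlier_mask(values, k=1.5):
    """Flag values outside [Q1 - k*IQR, Q3 + k*IQR]. k=1.5 is the usual
    Tukey fence; the published method's threshold may differ."""
    q1, q3 = np.percentile(values, [25, 75])
    iqr = q3 - q1
    return (values < q1 - k * iqr) | (values > q3 + k * iqr)

# Hypothetical per-breath resistance values; one artifactual breath at 9.5.
resistance = np.array([3.1, 3.0, 3.2, 2.9, 9.5, 3.1])
mask = iqr_outlier_mask(resistance)
clean = resistance[~mask]  # breaths retained for reporting
```

In the published method the filter operates on extracted per-breath features rather than raw resistance values, but the rejection rule has this shape.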
Publisher: IEEE
Date: 2017
Publisher: Institution of Engineering and Technology (IET)
Date: 2007
Publisher: Springer Science and Business Media LLC
Date: 15-09-2017
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 2023
Publisher: Elsevier BV
Date: 10-2021
Publisher: Optica Publishing Group
Date: 16-07-2018
DOI: 10.1364/OL.43.003469
Publisher: Association for Computing Machinery (ACM)
Date: 03-2008
Abstract: We present an architecture for a synthesizable datapath-oriented FPGA core that can be used to provide post-fabrication flexibility to an SoC. Our architecture is optimized for bus-based operations and employs a directional routing architecture, which allows it to be synthesized using standard ASIC design tools and flows. The primary motivation for this architecture is to provide an efficient mechanism to support on-chip debugging. The fabric can also be used to implement other datapath-oriented circuits such as those needed in signal processing and computation-intensive applications. We evaluate our architecture using a set of benchmark circuits and compare it to previous fabrics in terms of area, speed, and power.
Publisher: IEEE
Date: 12-2019
Publisher: IEEE
Date: 09-2012
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 03-2017
Publisher: IEEE
Date: 12-2015
Publisher: IEEE
Date: 12-2011
Publisher: IEEE
Date: 2006
Publisher: MDPI AG
Date: 26-02-2018
DOI: 10.3390/S18030693
Publisher: IEEE
Date: 05-2018
Publisher: IEEE
Date: 2005
Publisher: IEEE
Date: 08-2014
Publisher: Elsevier BV
Date: 10-2019
Publisher: IEEE
Date: 12-2017
Publisher: American Physical Society (APS)
Date: 19-12-2016
Publisher: Association for Computing Machinery (ACM)
Date: 22-12-2023
DOI: 10.1145/3546181
Abstract: The spectral correlation density (SCD) function is the time-averaged correlation of two spectral components used for analyzing periodic signals with time-varying spectral content. Although the analysis is extremely powerful, it has not been widely adopted in real-time applications due to its high computational complexity. In this article, we present an efficient FPGA implementation of the FFT accumulation method (FAM) for estimating the SCD function and its alpha profile. The implementation uses a linear systolic array with a bi-directional datapath consisting of DSP-based processing elements (PEs) with a dedicated instruction schedule, achieving a PE utilization of 88.2%. The 128-PE implementation achieves a clock frequency in excess of 530 MHz and consumes 151K LUTs, 151K FFs, 264 BRAMs, 4 URAMs, and 1,054 DSPs, which is less than 36% of the logic fabric on a Zynq UltraScale+ XCZU28DR-2FFVG1517E RFSoC device. It has a modest 12.5W power consumption and an energy efficiency of 4,832 MOPS/W, which is 20.6× better than the published state-of-the-art GPU implementation. In terms of throughput, it achieves 15,340 windows/s (15,340 windows/s × 2,048 samples/window = 31.4 MS/s), which is a 4.65× improvement compared to the above-mentioned GPU implementation and 807× compared to an existing hybrid FPGA-GPU implementation.
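The SCD estimate described above can be illustrated in a few lines. This sketch is a naive time-smoothed cyclic periodogram, not the paper's FAM pipeline or its systolic-array schedule: it averages products of spectral components separated by a cyclic frequency α over sliding FFT windows, with window/hop sizes chosen arbitrarily for the example.

```python
import numpy as np

def scd_estimate(x, nwin=64, nhop=32):
    """Naive SCD estimate: for each cyclic-frequency bin a, average
    X_t[f] * conj(X_t[f+a]) over short-time FFT windows t. A didactic
    stand-in for the far more efficient FFT accumulation method (FAM)."""
    windows = [x[i:i + nwin] for i in range(0, len(x) - nwin + 1, nhop)]
    X = np.fft.fft(np.array(windows) * np.hanning(nwin), axis=1)
    scd = np.zeros((nwin, nwin), dtype=complex)
    for a in range(nwin):
        scd[a] = np.mean(X * np.conj(np.roll(X, -a, axis=1)), axis=0)
    return scd

# Noisy tone: the a=0 row reduces to an ordinary averaged periodogram.
rng = np.random.default_rng(0)
x = np.cos(2 * np.pi * 0.1 * np.arange(1024)) + 0.1 * rng.standard_normal(1024)
S = scd_estimate(x)
alpha_profile = np.abs(S).max(axis=1)  # peak magnitude per cyclic frequency
```

The "alpha profile" mentioned in the abstract is the maximum SCD magnitude over spectral frequency for each cyclic frequency, as in the last line.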
Publisher: IEEE
Date: 2006
Publisher: IEEE
Date: 03-2014
Publisher: IEEE
Date: 2005
Publisher: Springer Berlin Heidelberg
Date: 2002
Publisher: IEEE
Date: 2006
Publisher: IEEE
Date: 12-2006
Publisher: IEEE
Date: 2008
Publisher: IEEE
Date: 12-2015
Publisher: Springer International Publishing
Date: 2018
Publisher: IEEE
Date: 08-2006
Publisher: Springer International Publishing
Date: 2018
Publisher: IEEE
Date: 05-2009
Publisher: IEEE
Date: 12-2009
Publisher: IEEE
Date: 12-2008
Publisher: IEEE
Date: 12-2018
Publisher: IEEE
Date: 12-2010
Publisher: IEEE
Date: 2006
Publisher: Elsevier BV
Date: 04-2002
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 09-2022
Publisher: Institute of Electronics, Information and Communications Engineers (IEICE)
Date: 05-2019
Publisher: Association for Computing Machinery (ACM)
Date: 02-11-2007
Abstract: Rapid generation of high quality Gaussian random numbers is a key capability for simulations across a wide range of disciplines. Advances in computing have brought the power to conduct simulations with very large numbers of random numbers and with it, the challenge of meeting increasingly stringent requirements on the quality of Gaussian random number generators (GRNG). This article describes the algorithms underlying various GRNGs, compares their computational requirements, and examines the quality of the random numbers with emphasis on the behaviour in the tail region of the Gaussian probability density function.
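As a concrete instance of the GRNG algorithms the survey above compares, here is the Box-Muller transform, one of the classical methods in that family. The comment on tail behaviour reflects the survey's emphasis; the sampling loop is an illustrative harness, not code from the article.

```python
import math
import random

def box_muller(u1, u2):
    """Map two independent uniforms on (0,1] to two independent standard
    Gaussian samples. Tail accuracy is bounded by the resolution of u1
    near zero (sqrt(-2*ln(u1)) caps the largest reachable |z|), which is
    exactly the tail-quality concern the survey examines."""
    r = math.sqrt(-2.0 * math.log(u1))
    return r * math.cos(2 * math.pi * u2), r * math.sin(2 * math.pi * u2)

rng = random.Random(42)
samples = []
for _ in range(10000):
    # 1.0 - random() keeps u1 in (0, 1], avoiding log(0).
    z0, z1 = box_muller(1.0 - rng.random(), rng.random())
    samples.extend([z0, z1])
```

Hardware generators typically prefer methods such as Wallace or piecewise-polynomial inversion over Box-Muller's transcendental functions, a trade-off the survey quantifies.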
Publisher: IEEE
Date: 06-2006
Publisher: IEEE
Date: 12-2009
Publisher: IEEE
Date: 2003
Publisher: IEEE
Date: 2003
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 07-2012
Publisher: IEEE
Date: 08-2007
Publisher: Informa UK Limited
Date: 17-05-2016
Publisher: IEEE
Date: 12-2007
Publisher: IEEE
Date: 12-2007
Publisher: IEEE
Date: 2003
Start Date: 12-2014
End Date: 06-2018
Amount: $200,000.00
Funder: Australian Research Council
Start Date: 01-2012
End Date: 07-2016
Amount: $375,000.00
Funder: Australian Research Council
Start Date: 2023
End Date: 12-2023
Amount: $1,465,519.00
Funder: Australian Research Council