ARDC Research Link Australia

ORCID Profile
Orcid icon. 0000-0002-4430-6373

Current Organisation
Nanyang Technological University

Does something not look right? The information on this page has been harvested from data sources that may not be up to date. We continue to work with information providers to improve coverage and quality. To report an issue, use the Feedback Form.

Research Topics

In Research Link Australia (RLA), "Research Topics" refer to ANZSRC FOR and SEO codes. These topics are either sourced from ANZSRC FOR and SEO codes listed in researchers' related grants or generated by a large language model (LLM) based on their publications.

ANZSRC Field of Research (FoR)

Information Systems | Database Management | Global Information Systems |

ANZSRC Socio-Economic Objective (SEO)

Electronic Information Storage and Retrieval Services | Expanding Knowledge in the Information and Computing Sciences | Information Processing Services (incl. Data Entry and Capture)

Publications

Publication

Extracting representative motion flows for effective video retrieval

Publisher: Springer Science and Business Media LLC

Date: 24-02-2011

DOI: 10.1007/S11042-011-0763-8

Publication

KORS: Keyword-aware Optimal Route Search System

Publisher: IEEE

Date: 04-2013

DOI: 10.1109/ICDE.2013.6544939

Publication

WISK: A Workload-aware Learned Index for Spatial Keyword Queries

Publisher: Association for Computing Machinery (ACM)

Date: 13-06-2023

DOI: 10.1145/3589332

Abstract: Spatial objects often come with textual information, such as Points of Interest (POIs) with their descriptions, which are referred to as geo-textual data. To retrieve such data, spatial keyword queries that take into account both spatial proximity and textual relevance have been extensively studied. Existing indexes designed for spatial keyword queries are mostly built based on the geo-textual data without considering the distribution of queries already received. However, previous studies have shown that utilizing the known query distribution can improve the index structure for future query processing. In this paper, we propose WISK, a learned index for spatial keyword queries, which self-adapts for optimizing querying costs given a query workload. One key challenge is how to utilize both structured spatial attributes and unstructured textual information during learning the index. We first ide the data objects into partitions, aiming to minimize the processing costs of the given query workload. We prove the NP-hardness of the partitioning problem and propose a machine learning model to find the optimal partitions. Then, to achieve more pruning power, we build a hierarchical structure based on the generated partitions in a bottom-up manner with a reinforcement learning-based approach. We conduct extensive experiments on real-world datasets and query workloads with various distributions, and the results show that WISK outperforms all competitors, achieving up to 8× speedup in querying time with comparable storage overhead.

Publication

Keyword-aware Optimal Route Search

Publisher: Association for Computing Machinery (ACM)

Date: 07-2012

DOI: 10.14778/2350229.2350234

Abstract: Identifying a preferable route is an important problem that finds applications in map services. When a user plans a trip within a city, the user may want to find "a most popular route such that it passes by shopping mall, restaurant , and pub , and the travel time to and from his hotel is within 4 hours." However, none of the algorithms in the existing work on route planning can be used to answer such queries. Motivated by this, we define the problem of keyword-aware optimal route query, denoted by KOR, which is to find an optimal route such that it covers a set of user-specified keywords, a specified budget constraint is satisfied, and an objective score of the route is optimal. The problem of answering KOR queries is NP-hard. We devise an approximation algorithm OSScaling with provable approximation bounds. Based on this algorithm, another more efficient approximation algorithm BucketBound is proposed. We also design a greedy approximation algorithm. Results of empirical studies show that all the proposed algorithms are capable of answering KOR queries efficiently, while the BucketBound and Greedy algorithms run faster. The empirical studies also offer insight into the accuracy of the proposed algorithms.

Publication

Efficient Algorithms for Answering the m-Closest Keywords Query

Publisher: ACM

Date: 27-05-2015

DOI: 10.1145/2723372.2723723

Publication

Spatial Keyword Querying

Publisher: Springer Berlin Heidelberg

Date: 2012

DOI: 10.1007/978-3-642-34002-4_2

Publication

Approaches to Exploring Category Information for Question Retrieval in Community Question-Answer Archives

Publisher: Association for Computing Machinery (ACM)

Date: 05-2012

DOI: 10.1145/2180868.2180869

Abstract: Community Question Answering (CQA) is a popular type of service where users ask questions and where answers are obtained from other users or from historical question-answer pairs. CQA archives contain large volumes of questions organized into a hierarchy of categories. As an essential function of CQA services, question retrieval in a CQA archive aims to retrieve historical question-answer pairs that are relevant to a query question. This article presents several new approaches to exploiting the category information of questions for improving the performance of question retrieval, and it applies these approaches to existing question retrieval models, including a state-of-the-art question retrieval model. Experiments conducted on real CQA data demonstrate that the proposed techniques are effective and efficient and are capable of outperforming a variety of baseline methods significantly.

Publication

Retrieving Regions of Interest for User Exploration

Publisher: Association for Computing Machinery (ACM)

Date: 05-2014

DOI: 10.14778/2732939.2732946

Abstract: We consider an application scenario where points of interest (PoIs) each have a web presence and where a web user wants to identify a region that contains relevant PoIs that are relevant to a set of keywords, e.g., in preparation for deciding where to go to conveniently explore the PoIs. Motivated by this, we propose the length-constrained maximum-sum region (LCMSR) query that returns a spatial-network region that is located within a general region of interest, that does not exceed a given size constraint, and that best matches query keywords. Such a query maximizes the total weight of the PoIs in it w.r.t. the query keywords. We show that it is NP-hard to answer this query. We develop an approximation algorithm with a (5 + ε) approximation ratio utilizing a technique that scales node weights into integers. We also propose a more efficient heuristic algorithm and a greedy algorithm. Empirical studies on real data offer detailed insight into the accuracy of the proposed algorithms and show that the proposed algorithms are capable of computing results efficiently and effectively.

Publication

Reverse $k$ Nearest Neighbor Search over Trajectories

Publisher: Institute of Electrical and Electronics Engineers (IEEE)

Date: 04-2018

DOI: 10.1109/TKDE.2017.2776268

Publication

SWORS: A System for the Efficient Retrieval of Relevant Spatial Web Objects

Publisher: Association for Computing Machinery (ACM)

Date: 08-2012

DOI: 10.14778/2367502.2367536

Abstract: Spatial web objects that possess both a geographical location and a textual description are gaining in prevalence. This gives prominence to spatial keyword queries that exploit both location and textual arguments. Such queries are used in many web services such as yellow pages and maps services. We present SWORS, the Spatial Web Object Retrieval System, that is capable of efficiently retrieving spatial web objects that satisfy spatial keyword queries. Specifically, SWORS supports two types of queries: a) the location-aware top- k text retrieval (L k T) query that retrieves k in idual spatial web objects taking into account query location proximity and text relevancy b) the spatial keyword group (SKG) query that retrieves a group of objects that cover the query keywords and are nearest to the query location and have the shortest inter-object distances. SWORS provides browser-based interfaces for desktop and laptop computers and provides a client application for mobile devices. The interfaces and the client enable users to formulate queries and view the query results on a map. The server side stores the data and processes the queries. We use three real-life data sets to demonstrate the functionality and performance of SWORS.

Publication

Distributed Algorithms on Exact Personalized PageRank

Publisher: ACM

Date: 09-05-2017

DOI: 10.1145/3035918.3035920

Publication

Mining Significant Semantic Locations From GPS Data

Publisher: Association for Computing Machinery (ACM)

Date: 09-2010

DOI: 10.14778/1920841.1920968

Abstract: With the increasing deployment and use of GPS-enabled devices, massive amounts of GPS data are becoming available. We propose a general framework for the mining of semantically meaningful, significant locations, e.g., shopping malls and restaurants, from such data. We present techniques capable of extracting semantic locations from GPS data. We capture the relationships between locations and between locations and users with a graph. Significance is then assigned to locations using random walks over the graph that propagates significance among the locations. In doing so, mutual reinforcement between location significance and user authority is exploited for determining significance, as are aspects such as the number of visits to a location, the durations of the visits, and the distances users travel to reach locations. Studies using up to 100 million GPS records from a confined spatio-temporal region demonstrate that the proposal is effective and is capable of outperforming baseline methods and an extension of an existing proposal.

Publication

Efficient Processing of Spatial Group Keyword Queries

Publisher: Association for Computing Machinery (ACM)

Date: 30-06-2015

DOI: 10.1145/2772600

Abstract: With the proliferation of geo-positioning and geo-tagging techniques, spatio-textual objects that possess both a geographical location and a textual description are gaining in prevalence, and spatial keyword queries that exploit both location and textual description are gaining in prominence. However, the queries studied so far generally focus on finding in idual objects that each satisfy a query rather than finding groups of objects where the objects in a group together satisfy a query. We define the problem of retrieving a group of spatio-textual objects such that the group's keywords cover the query's keywords and such that the objects are nearest to the query location and have the smallest inter-object distances. Specifically, we study three instantiations of this problem, all of which are NP-hard. We devise exact solutions as well as approximate solutions with provable approximation bounds to the problems. In addition, we solve the problems of retrieving top- k groups of three instantiations, and study a weighted version of the problem that incorporates object weights. We present empirical studies that offer insight into the efficiency of the solutions, as well as the accuracy of the approximate solutions.

Publication

The use of categorization information in language models for question retrieval

Publisher: ACM

Date: 02-11-2009

DOI: 10.1145/1645953.1645989

Publication

A generalized framework of exploring category information for question retrieval in community question answer archives

Publisher: ACM

Date: 26-04-2010

DOI: 10.1145/1772690.1772712

Publication

Collective spatial keyword querying

Publisher: ACM

Date: 12-06-2011

DOI: 10.1145/1989323.1989363

Publication

An efficient query indexing mechanism for filtering geo-textual data

Publisher: ACM

Date: 22-06-2013

DOI: 10.1145/2463676.2465328

Publication

SOPS: A System for Efficient Processing of Spatial-Keyword Publish/Subscribe

Publisher: Association for Computing Machinery (ACM)

Date: 08-2014

DOI: 10.14778/2733004.2733040

Abstract: Massive amount of data that are geo-tagged and associated with text information are being generated at an unprecedented scale. These geo-textual data cover a wide range of topics. Users are interested in receiving up-to-date geo-textual objects (e.g., geo-tagged Tweets) such that their locations meet users' need and their texts are interesting to users. For ex le, a user may want to be updated with tweets near her home on the topic "dengue fever headache". AB@In this demonstration, we present SOPS, the u S /u atial-Keyw u o /u rd u P /u ublish/ u S /u ubscribe System, that is capable of efficiently processing spatial keyword continuous queries. SOPS supports two types of queries: (1) u B /u oolean u R /u ange u C /u ontinuous (BRC) query that can be used to subscribe the geo-textual objects satisfying a boolean keyword expression and falling in a specified spatial region (2) u T /u empor u a /u l u S /u atial- u K /u eyword Top- k Continuous (TaSK) query that continuously maintains up-to-date top- k most relevant results over a stream of geo-textual objects. SOPS enables users to formulate their queries and view the real-time results over a stream of geo-textual objects by browser-based user interfaces. On the server side, we propose solutions to efficiently processing a large number of BRC queries (tens of millions) and TaSK queries over a stream of geo-textual objects.

Publication

Temporal Spatial-Keyword Top-k publish/subscribe

Publisher: IEEE

Date: 04-2015

DOI: 10.1109/ICDE.2015.7113289

Publication

Reverse k Nearest Neighbor Search over Trajectories (Extended Abstract)

Publisher: IEEE

Date: 04-2018

DOI: 10.1109/ICDE.2018.00250

Publication

Retrieving Top-k Prestige-Based Relevant Spatial Web Objects

Publisher: Association for Computing Machinery (ACM)

Date: 09-2010

DOI: 10.14778/1920841.1920891

Abstract: The location-aware keyword query returns ranked objects that are near a query location and that have textual descriptions that match query keywords. This query occurs inherently in many types of mobile and traditional web services and applications, e.g., Yellow Pages and Maps services. Previous work considers the potential results of such a query as being independent when ranking them. However, a relevant result object with nearby objects that are also relevant to the query is likely to be preferable over a relevant object without relevant nearby objects. The paper proposes the concept of prestige-based relevance to capture both the textual relevance of an object to a query and the effects of nearby objects. Based on this, a new type of query, the Location-aware top- k Prestige-based Text retrieval (L k PT) query, is proposed that retrieves the top- k spatial web objects ranked according to both prestige-based relevance and location proximity. We propose two algorithms that compute L k PT queries. Empirical studies with real-world spatial data demonstrate that L k PT queries are more effective in retrieving web objects than a previous approach that does not consider the effects of nearby objects and they show that the proposed algorithms are scalable and outperform a baseline approach significantly.

Publication

On Spatial Pattern Matching

Publisher: IEEE

Date: 04-2018

DOI: 10.1109/ICDE.2018.00035

Related Organisations

Organisation

Nanyang Technological University

Location: Singapore

View Organisation

Related Funding Activities

Grant

Discovery Projects - Grant ID: DP200102611

Start Date: 2020

End Date: 12-2024

Amount: $480,000.00

Funder: Australian Research Council

View Funded Activity

Grant

Discovery Projects - Grant ID: DP220101434

Start Date: 2022

End Date: 12-2024

Amount: $364,295.00

Funder: Australian Research Council

View Funded Activity

Gao Cong

Researcher

Research Topics

Top 5 Research Topics

ANZSRC Field of Research (FoR)

ANZSRC Socio-Economic Objective (SEO)

Publications

Extracting representative motion flows for effective video retrieval

KORS: Keyword-aware Optimal Route Search System

WISK: A Workload-aware Learned Index for Spatial Keyword Queries

Keyword-aware Optimal Route Search

Efficient Algorithms for Answering the m-Closest Keywords Query

Spatial Keyword Querying

Approaches to Exploring Category Information for Question Retrieval in Community Question-Answer Archives

Retrieving Regions of Interest for User Exploration

Reverse $k$ Nearest Neighbor Search over Trajectories

SWORS: A System for the Efficient Retrieval of Relevant Spatial Web Objects

Distributed Algorithms on Exact Personalized PageRank

Mining Significant Semantic Locations From GPS Data

Efficient Processing of Spatial Group Keyword Queries

The use of categorization information in language models for question retrieval

A generalized framework of exploring category information for question retrieval in community question answer archives

Collective spatial keyword querying

An efficient query indexing mechanism for filtering geo-textual data

SOPS: A System for Efficient Processing of Spatial-Keyword Publish/Subscribe

Temporal Spatial-Keyword Top-k publish/subscribe

Reverse k Nearest Neighbor Search over Trajectories (Extended Abstract)

Retrieving Top-k Prestige-Based Relevant Spatial Web Objects

On Spatial Pattern Matching

Related Organisations

Nanyang Technological University

Related Funding Activities

Discovery Projects - Grant ID: DP200102611

Discovery Projects - Grant ID: DP220101434

Gao Cong

Researcher

Research Topics

Top 5 Research Topics

ANZSRC Field of Research (FoR)

ANZSRC Socio-Economic Objective (SEO)

Related Links

Publications

Extracting representative motion flows for effective video retrieval

KORS: Keyword-aware Optimal Route Search System

WISK: A Workload-aware Learned Index for Spatial Keyword Queries

Keyword-aware Optimal Route Search

Efficient Algorithms for Answering the m-Closest Keywords Query

Spatial Keyword Querying

Approaches to Exploring Category Information for Question Retrieval in Community Question-Answer Archives

Retrieving Regions of Interest for User Exploration

Reverse $k$ Nearest Neighbor Search over Trajectories

SWORS: A System for the Efficient Retrieval of Relevant Spatial Web Objects

Distributed Algorithms on Exact Personalized PageRank

Mining Significant Semantic Locations From GPS Data

Efficient Processing of Spatial Group Keyword Queries

The use of categorization information in language models for question retrieval

A generalized framework of exploring category information for question retrieval in community question answer archives

Collective spatial keyword querying

An efficient query indexing mechanism for filtering geo-textual data

SOPS: A System for Efficient Processing of Spatial-Keyword Publish/Subscribe

Temporal Spatial-Keyword Top-k publish/subscribe

Reverse k Nearest Neighbor Search over Trajectories (Extended Abstract)

Retrieving Top-k Prestige-Based Relevant Spatial Web Objects

On Spatial Pattern Matching

Related Organisations

Nanyang Technological University

Related Funding Activities

Discovery Projects - Grant ID: DP200102611

Discovery Projects - Grant ID: DP220101434

ARDC NEWSLETTER SIGNUP