ORCID Profile
0000-0002-6468-5729
Current Organisations
University of Technology Sydney
,
Nanyang Technological University
Does something not look right? The information on this page has been harvested from data sources that may not be up to date. We continue to work with information providers to improve coverage and quality. To report an issue, use the Feedback Form.
In Research Link Australia (RLA), "Research Topics" refer to ANZSRC FOR and SEO codes. These topics are either sourced from ANZSRC FOR and SEO codes listed in researchers' related grants or generated by a large language model (LLM) based on their publications.
Pattern Recognition and Data Mining | Artificial Intelligence and Image Processing | Business Information Management (incl. Records, Knowledge and Information Management, and Intelligence) | Computer Software | Data mining and knowledge discovery | Information Systems Development Methodologies | Data engineering and data science | Information Systems Management | Computer Hardware not elsewhere classified | Software Engineering | Computer Hardware | Mechanical Engineering | Deep learning | Acoustics and Noise Control (excl. Architectural Acoustics) | Behavioural Ecology | Data management and data science
Application Tools and System Utilities | Application Software Packages (excl. Computer Games) | Information Processing Services (incl. Data Entry and Capture) | Expanding Knowledge in the Information and Computing Sciences | Property, Business Support Services and Trade not elsewhere classified | Control of Pests, Diseases and Exotic Species in Urban and Industrial Environments | Information and Communication Services not elsewhere classified | Expanding Knowledge in Engineering | Expanding Knowledge in the Biological Sciences |
Publisher: IEEE
Date: 12-2008
DOI: 10.1109/ICDM.2008.78
Publisher: No publisher found
Date: 2005
DOI: 10.1007/11408079\_67
Publisher: Society for Industrial and Applied Mathematics
Date: 02-05-2013
Publisher: Association for Computing Machinery (ACM)
Date: 09-03-2015
DOI: 10.1145/2699670
Abstract: Social media provides valuable resources to analyze user behaviors and capture user preferences. This article focuses on analyzing user behaviors in social media systems and designing a latent class statistical mixture model, named temporal context-aware mixture model (TCAM), to account for the intentions and preferences behind user behaviors. Based on the observation that the behaviors of a user in social media systems are generally influenced by intrinsic interest as well as the temporal context (e.g., the public's attention at that time), TCAM simultaneously models the topics related to users' intrinsic interests and the topics related to temporal context and then combines the influences from the two factors to model user behaviors in a unified way. Considering that users' interests are not always stable and may change over time, we extend TCAM to a dynamic temporal context-aware mixture model (DTCAM) to capture users' changing interests. To alleviate the problem of data sparsity, we exploit the social and temporal correlation information by integrating a social-temporal regularization framework into the DTCAM model. To further improve the performance of our proposed models (TCAM and DTCAM), an item-weighting scheme is proposed to enable them to favor items that better represent topics related to user interests and topics related to temporal context, respectively. Based on our proposed models, we design a temporal context-aware recommender system (TCARS). To speed up the process of producing the top- k recommendations from large-scale social media data, we develop an efficient query-processing technique to support TCARS. Extensive experiments have been conducted to evaluate the performance of our models on four real-world datasets crawled from different social media sites. The experimental results demonstrate the superiority of our models, compared with the state-of-the-art competitor methods, by modeling user behaviors more precisely and making more effective and efficient recommendations.
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 02-2023
Publisher: IEEE
Date: 11-2018
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 10-2018
Publisher: Springer International Publishing
Date: 2016
Publisher: ACM
Date: 17-10-2015
Publisher: Elsevier BV
Date: 10-2009
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 12-2019
Publisher: No publisher found
Date: 2015
Publisher: Elsevier BV
Date: 10-2016
Publisher: IEEE
Date: 07-2015
Publisher: IEEE
Date: 11-2018
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 12-2019
Publisher: Springer Science and Business Media LLC
Date: 14-04-2012
Publisher: Association for the Advancement of Artificial Intelligence (AAAI)
Date: 17-07-2019
DOI: 10.1609/AAAI.V33I01.33014408
Abstract: In this paper, we propose a new online feature selection algorithm for streaming data. We aim to focus on the following two problems which remain unaddressed in literature. First, most existing online feature selection algorithms merely utilize the first-order information of the data streams, regardless of the fact that second-order information explores the correlations between features and significantly improves the performance. Second, most online feature selection algorithms are based on the balanced data presumption, which is not true in many real-world applications. For ex le, in fraud detection, the number of positive ex les are much less than negative ex les because most cases are not fraud. The balanced assumption will make the selected features biased towards the majority class and fail to detect the fraud cases. We propose an Adaptive Sparse Confidence-Weighted (ASCW) algorithm to solve the aforementioned two problems. We first introduce an `0-norm constraint into the second-order confidence-weighted (CW) learning for feature selection. Then the original loss is substituted with a cost-sensitive loss function to address the imbalanced data issue. Furthermore, our algorithm maintains multiple sparse CW learner with the corresponding cost vector to dynamically select an optimal cost. We theoretically enhance the theory of sparse CW learning and analyze the performance behavior in F-measure. Empirical studies show the superior performance over the stateof-the-art online learning methods in the online-batch setting.
Publisher: IEEE
Date: 07-2018
Publisher: International World Wide Web Conferences Steering Committee
Date: 03-04-2017
Publisher: Association for Computing Machinery (ACM)
Date: 20-04-2017
DOI: 10.1145/3011019
Abstract: With the rapid development of location-based social networks (LBSNs), spatial item recommendation has become an important mobile application, especially when users travel away from home. However, this type of recommendation is very challenging compared to traditional recommender systems. A user may visit only a limited number of spatial items, leading to a very sparse user-item matrix. This matrix becomes even sparser when the user travels to a distant place, as most of the items visited by a user are usually located within a short distance from the user’s home. Moreover, user interests and behavior patterns may vary dramatically across different time and geographical regions. In light of this, we propose ST-SAGE, a spatial-temporal sparse additive generative model for spatial item recommendation in this article. ST-SAGE considers both personal interests of the users and the preferences of the crowd in the target region at the given time by exploiting both the co-occurrence patterns and content of spatial items. To further alleviate the data-sparsity issue, ST-SAGE exploits the geographical correlation by smoothing the crowd’s preferences over a well-designed spatial index structure called the spatial pyramid . To speed up the training process of ST-SAGE, we implement a parallel version of the model inference algorithm on the GraphLab framework. We conduct extensive experiments the experimental results clearly demonstrate that ST-SAGE outperforms the state-of-the-art recommender systems in terms of recommendation effectiveness, model training efficiency, and online recommendation efficiency.
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 08-2021
Publisher: ACM
Date: 11-08-2013
Publisher: No publisher found
Date: 2004
Publisher: IEEE
Date: 12-2013
DOI: 10.1109/ICDM.2013.35
Publisher: No publisher found
Date: 2016
Publisher: Springer Berlin Heidelberg
Date: 2011
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 09-2019
Publisher: No publisher found
Date: 2015
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 05-2018
Publisher: Society for Industrial and Applied Mathematics
Date: 28-04-2011
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 11-2017
Publisher: Springer Berlin Heidelberg
Date: 2005
DOI: 10.1007/11408079_67
Publisher: IEEE
Date: 12-2016
Publisher: Springer International Publishing
Date: 2015
Publisher: Elsevier BV
Date: 11-2006
Publisher: Elsevier BV
Date: 02-2017
Publisher: Springer Berlin Heidelberg
Date: 2004
Publisher: Association for Computing Machinery (ACM)
Date: 27-01-2020
DOI: 10.1145/3372121
Abstract: The effective training of supervised Person Re-identification (Re-ID) models requires sufficient pairwise labeled data. However, when there is limited annotation resource, it is difficult to collect pairwise labeled data. We consider a challenging and practical problem called Early Active Learning, which is applied to the early stage of experiments when there is no pre-labeled s le available as references for human annotating. Previous early active learning methods suffer from two limitations for Re-ID. First, these instance-based algorithms select instances rather than pairs, which can result in missing optimal pairs for Re-ID. Second, most of these methods only consider the representativeness of instances, which can result in selecting less erse and less informative pairs. To overcome these limitations, we propose a novel pair-based active learning for Re-ID. Our algorithm selects pairs instead of instances from the entire dataset for annotation. Besides representativeness, we further take into account the uncertainty and the ersity in terms of pairwise relations. Therefore, our algorithm can produce the most representative, informative, and erse pairs for Re-ID data annotation. Extensive experimental results on five benchmark Re-ID datasets have demonstrated the superiority of the proposed pair-based early active learning algorithm.
Publisher: Springer Science and Business Media LLC
Date: 07-09-2019
Publisher: Springer Berlin Heidelberg
Date: 2009
Publisher: ACM
Date: 07-07-2016
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 2017
Publisher: No publisher found
Date: 2016
Publisher: IEEE
Date: 05-2017
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 05-2018
Publisher: Springer Berlin Heidelberg
Date: 2009
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 2023
Publisher: Association for Computing Machinery (ACM)
Date: 06-2014
DOI: 10.1145/2629461
Abstract: Newly emerging location-based and event-based social network services provide us with a new platform to understand users' preferences based on their activity history. A user can only visit a limited number of venues/events and most of them are within a limited distance range, so the user-item matrix is very sparse, which creates a big challenge to the traditional collaborative filtering-based recommender systems. The problem becomes even more challenging when people travel to a new city where they have no activity information. In this article, we propose LCARS, a location-content-aware recommender system that offers a particular user a set of venues (e.g., restaurants and shopping malls) or events (e.g., concerts and exhibitions) by giving consideration to both personal interest and local preference. This recommender system can facilitate people's travel not only near the area in which they live, but also in a city that is new to them. Specifically, LCARS consists of two components: offline modeling and online recommendation. The offline modeling part, called LCA-LDA, is designed to learn the interest of each in idual user and the local preference of each in idual city by capturing item cooccurrence patterns and exploiting item contents. The online recommendation part takes a querying user along with a querying city as input, and automatically combines the learned interest of the querying user and the local preference of the querying city to produce the top- k recommendations. To speed up the online process, a scalable query processing technique is developed by extending both the Threshold Algorithm (TA) and TA-approximation algorithm. We evaluate the performance of our recommender system on two real datasets, that is, DoubanEvent and Foursquare, and one large-scale synthetic dataset. The results show the superiority of LCARS in recommending spatial items for users, especially when traveling to new cities, in terms of both effectiveness and efficiency. Besides, the experimental analysis results also demonstrate the excellent interpretability of LCARS.
Publisher: IEEE
Date: 08-2020
Publisher: No publisher found
Date: 2006
DOI: 10.1007/11733836\_38
Publisher: Springer Berlin Heidelberg
Date: 2006
DOI: 10.1007/11731139_49
Publisher: No publisher found
Date: 2006
DOI: 10.1007/11731139\_49
Publisher: ACM Press
Date: 2015
Publisher: Elsevier BV
Date: 09-2013
Publisher: Springer Science and Business Media LLC
Date: 23-04-2019
Publisher: Elsevier BV
Date: 12-2006
Publisher: International Joint Conferences on Artificial Intelligence Organization
Date: 07-2018
Abstract: Attributed network embedding aims to learn a low-dimensional representation for each node of a network, considering both attributes and structure information of the node. However, the learning based methods usually involve substantial cost in time, which makes them impractical without the help of a powerful workhorse. In this paper, we propose a simple yet effective algorithm, named NetHash, to solve this problem only with moderate computing capacity. NetHash employs the randomized hashing technique to encode shallow trees, each of which is rooted at a node of the network. The main idea is to efficiently encode both attributes and structure information of each node by recursively sketching the corresponding rooted tree from bottom (i.e., the predefined highest-order neighboring nodes) to top (i.e., the root node), and particularly, to preserve as much information closer to the root node as possible. Our extensive experimental results show that the proposed algorithm, which does not need learning, runs significantly faster than the state-of-the-art learning-based network embedding methods while achieving competitive or even better performance in accuracy.
Publisher: Springer Berlin Heidelberg
Date: 2011
Publisher: No publisher found
Date: 2008
Publisher: ACM
Date: 21-04-2008
Publisher: ACM
Date: 09-02-2009
Publisher: Springer International Publishing
Date: 2015
Publisher: International Joint Conferences on Artificial Intelligence Organization
Date: 08-2019
Abstract: Attributed network embedding plays an important role in transferring network data into compact vectors for effective network analysis. Existing attributed network embedding models are designed either in continuous Euclidean spaces which introduce data redundancy or in binary coding spaces which incur significant loss of representation accuracy. To this end, we present a new Low-Bit Quantization for Attributed Network Representation Learning model (LQANR for short) that can learn compact node representations with low bitwidth values while preserving high representation accuracy. Specifically, we formulate a new representation learning function based on matrix factorization that can jointly learn the low-bit node representations and the layer aggregation weights under the low-bit quantization constraint. Because the new learning function falls into the category of mixed integer optimization, we propose an efficient mixed-integer based alternating direction method of multipliers (ADMM) algorithm as the solution. Experiments on real-world node classification and link prediction tasks validate the promising results of the proposed LQANR model.
Publisher: ACM
Date: 08-05-2007
Publisher: Springer Berlin Heidelberg
Date: 2006
DOI: 10.1007/11733836_38
Publisher: No publisher found
Date: 2016
Publisher: Springer Berlin Heidelberg
Date: 2004
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 05-2018
Publisher: ACM
Date: 18-06-2014
Publisher: No publisher found
Date: 2004
Publisher: No publisher found
Date: 2017
Publisher: ACM
Date: 13-11-2004
Publisher: Association for Computing Machinery (ACM)
Date: 04-2015
DOI: 10.1145/2663356
Abstract: This article proposes LA-LDA, a location-aware probabilistic generative model that exploits location-based ratings to model user profiles and produce recommendations. Most of the existing recommendation models do not consider the spatial information of users or items however, LA-LDA supports three classes of location-based ratings, namely spatial user ratings for nonspatial items, nonspatial user ratings for spatial items, and spatial user ratings for spatial items. LA-LDA consists of two components, ULA-LDA and ILA-LDA, which are designed to take into account user and item location information, respectively. The component ULA-LDA explicitly incorporates and quantifies the influence from local public preferences to produce recommendations by considering user home locations, whereas the component ILA-LDA recommends items that are closer in both taste and travel distance to the querying users by capturing item co-occurrence patterns, as well as item location co-occurrence patterns. The two components of LA-LDA can be applied either separately or collectively, depending on the available types of location-based ratings. To demonstrate the applicability and flexibility of the LA-LDA model, we deploy it to both top- k recommendation and cold start recommendation scenarios. Experimental evidence on large-scale real-world data, including the data from Gowalla (a location-based social network), DoubanEvent (an event-based social network), and MovieLens (a movie recommendation system), reveal that LA-LDA models user profiles more accurately by outperforming existing recommendation models for top- k recommendation and the cold start problem.
Publisher: Springer International Publishing
Date: 2016
Publisher: Springer International Publishing
Date: 2016
Publisher: ACM
Date: 11-08-2013
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 03-2020
Publisher: ACM
Date: 18-03-2013
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 2023
Publisher: No publisher found
Date: 2009
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 10-2022
Publisher: Association for Computing Machinery (ACM)
Date: 08-2009
Abstract: Web archives preserve the history of autonomous Web sites and are potential gold mines for all kinds of media and business analysts. The most common Web archiving technique uses crawlers to automate the process of collecting Web pages. However, (re)downloading entire collection of pages periodically from a large Web site is unfeasible. In this paper, we take a step towards addressing this problem. We devise a data mining-driven policy for selectively (re)downloading Web pages that are located in hierarchical directory structures which are believed to have changed significantly (e.g., a substantial percentage of pages are inserted to/removed from the directory). Consequently, there is no need to download and maintain pages that have not changed since the last crawl as they can be easily retrieved from the archive. In our approach, we propose an off-line data mining algorithm called near- Miner that analyzes the evolution history of Web directory structures of the original Web site stored in the archive and mines negatively correlated association rules (near) between ancestor-descendant Web directories. These rules indicate the evolution correlations between Web directories. Using the discovered rules, we propose an efficient Web archive maintenance algorithm called warm that optimally skips the subdirectories (during the next crawl) which are negatively correlated with it in undergoing significant changes. Our experimental results with real data show that our approach improves the efficiency of the archive maintenance process significantly while sacrificing slightly in keeping the "freshness" of the archives. Furthermore, our experiments demonstrate that it is not necessary to discover nears frequently as the mining rules can be utilized effectively for archive maintenance over multiple versions.
Publisher: IEEE
Date: 05-2016
Start Date: 06-2024
End Date: 06-2028
Amount: $1,140,382.00
Funder: Australian Research Council
View Funded ActivityStart Date: 09-2015
End Date: 10-2015
Amount: $510,000.00
Funder: Australian Research Council
View Funded ActivityStart Date: 2012
End Date: 05-2015
Amount: $380,000.00
Funder: Australian Research Council
View Funded ActivityStart Date: 2014
End Date: 09-2017
Amount: $390,000.00
Funder: Australian Research Council
View Funded ActivityStart Date: 2021
End Date: 12-2024
Amount: $300,000.00
Funder: Australian Research Council
View Funded ActivityStart Date: 2014
End Date: 07-2015
Amount: $1,000,000.00
Funder: Australian Research Council
View Funded ActivityStart Date: 2021
End Date: 06-2024
Amount: $368,810.00
Funder: Australian Research Council
View Funded ActivityStart Date: 07-2022
End Date: 06-2026
Amount: $580,165.00
Funder: Australian Research Council
View Funded Activity