ORCID Profile
0000-0002-7311-3693
Current Organisation
University of Queensland
In Research Link Australia (RLA), "Research Topics" refer to ANZSRC FOR and SEO codes. These topics are either sourced from ANZSRC FOR and SEO codes listed in researchers' related grants or generated by a large language model (LLM) based on their publications.
Information Systems | Database Management | Conceptual Modelling | Research, Science and Technology Policy | Business Information Systems | Computer-Human Interaction
Information Services not elsewhere classified | Information Processing Services (incl. Data Entry and Capture) | Application Tools and System Utilities | Electronic Information Storage and Retrieval Services | Expanding Knowledge in Technology | Technological and Organisational Innovation
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 2019
Publisher: ACM
Date: 18-07-2018
Publisher: Elsevier BV
Date: 12-2022
Publisher: Association for Computing Machinery (ACM)
Date: 25-06-2009
Abstract: INEX investigates focused retrieval from structured documents by providing large test collections of structured documents, uniform evaluation measures, and a forum for organizations to compare their results. This paper reports on the INEX 2008 evaluation campaign, which consisted of a wide range of tracks: Ad hoc, Book, Efficiency, Entity Ranking, Interactive, QA, Link the Wiki, and XML Mining.
Publisher: ACM
Date: 09-08-2023
Publisher: ACM
Date: 07-04-2014
Publisher: ACM
Date: 25-07-2020
Publisher: Springer International Publishing
Date: 2022
Publisher: Springer Berlin Heidelberg
Date: 2013
Publisher: Association for Computing Machinery (ACM)
Date: 24-05-2011
Abstract: The exponential growth of digital information available in enterprises and on the Web creates the need for search tools that can respond to the most sophisticated information needs. Retrieving relevant documents is not enough anymore, and finding entities rather than textual resources provides great support to the user both on the Web and in enterprises. Many user tasks would be simplified if search engines supported typed search and returned entities instead of just Web pages. For example, an executive who tries to solve a problem needs to find people in the company who are knowledgeable about a certain topic. Aggregation of information spread over different documents is a key aspect in this process. Finding experts is a problem mostly considered in the enterprise setting, where teams for new projects need to be built and problems need to be solved by the right persons. In the first part of the thesis, we propose a model for expert finding based on the well-consolidated vector space model for Information Retrieval and investigate its effectiveness. We can define Entity Retrieval by generalizing the expert finding problem to any entity. In Entity Retrieval the goal is to rank entities according to their relevance to a query (e.g., "Countries where I can pay in Euro"); the set of entities to be ranked is assumed to be loosely defined by a generic category, given in the query itself (e.g., countries), or by some example entities (e.g., Italy, Germany, France). In the second part of the thesis, we investigate different methods based on Semantic Web and Natural Language Processing techniques for solving these tasks both in Wikipedia and, more generally, on the Web. Evaluation is a critical aspect of Information Retrieval. We contributed to the field of Information Retrieval evaluation by organizing an evaluation initiative for Entity Retrieval. Opinions and other relevant information about entities can be provided by different sources in different contexts. News articles report about events where entities are involved. In such a setting the temporal dimension is critical, as news stories develop over time: new entities appear in the story and others are no longer relevant. In the third part of this thesis, we study the problem of Entity Retrieval for news applications and the importance of the news trail history (i.e., past related articles) in determining the relevant entities in current articles. We also study opinion evolution about entities. In recent years, the blogosphere has become a vital part of the Web, covering a variety of different points of view and opinions on political and event-related topics such as immigration, election campaigns, or economic developments. We propose a method for automatically extracting public opinion about specific entities from the blogosphere. Available online.
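To make the vector-space approach to expert finding mentioned above concrete, here is a minimal Python sketch that builds a textual profile per candidate from their associated documents and ranks candidates by the cosine similarity between profile and query vectors. This is a caricature of the general idea, not the thesis's exact model; all data and names are hypothetical.

    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.metrics.pairwise import cosine_similarity

    def rank_experts(docs, authors, query):
        # Build a textual profile per candidate by concatenating the
        # documents they are associated with.
        profiles = {}
        for doc, names in zip(docs, authors):
            for name in names:
                profiles[name] = profiles.get(name, "") + " " + doc
        candidates = list(profiles)
        matrix = TfidfVectorizer().fit_transform(
            [profiles[c] for c in candidates] + [query])
        # Rank candidates by cosine similarity between profile and query.
        scores = cosine_similarity(matrix[:-1], matrix[-1:]).ravel()
        return sorted(zip(candidates, scores), key=lambda p: -p[1])

    ranking = rank_experts(
        docs=["indexing and retrieval of xml documents",
              "neural networks for image classification"],
        authors=[["alice"], ["bob"]],
        query="xml retrieval expert")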
Publisher: ACM
Date: 18-07-2019
Publisher: Association for the Advancement of Artificial Intelligence (AAAI)
Date: 02-06-2023
DOI: 10.1609/ICWSM.V17I1.22221
Abstract: Social media is an important source of real-time imagery concerning world events. One subset of social media posts which may be of particular interest are those featuring firearms. These posts can give insight into weapon movements, troop activity and civilian safety. Object detection tools offer important opportunities for insight into these images. Unfortunately, these images can be visually complex, poorly lit and generally challenging for object detection models. We present an analysis of existing gun detection datasets, and find that these datasets do not effectively address the challenge of gun detection on real-life images. Following this, we present a novel object detection pipeline. We train our pipeline on a number of datasets, including one created for this investigation made up of Twitter images of the Russo-Ukrainian War. We compare the performance of our model as trained on the different datasets to baseline numbers provided by the original authors as well as a YOLO v5 benchmark. We find that our model outperforms the state-of-the-art benchmarks on contextually rich, real-life-derived imagery of firearms.
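For context, the YOLO v5 baseline referenced above can be run in a few lines via the public ultralytics hub; this sketch shows only the stock pretrained model (whose COCO classes contain no firearm category, which is why the paper fine-tunes on gun-specific data), and the image path is hypothetical.

    import torch

    # Load the stock pretrained YOLOv5 small model from the public hub.
    model = torch.hub.load('ultralytics/yolov5', 'yolov5s', pretrained=True)

    # Run detection on a (hypothetical) social media image and print the
    # detected classes with their confidences.
    results = model('tweet_image.jpg')
    results.print()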
Publisher: ACM
Date: 06-11-2017
Publisher: Association for the Advancement of Artificial Intelligence (AAAI)
Date: 31-05-2022
DOI: 10.1609/ICWSM.V16I1.19395
Abstract: During global health crises, the use of data becomes critical to control the spread of infections, to inform the general public and to foster safe behaviors. The ability of people to read and understand data (i.e., data literacy) has the potential to affect human behaviors. In this paper, we study non-expert human subjects' ability to make accurate interpretations of complex pandemic data visualizations designed for general public consumption. We present them with popular plots and graphs that have been shown in traditional and social media, and ask them to answer questions to assess their data literacy at three levels: extracting information, finding relationships among data, and expanding or predicting information. Our results show the presence of variance in interpretations and reveal insights into how messages communicated through data may be perceived differently by different people. We also highlight the importance of designing communication strategies that ensure the spread of the right message through data.
Publisher: Linnaeus University Press
Date: 03-05-2022
Abstract: Infectious disease outbreaks are a serious public health threat which can disrupt world economies. This paper presents an in-depth qualitative analysis of n=15,415 tweets that relate to the peaks of three major infectious disease outbreaks: the swine flu outbreak of 2009, the Ebola outbreak of 2014, and the Zika outbreak of 2016. Tweets were analysed using thematic analysis and a number of themes and sub-themes were identified. The results were brought together in an abstraction phase and the commonalities between the cases were studied. A notable similarity which emerged was the rate at which Twitter users expressed intense fear and panic, akin to the phenomena of "moral panic" and the "outbreak narrative". Our study also discusses the utility of using Twitter data for in-depth qualitative research as compared to traditional interview methods. Our study is the largest in-depth analysis of tweets on infectious diseases and could inform public health strategies for future outbreaks such as the coronavirus outbreak.
Publisher: Springer Berlin Heidelberg
Date: 2013
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 07-2015
DOI: 10.1109/MIS.2015.66
Publisher: Springer Berlin Heidelberg
Date: 2012
Publisher: ACM
Date: 09-12-2021
Publisher: Springer International Publishing
Date: 2022
Publisher: Springer International Publishing
Date: 2020
Publisher: ACM
Date: 25-04-2022
Publisher: Association for Computing Machinery (ACM)
Date: 18-08-2023
DOI: 10.1145/3597201
Abstract: To scale the size of Information Retrieval collections, crowdsourcing has become a common way to collect relevance judgments at scale. Crowdsourcing experiments usually employ 100–10,000 workers, but such a number is often decided in a heuristic way. The downside is that the resulting dataset has no guarantee of meeting predefined statistical requirements, such as, for example, having enough statistical power to distinguish, in a statistically significant way, between the relevance of two documents. We propose a methodology adapted from the literature on sound topic set size design, based on the t-test and ANOVA, which aims at guaranteeing that the resulting dataset meets a predefined set of statistical requirements. We validate our approach on several public datasets. Our results show that we can reliably estimate the recommended number of workers needed to achieve statistical power, and that such an estimation is dependent on the topic, while the effect of the relevance scale is limited. Furthermore, we found that such an estimation is dependent on worker features such as agreement. Finally, we describe a set of practical estimation strategies that can be used to estimate the worker set size, and we also provide results on the estimation of document set sizes.
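To give a flavour of the kind of power analysis this methodology builds on, here is a minimal Python sketch assuming a two-sided independent-samples t-test; the effect size, alpha, and power values are illustrative assumptions, not the paper's recommendations.

    from statsmodels.stats.power import TTestIndPower

    # How many workers per condition are needed to detect a medium effect
    # (Cohen's d = 0.5) at alpha = 0.05 with 80% power?
    n_workers = TTestIndPower().solve_power(
        effect_size=0.5, alpha=0.05, power=0.8)
    print(f"workers per condition: {n_workers:.0f}")  # about 64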
Publisher: Springer Berlin Heidelberg
Date: 2010
Publisher: Now Publishers
Date: 2017
DOI: 10.1561/1800000025
Publisher: ACM
Date: 09-12-2021
Publisher: Wiley
Date: 31-08-2022
DOI: 10.1111/JCAL.12729
Abstract: The use of crowdsourcing in a pedagogically supported form to partner with learners in developing novel content is emerging as a viable approach for engaging students in higher-order learning at scale. However, how students behave in this form of crowdsourcing, referred to as learnersourcing, is still insufficiently explored. To contribute to filling this gap, this study explores how students engage with learnersourcing tasks across a range of course and assessment designs. We conducted an exploratory study on trace data of 1279 students across three courses, originating from the use of a learnersourcing environment under different assessment designs. We employed a new methodology from the learning analytics (LA) field that aims to represent students' behaviour through two theoretically-derived latent constructs: learning tactics and the learning strategies built upon them. The study's results demonstrate students use different tactics and strategies, highlight the association of learnersourcing contexts with the identified learning tactics and strategies, indicate a significant association between the strategies and performance and contribute to the employed method's generalisability by applying it to a new context. This study provides an example of how learning analytics methods can be employed towards the development of effective learnersourcing systems and, more broadly, technological educational solutions that support learner-centred and data-driven learning at scale. Findings should inform best practices for integrating learnersourcing activities into course design and shed light on the relevance of tactics and strategies to support teachers in making informed pedagogical decisions.
Publisher: Springer Berlin Heidelberg
Date: 2013
Publisher: Springer Science and Business Media LLC
Date: 18-05-2010
Publisher: ACM
Date: 26-06-2022
Publisher: Elsevier BV
Date: 03-2016
Publisher: ACM
Date: 12-08-2012
Publisher: ACM
Date: 27-06-2018
Publisher: Springer International Publishing
Date: 2020
Publisher: ACM
Date: 25-04-2022
Publisher: Springer Berlin Heidelberg
Date: 2008
Publisher: Zenodo
Date: 2022
Publisher: International World Wide Web Conferences Steering Committee
Date: 11-04-2016
Publisher: ACM
Date: 24-11-2008
Publisher: Association for Computing Machinery (ACM)
Date: 07-02-2023
DOI: 10.1145/3567419
Abstract: Understanding how data workers interact with data, and various pieces of information related to data preparation, is key to designing systems that can better support them in exploring datasets. To date, however, there is a paucity of research studying the strategies adopted by data workers as they carry out data preparation activities. In this work, we investigate a specific data preparation activity, namely data quality discovery, and aim to (i) understand the behaviors of data workers in discovering data quality issues, (ii) explore what factors (e.g., prior experience) can affect their behaviors, as well as (iii) understand how these behavioral observations relate to their performance. To this end, we collect a multi-modal dataset through a data-driven experiment that relies on the use of eye-tracking technology with a purpose-designed platform built on top of IPython Notebook. The experiment results reveal that: (i) 'copy–paste–modify' is a typical strategy for writing code to complete tasks; (ii) proficiency in writing code has a significant impact on the quality of task performance, while perceived difficulty and efficacy can influence task completion patterns; and (iii) searching in external resources is a prevalent action that can be leveraged to achieve better performance. Furthermore, our experiment indicates that providing sample code within the system can help data workers get started with their task, and surfacing underlying data is an effective way to support exploration. By investigating data worker behaviors prior to each search action, we also find that the most common reasons that trigger external search actions are the need to seek assistance in writing or debugging code and to search for relevant code to reuse. Based on our experiment results, we showcase a systematic approach to select the best code snippets created by data workers and assemble them to achieve better performance than the best individual performer in the dataset. By doing so, our findings not only provide insights into patterns of interactions with various system components and information resources when performing data curation tasks, but also help build effective and efficient data curation processes through data workers' collective intelligence.
Publisher: ACM
Date: 12-04-2021
Publisher: Springer Berlin Heidelberg
Date: 2010
Publisher: Springer International Publishing
Date: 2019
Publisher: Springer International Publishing
Date: 2020
Publisher: Association for Computing Machinery (ACM)
Date: 13-10-2021
DOI: 10.1145/3479531
Abstract: Crowdsourcing is being increasingly adopted as a platform to run studies with human subjects. Running a crowdsourcing experiment involves several choices and strategies to successfully port an experimental design into an otherwise uncontrolled research environment, e.g., sampling crowd workers, mapping experimental conditions to micro-tasks, or ensuring quality contributions. While several guidelines inform researchers in these choices, guidance on how and what to report from crowdsourcing experiments has been largely overlooked. If under-reported, implementation choices constitute sources of variability that can affect the experiment's reproducibility and prevent a fair assessment of research outcomes. In this paper, we examine the current state of reporting of crowdsourcing experiments and offer guidance to address associated reporting issues. We start by identifying sensible implementation choices, relying on existing literature and interviews with experts, and then extensively analyze the reporting of 171 crowdsourcing experiments. Informed by this process, we propose a checklist for reporting crowdsourcing experiments.
Publisher: International World Wide Web Conferences Steering Committee
Date: 18-05-2015
Publisher: IEEE
Date: 10-2008
Publisher: Springer International Publishing
Date: 2018
Publisher: Springer Science and Business Media LLC
Date: 08-09-2015
Publisher: Springer Science and Business Media LLC
Date: 26-06-2019
Publisher: Springer Berlin Heidelberg
Date: 2012
Publisher: International Joint Conferences on Artificial Intelligence Organization
Date: 08-2019
Abstract: An important precondition to building effective AI models is the collection of training data at scale. Crowdsourcing is a popular methodology to achieve this goal. Its adoption introduces novel challenges in data quality control, to deal with under-performing and malicious annotators. One of the most popular quality assurance mechanisms, especially in paid micro-task crowdsourcing, is the use of a small set of pre-annotated tasks as a gold standard, to assess the annotators' quality in real time. In this paper, we highlight a set of vulnerabilities this scheme suffers from: a group of colluding crowd workers can easily implement and deploy a decentralised machine learning inferential system to detect and signal which parts of the task are more likely to be gold questions, making them ineffective as a quality control tool. Moreover, we demonstrate how the most common countermeasures against this attack are ineffective in practical scenarios. The basic architecture of the inferential system is composed of a browser plug-in and an external server where the colluding workers can share information. We implement and validate the attack scheme by means of experiments on real-world data from a popular crowdsourcing platform.
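A toy, centralised caricature of such an inferential system is sketched below: colluding workers report a fingerprint of every task they see to shared state, and a task seen by unusually many distinct workers is flagged as likely gold, since the gold set is small and reused. The data structures and threshold are illustrative; the paper's system is a browser plug-in backed by an external server.

    import hashlib
    from collections import defaultdict

    # Shared state: which distinct workers have seen each task fingerprint.
    seen_by = defaultdict(set)

    def report_task(worker_id, task_text):
        # Fingerprint the task content so identical tasks collide.
        fingerprint = hashlib.sha256(
            task_text.strip().lower().encode()).hexdigest()
        seen_by[fingerprint].add(worker_id)
        return fingerprint

    def likely_gold(fingerprint, min_workers=3):
        # A small gold set is reused across many workers, so a task seen
        # by several distinct workers is suspicious (threshold illustrative).
        return len(seen_by[fingerprint]) >= min_workers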
Publisher: ACM
Date: 27-06-2018
Publisher: Elsevier BV
Date: 08-2014
Publisher: ACM
Date: 21-03-2022
Publisher: Springer Berlin Heidelberg
Date: 2008
Publisher: IEEE
Date: 18-07-2021
Publisher: Association for the Advancement of Artificial Intelligence (AAAI)
Date: 04-10-2021
Abstract: Automatic predictions (e.g., recognizing objects in images) may result in systematic errors if certain classes are not well represented by training instances (these errors are called unknowns). When a model assigns high confidence scores to these wrong predictions (this type of error is called unknown unknowns), it becomes challenging to identify them automatically. In this paper, we present the first work on leveraging human intelligence to discover unknown unknowns (UUs) in an iterative way. The proposed methodology first differentiates the feature space generated by crowd workers labelling instances (e.g., images) in an active learning fashion from the space learned by the prediction model over a batch training phase, and thus identifies the predictions most likely to be UUs. Next, we add crowd labels collected for these discovered UUs to the training set and re-train the model with this extended dataset. This process is then repeated iteratively to discover more instances of both unknown and under-represented classes. Our experimental results show that the proposed methodology is able to (1) efficiently discover UUs, (2) significantly improve the quality of model predictions, and (3) push UUs into known unknowns (i.e., the model makes mistakes, but at least its classification confidence on those instances is low, so those predictions can be discarded or post-processed) for further investigation. We additionally discuss the trade-off between prediction quality improvements and the human effort required to achieve those improvements. Our results bear implications for building cost-effective systems to discover UUs with humans in the loop.
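The sketch below simulates a drastically simplified version of this loop on synthetic data. The selection step is deliberately naive (it queries the crowd on the model's most confident pool predictions rather than contrasting crowd-labelled and model-learned feature spaces as the paper does), and the crowd is simulated by an oracle; everything here is illustrative.

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    rng = np.random.default_rng(0)

    # Synthetic pool: class 1 has an under-represented cluster that the
    # initial training set misses entirely.
    X = np.vstack([rng.normal(0, 1, (500, 2)),         # class 0
                   rng.normal(4, 1, (450, 2)),         # class 1, common
                   rng.normal([-4, 4], 1, (50, 2))])   # class 1, rare
    y = np.array([0] * 500 + [1] * 500)

    train = list(range(50)) + list(range(500, 550))
    pool = [i for i in range(len(y)) if i not in set(train)]

    model = LogisticRegression()
    for round_ in range(5):
        model.fit(X[train], y[train])
        confidence = model.predict_proba(X[pool]).max(axis=1)
        # Query the (simulated) crowd on the most confident predictions.
        queried = [pool[i] for i in np.argsort(-confidence)[:40]]
        crowd_labels = y[queried]                      # crowd as oracle
        uus = int((crowd_labels != model.predict(X[queried])).sum())
        print(f"round {round_}: {uus} unknown unknowns discovered")
        train += queried                               # extend training set
        pool = [i for i in pool if i not in set(queried)]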
Publisher: Elsevier BV
Date: 09-2023
Publisher: Springer Science and Business Media LLC
Date: 16-09-2021
DOI: 10.1007/S00779-021-01604-6
Abstract: Recently, the misinformation problem has been addressed with a crowdsourcing-based approach: to assess the truthfulness of a statement, instead of relying on a few experts, a crowd of non-experts is exploited. We study whether crowdsourcing is an effective and reliable method to assess truthfulness during a pandemic, targeting statements related to COVID-19, thus addressing (mis)information that is both related to a sensitive and personal issue and very recent as compared to when the judgment is done. In our experiments, crowd workers are asked to assess the truthfulness of statements and to provide evidence for their assessments. Besides showing that the crowd is able to accurately judge the truthfulness of the statements, we report results on workers' behavior, agreement among workers, and the effects of aggregation functions, scale transformations, and workers' background and bias. We perform a longitudinal study by re-launching the task multiple times with both novice and experienced workers, deriving important insights on how behavior and quality change over time. Our results show that workers are able to detect and objectively categorize online (mis)information related to COVID-19; both crowdsourced and expert judgments can be transformed and aggregated to improve quality; and worker background and other signals (e.g., source of information, behavior) impact the quality of the data. The longitudinal study demonstrates that the time span has a major effect on the quality of the judgments, for both novice and experienced workers. Finally, we provide an extensive failure analysis of the statements misjudged by the crowd workers.
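As a tiny illustration of the scale transformation and aggregation steps studied here, the sketch below maps an ordinal truthfulness scale onto [0, 1] and combines judgments with two common aggregation functions; the actual scales and functions compared in the paper vary, so treat this purely as an example.

    from statistics import mean, median

    def aggregate_judgments(judgments, scale_max):
        # Transform ordinal judgments (1..scale_max) to [0, 1], then
        # aggregate; mean and median are two common choices.
        scores = [(j - 1) / (scale_max - 1) for j in judgments]
        return {"mean": mean(scores), "median": median(scores)}

    print(aggregate_judgments([1, 2, 2, 5, 6], scale_max=6))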
Publisher: ACM
Date: 18-07-2023
Publisher: ACM
Date: 13-05-2019
Publisher: ACM
Date: 13-06-2011
Publisher: Association for Computing Machinery (ACM)
Date: 28-12-2022
DOI: 10.1145/3546916
Abstract: Automatically detecting online misinformation at scale is a challenging and interdisciplinary problem. Deciding what is to be considered truthful information is sometimes controversial and difficult even for educated experts. As the scale of the problem increases, human-in-the-loop approaches to truthfulness that combine both the scalability of machine learning (ML) and the accuracy of human contributions have been considered. In this work, we look at the potential to automatically combine machine-based systems with human-based systems. The former exploit supervised ML approaches; the latter involve either crowd workers (i.e., human non-experts) or human experts. Since both ML and crowdsourcing approaches can produce a score indicating the level of confidence in their truthfulness judgments (either algorithmic or self-reported, respectively), we address the question of whether it is feasible to make use of such confidence scores to effectively and efficiently combine three approaches: (i) machine-based methods, (ii) crowd workers, and (iii) human experts. The three approaches differ significantly, as they range from available, cheap, fast, and scalable but less accurate, to scarce, expensive, slow, and not scalable but highly accurate.
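A minimal sketch of how such confidence scores could drive the combination, assuming each stage is a callable returning a (label, confidence) pair; the routing thresholds and function names are hypothetical, not taken from the paper.

    def judge(statement, machine, crowd, expert,
              machine_threshold=0.9, crowd_threshold=0.7):
        # Cascade from cheap-and-scalable to accurate-but-scarce judges,
        # escalating only when the current stage is not confident enough.
        label, confidence = machine(statement)     # algorithmic confidence
        if confidence >= machine_threshold:
            return label, "machine"
        label, confidence = crowd(statement)       # self-reported confidence
        if confidence >= crowd_threshold:
            return label, "crowd"
        label, _ = expert(statement)               # expert always accepted
        return label, "expert"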
Publisher: Association for Computing Machinery (ACM)
Date: 22-06-2023
DOI: 10.1145/3588431
Publisher: ACM
Date: 17-10-2022
Publisher: Springer Science and Business Media LLC
Date: 15-12-2018
Publisher: Association for Computing Machinery (ACM)
Date: 28-12-2023
DOI: 10.1145/3546917
Abstract: Automated fact-checking (AFC) systems exist to combat disinformation; however, their complexity usually makes them opaque to the end user, making it difficult to foster trust in the system. In this article, we introduce the E-BART model with the hope of making progress on this front. E-BART is able to provide a veracity prediction for a claim and jointly generate a human-readable explanation for this decision. We show that E-BART is competitive with the state of the art on the e-FEVER and e-SNLI tasks. In addition, we validate the joint-prediction architecture by showing (1) that generating explanations does not significantly impede the model from performing well in its main task of veracity prediction, and (2) that predicted veracity and explanations are more internally coherent when generated jointly than separately. We also calibrate the E-BART model, allowing the output of the final model to be correctly interpreted as the confidence of correctness. Finally, we conduct an extensive human evaluation of the impact of generated explanations and observe that explanations increase human ability to spot misinformation and make people more skeptical about claims, and that explanations generated by E-BART are competitive with ground truth explanations.
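The abstract does not name the calibration method used; as a generic example of post-hoc calibration, the sketch below applies temperature scaling, which rescales logits by a temperature fitted on a validation set so that softmax confidences better match empirical accuracy. Purely illustrative.

    import numpy as np

    def temperature_scale(logits, temperature):
        # T > 1 softens over-confident predictions; T is typically fitted
        # by minimising negative log-likelihood on held-out data.
        z = logits / temperature
        z = z - z.max(axis=-1, keepdims=True)  # numerical stability
        exp = np.exp(z)
        return exp / exp.sum(axis=-1, keepdims=True)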
Publisher: Elsevier BV
Date: 10-2015
Publisher: Springer Berlin Heidelberg
Date: 2009
Publisher: Springer Berlin Heidelberg
Date: 2009
Publisher: ACM
Date: 30-04-2023
Publisher: Springer International Publishing
Date: 2021
Publisher: Elsevier BV
Date: 03-2010
Publisher: ACM
Date: 17-10-2018
Publisher: ACM
Date: 30-04-2023
Publisher: Association for Computing Machinery (ACM)
Date: 11-09-2017
DOI: 10.1145/3130914
Abstract: The ubiquity of the Internet and the widespread proliferation of electronic devices have resulted in flourishing microtask crowdsourcing marketplaces, such as Amazon MTurk. An aspect that has remained largely invisible in microtask crowdsourcing is that of work environments, defined as the hardware and software affordances at the disposal of crowd workers that are used to complete microtasks on crowdsourcing platforms. In this paper, we reveal the significant role of work environments in the shaping of crowd work. First, through a pilot study surveying the good and bad experiences workers had with UI elements in crowd work, we revealed the typical issues workers face. Based on these findings, we then deployed over 100 distinct microtasks on CrowdFlower, addressing workers in India and the USA in two identical batches. These tasks emulate the good and bad UI element designs that characterize crowdsourcing microtasks. We recorded hardware specifics such as CPU speed and device type, apart from software specifics including the browsers used to complete tasks, the operating systems on the device, and other properties that define the work environments of crowd workers. Our findings indicate that crowd workers are embedded in a variety of work environments which influence the quality of work produced. To confirm and validate our data-driven findings, we then carried out semi-structured interviews with a sample of Indian and American crowd workers from this platform. Depending on the design of UI elements in microtasks, we found that some work environments support crowd workers more than others. Based on our overall findings resulting from all three studies, we introduce ModOp, a tool that helps design crowdsourcing microtasks that are suitable for diverse crowd work environments. We empirically show that the use of ModOp results in reduced cognitive load for workers, thereby improving their user experience without affecting the accuracy or task completion time.
Publisher: ACM
Date: 03-11-2019
Publisher: Elsevier BV
Date: 11-2021
Publisher: ACM
Date: 20-01-2020
Publisher: Emerald Publishing Limited
Date: 06-12-2017
Publisher: Springer International Publishing
Date: 2016
Publisher: ACM
Date: 03-11-2019
Publisher: Springer International Publishing
Date: 2019
Publisher: Wiley
Date: 20-01-2019
DOI: 10.1111/HIR.12247
Abstract: Infectious disease outbreaks have the potential to cause a high number of fatalities and are a very serious public health risk. Our aim was to utilise an in-depth method to study a period of time when the H1N1 pandemic of 2009 was at its peak. A data set of n = 214,784 tweets was retrieved and filtered, and the method of thematic analysis was used to analyse the data. Eight key themes emerged from the analysis: emotion and feeling, health-related information, general commentary and resources, media and health organisations, politics, country of origin, food, and humour and/or sarcasm. A major novel finding was that, due to the name 'swine flu', Twitter users believed that pigs and pork could host and/or transmit the virus. Our paper also considers the methodological implications for the wider field of library and information science as well as specific implications for health information and library workers. Novel insights were derived on how users communicate about disease outbreaks on social media platforms. Our study also provides an innovative methodological contribution, because utilising an in-depth method made it possible to extract greater insight into user communication.
Publisher: Association for Computing Machinery (ACM)
Date: 21-02-2019
DOI: 10.1145/3301003
Abstract: Crowdsourcing has become an integral part of many systems and services that deliver high-quality results for complex tasks such as data linkage, schema matching, and content annotation. A standard function of such crowd-powered systems is to publish a batch of tasks on a crowdsourcing platform automatically and to collect the results once the workers complete them. Currently, these systems provide limited guarantees over the execution time, which is problematic for many applications. Timely completion may even be impossible to guarantee due to factors specific to the crowdsourcing platform, such as the availability of workers and concurrent tasks. In our previous work, we presented the architecture of a crowd-powered system that reshapes the interaction mechanism with the crowd. Specifically, we studied a push-crowdsourcing model whereby the workers receive tasks instead of selecting them from a portal. Based on this interaction model, we employed scheduling techniques similar to those found in distributed computing infrastructures to automate the task assignment process. In this work, we first devise a generic scheduling strategy that supports both fairness and deadline-awareness. Second, to complement the proof-of-concept experiments previously performed with the crowd, we present an extensive set of simulations meant to analyze the properties of the proposed scheduling algorithms in an environment with thousands of workers and tasks. Our experimental results show that, by accounting for human factors, micro-task scheduling can achieve fairness for best-effort batches and boost production batches.
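A compact sketch of one way deadline-awareness and fairness can coexist in a micro-task scheduler: production tasks are served earliest-deadline-first once they become urgent, while best-effort tasks are otherwise served in arrival order. The slack threshold and data structures are illustrative, not the paper's algorithm.

    import heapq

    def next_task(production, best_effort, now, slack=60):
        # production: heap of (deadline, task) pairs; best_effort: FIFO list.
        if production and production[0][0] - now <= slack:
            return heapq.heappop(production)[1]  # urgent: earliest deadline
        if best_effort:
            return best_effort.pop(0)            # fairness: serve best-effort
        if production:
            return heapq.heappop(production)[1]  # no best-effort work left
        return None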
Publisher: ACM
Date: 12-09-2019
Publisher: ACM
Date: 19-07-2010
Publisher: ACM
Date: 29-03-2020
Publisher: Springer Science and Business Media LLC
Date: 16-01-2022
DOI: 10.1007/S00778-021-00720-2
Abstract: The appetite for effective use of information assets has been steadily rising in both public and private sector organisations. However, whether the information is used for social good or commercial gain, there is a growing recognition of the complex socio-technical challenges associated with balancing the diverse demands of regulatory compliance and data privacy, social expectations and ethical use, business process agility and value creation, and scarcity of data science talent. In this vision paper, we present a series of case studies that highlight these interconnected challenges, across a range of application areas. We use the insights from the case studies to introduce Information Resilience, as a scaffold within which the competing requirements of responsible and agile approaches to information use can be positioned. The aim of this paper is to develop and present a manifesto for Information Resilience that can serve as a reference for future research and development in relevant areas of responsible data management.
Publisher: ACM
Date: 13-05-2019
Publisher: Springer Berlin Heidelberg
Date: 2009
Publisher: Springer Berlin Heidelberg
Date: 2009
Publisher: ACM
Date: 19-07-2010
Publisher: Elsevier BV
Date: 03-2023
Publisher: ACM
Date: 25-07-2020
Publisher: Springer Berlin Heidelberg
Date: 2006
DOI: 10.1007/11735106_48
Publisher: Springer International Publishing
Date: 2020
Publisher: ACM
Date: 17-10-2022
Publisher: AI Access Foundation
Date: 03-03-2020
DOI: 10.1613/JAIR.1.11332
Abstract: Crowdsourcing is a popular methodology to collect manual labels at scale. Such labels are often used to train AI models and, thus, quality control is a key aspect of the process. One of the most popular quality assurance mechanisms in paid micro-task crowdsourcing is based on gold questions: the use of a small set of tasks for which the requester knows the correct answer and, thus, is able to directly assess crowd work quality. In this paper, we show that such a mechanism is prone to an attack, carried out by a group of colluding crowd workers, that is easy to implement and deploy: the inherent size limit of the gold set can be exploited by building an inferential system to detect which parts of the job are more likely to be gold questions. The described attack is robust to various forms of randomisation and programmatic generation of gold questions. We present the architecture of the proposed system, composed of a browser plug-in and an external server used to share information, and briefly introduce its potential evolution to a decentralised implementation. We implement and experimentally validate the gold detection system, using real-world data from a popular crowdsourcing platform. Our experimental results show that crowd workers using the proposed system spend more time on signalled gold questions but do not neglect the others, thus achieving increased overall work quality. Finally, we discuss the economic and sociological implications of this kind of attack.
Publisher: ACM
Date: 10-2017
Publisher: Association for Computing Machinery (ACM)
Date: 18-08-2010
Abstract: INEX investigates focused retrieval from structured documents by providing large test collections of structured documents, uniform evaluation measures, and a forum for organizations to compare their results. This paper reports on the INEX 2009 evaluation campaign, which consisted of a wide range of tracks: Ad hoc, Book, Efficiency, Entity Ranking, Interactive, QA, Link the Wiki, and XML Mining. INEX is run entirely on volunteer effort by the IR research community: anyone with an idea and some time to spend can have a major impact.
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 2022
Publisher: IEEE
Date: 08-2013
Publisher: Association for Computing Machinery (ACM)
Date: 14-10-2020
DOI: 10.1145/3415203
Abstract: Paid micro-task crowdsourcing has gained in popularity partly due to the increasing need for large-scale manually labelled datasets, which are often used to train and evaluate Artificial Intelligence systems. Modern paid crowdsourcing platforms use a piecework approach to rewards, meaning that workers are paid for each task they complete, provided that their work quality is considered sufficient by the requester or the platform. Such an approach creates risks for workers: their work may be rejected without being rewarded, and they may be working on poorly rewarded tasks, in light of the disproportionate time required to complete them. As a result, recent research has shown that crowd workers may tend to choose specific, simple, and familiar tasks and avoid new requesters to manage these risks. In this paper, we propose a novel crowdsourcing reward mechanism that allows workers to share these risks and achieve a standardized hourly wage, equal for all participating workers. Reward-focused workers can thereby take up challenging and complex HITs without bearing the financial risk of not being rewarded for completed work. We experimentally compare different crowd reward schemes and observe their impact on worker performance and satisfaction. Our results show that 1) workers clearly perceive the benefits of the proposed reward scheme, 2) work effectiveness and efficiency are not impacted compared to those of the piecework scheme, and 3) the presence of slow workers is limited and does not disrupt the proposed cooperation-based approaches.
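The core of the proposed mechanism can be stated as simple arithmetic: piecework rewards are pooled and redistributed as a uniform hourly wage, so the risk of individual rejections is shared. A minimal sketch, with a made-up contribution format:

    def hourly_payouts(contributions):
        # contributions: list of (worker_id, reward_earned, hours_worked).
        pool = sum(reward for _, reward, _ in contributions)
        total_hours = sum(hours for _, _, hours in contributions)
        wage = pool / total_hours if total_hours else 0.0
        # Every worker is paid the same hourly wage for the hours worked.
        return {worker: wage * hours for worker, _, hours in contributions}

    payouts = hourly_payouts([("w1", 12.0, 2.0),   # fast, well-rewarded work
                              ("w2", 3.0, 2.0)])   # rejected/underpaid work
    # both workers receive 7.5 (15.0 pooled over 4 hours)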
Publisher: ACM
Date: 19-10-2020
Publisher: ACM
Date: 18-04-2015
Publisher: ACM
Date: 26-04-2010
Publisher: ACM
Date: 27-06-2018
Publisher: Springer International Publishing
Date: 2016
Publisher: ACM
Date: 09-11-2007
Publisher: Springer Science and Business Media LLC
Date: 18-07-2013
Publisher: Springer Berlin Heidelberg
Date: 2008
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 02-2021
Publisher: IEEE
Date: 11-2009
Location: United Kingdom of Great Britain and Northern Ireland
Start Date: 2019
End Date: 2021
Funder: Australian Research Council
Start Date: 2017
End Date: 2019
Funder: European Commission
Start Date: 2016
End Date: 2017
Funder: Engineering and Physical Sciences Research Council
Start Date: 01-2019
End Date: 12-2023
Amount: $440,000.00
Funder: Australian Research Council
Start Date: 07-2021
End Date: 07-2026
Amount: $4,883,406.00
Funder: Australian Research Council