ORCID Profile
0000-0001-9094-0810
Current Organisation
RMIT University
In Research Link Australia (RLA), "Research Topics" refer to ANZSRC FOR and SEO codes. These topics are either sourced from ANZSRC FOR and SEO codes listed in researchers' related grants or generated by a large language model (LLM) based on their publications.
Library and Information Studies | Information Retrieval and Web Search | Computer-Human Interaction | Data Storage Representations | Information Storage, Retrieval And Management | Data Structures
Electronic Information Storage and Retrieval Services | Information processing services | Information Services not elsewhere classified | Application Tools and System Utilities
Publisher: ACM
Date: 27-06-2018
Publisher: ACM
Date: 26-10-2010
Publisher: ACM
Date: 23-11-2009
Publisher: Elsevier BV
Date: 2012
Publisher: Springer International Publishing
Date: 2016
Publisher: Springer International Publishing
Date: 2015
Publisher: Wiley
Date: 18-07-2015
DOI: 10.1002/ASI.23222
Publisher: Wiley
Date: 28-03-2012
DOI: 10.1002/ASI.22639
Publisher: ACM
Date: 11-08-2002
Publisher: ACM
Date: 06-11-2009
Publisher: ACM
Date: 26-08-2014
Publisher: ACM
Date: 28-11-2017
DOI: 10.1145/3152771
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 02-2008
Publisher: Springer Berlin Heidelberg
Date: 2009
Publisher: ACM
Date: 06-08-2006
Publisher: Springer Berlin Heidelberg
Date: 2011
Publisher: Springer Berlin Heidelberg
Date: 2008
Publisher: Springer Berlin Heidelberg
Date: 2011
Publisher: ACM Press
Date: 2013
Publisher: ACM
Date: 09-08-2015
DOI: 10.1145/2766462
Publisher: ACM
Date: 03-11-2003
Publisher: ACM
Date: 08-10-2023
Publisher: Association for Computing Machinery (ACM)
Date: 20-05-2012
Abstract: INEX investigates focused retrieval from structured documents by providing large test collections of structured documents, uniform evaluation measures, and a forum for organizations to compare their results. This paper reports on the INEX 2011 evaluation campaign, which consisted of five active tracks: Books and Social Search, Data Centric, Question Answering, Relevance Feedback, and Snippet Retrieval. INEX 2011 saw a range of new tasks and tracks, such as Social Book Search, Faceted Search, Snippet Retrieval, and Tweet Contextualization.
Publisher: Association for Computing Machinery (ACM)
Date: 05-06-2017
DOI: 10.1145/2975590
Abstract: Re-finding is the process of searching for information that a user has previously encountered and is a common activity carried out with information retrieval systems. In this work, we investigate re-finding in the context of vertical search, differentiating and modeling user re-finding behavior within different media and topic domains, including images, news, reference material, and movies. We distinguish the re-finding behavior in vertical domains from re-finding in a general search context and engineer features that are effective in differentiating re-finding across the domains. The features are then used to build machine-learned models, achieving an accuracy of re-finding detection in verticals of 85.7% on average. Our results demonstrate that detecting re-finding in specific verticals is more difficult than examining re-finding for general search tasks. We then investigate the effectiveness of differentiating re-finding behavior in two restricted contexts: We consider the case where the history of a searcher’s interactions with the search system is not available. In this scenario, our features and models achieve an average accuracy of 77.5% across the domains. We then examine the detection of re-finding during the early part of a search session. Both of these restrictions represent potential real-world search scenarios, where a system is attempting to learn about a user but may have limited information available. Finally, we investigate in which types of domains re-finding is most difficult. Here, it would appear that re-finding images is particularly challenging for users. This research has implications for search engine design, in terms of adapting search results by predicting the type of user tasks and potentially enabling the presentation of vertical-specific results when re-finding is identified. To the best of our knowledge, this is the first work to investigate the issue of vertical re-finding.
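As a rough sketch of the kind of pipeline the abstract describes (behavioural features fed to a machine-learned classifier that detects re-finding), the Python fragment below trains a gradient-boosted classifier on synthetic stand-in data. The feature names and the data are illustrative assumptions, not the features engineered in the paper.

import numpy as np
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import cross_val_score

# Synthetic stand-in data: one row per query, with made-up behavioural features
# such as overlap with the user's earlier queries, position within the session,
# and an id for the vertical (images, news, reference, movies).
rng = np.random.default_rng(0)
X = rng.random((400, 3))
y = rng.integers(0, 2, size=400)      # 1 = re-finding query, 0 = new-finding query

clf = GradientBoostingClassifier(random_state=0)
print(cross_val_score(clf, X, y, cv=5).mean())   # cross-validated accuracy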
Publisher: Springer Berlin Heidelberg
Date: 2008
Publisher: Springer International Publishing
Date: 2015
Publisher: Association for Computing Machinery (ACM)
Date: 04-2011
Abstract: Machine transliteration is the process of automatically transforming the script of a word from a source language to a target language, while preserving pronunciation. The development of algorithms specifically for machine transliteration began over a decade ago based on the phonetics of source and target languages, followed by approaches using statistical and language-specific methods. In this survey, we review the key methodologies introduced in the transliteration literature. The approaches are categorized based on the resources and algorithms used, and the effectiveness is compared.
Publisher: Springer Berlin Heidelberg
Date: 2008
Publisher: Association for Computing Machinery (ACM)
Date: 21-01-2013
Abstract: INEX investigates focused retrieval from structured documents by providing large test collections of structured documents, uniform evaluation measures, and a forum for organizations to compare their results. This paper reports on the INEX 2013 evaluation campaign, which consisted of four activities addressing three themes: searching professional and user-generated data (Social Book Search track); searching structured or semantic data (Linked Data track); and focused retrieval (Snippet Retrieval and Tweet Contextualization tracks). INEX 2013 was an exciting year for INEX, in which we consolidated the collaboration with (other activities in) CLEF and for the second time ran our workshop as part of the CLEF labs in order to facilitate knowledge transfer between the evaluation forums. This paper gives an overview of all the INEX 2013 tracks, their aims and tasks, and the test collections that were built, and provides an initial analysis of the results.
Publisher: ACM
Date: 12-08-2012
DOI: 10.1145/2348283
Publisher: Springer Berlin Heidelberg
Date: 2005
DOI: 10.1007/11575832_23
Publisher: Springer Berlin Heidelberg
Date: 2006
DOI: 10.1007/11610113_7
Publisher: ACM
Date: 27-06-2018
Publisher: ACM
Date: 09-08-2015
Publisher: ACM
Date: 07-07-2016
Publisher: ACM
Date: 24-10-2016
DOI: 10.1145/2983323
Publisher: ACM
Date: 07-12-2017
DOI: 10.1145/3166072
Publisher: ACM
Date: 24-09-2007
Publisher: ACM
Date: 05-12-2012
Publisher: ACM
Date: 05-12-2012
Publisher: Wiley
Date: 11-10-2014
DOI: 10.1002/ASI.22951
Publisher: ACM
Date: 06-08-2006
DOI: 10.1145/1148170
Publisher: Association for Computing Machinery (ACM)
Date: 05-06-2017
DOI: 10.1145/3052768
Abstract: Information retrieval systems aim to help users satisfy information needs. We argue that the goal of the person using the system, and the pattern of behavior that they exhibit as they proceed to attain that goal, should be incorporated into the methods and techniques used to evaluate the effectiveness of IR systems, so that the resulting effectiveness scores have a useful interpretation that corresponds to the users’ search experience. In particular, we investigate the role of search task complexity, and show that it has a direct bearing on the number of relevant answer documents sought by users in response to an information need, suggesting that useful effectiveness metrics must be goal sensitive. We further suggest that user behavior while scanning results listings is affected by the rate at which their goal is being realized, and hence that appropriate effectiveness metrics must be adaptive to the presence (or not) of relevant documents in the ranking. In response to these two observations, we present a new effectiveness metric, INST, that has both of the desired properties: INST employs a parameter T, a direct measure of the user’s search goal, that adjusts the top-weightedness of the evaluation score; moreover, as progress towards the target T is made, the modeled user behavior is adapted to reflect the remaining expectations. INST is experimentally compared to previous effectiveness metrics, including Average Precision (AP), Normalized Discounted Cumulative Gain (NDCG), and Rank-Biased Precision (RBP), demonstrating our claims as to INST’s usefulness. Like RBP, INST is a weighted-precision metric, meaning that each score can be accompanied by a residual that quantifies the extent of the score uncertainty caused by unjudged documents. As part of our experimentation, we use crowd-sourced data and score residuals to demonstrate that a wide range of queries arise for even quite specific information needs, and that these variant queries introduce significant levels of residual uncertainty into typical experimental evaluations. These causes of variability have wide-reaching implications for experiment design, and for the construction of test collections.
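For context on the weighted-precision family the abstract refers to, the sketch below computes plain Rank-Biased Precision together with its residual for unjudged documents. It is a generic illustration under the standard RBP definition, not the INST weighting introduced in the paper.

def rbp(judgments, p=0.8):
    """Rank-biased precision and its residual.

    judgments: relevance values (1/0), or None for unjudged, in rank order.
    p: persistence parameter; higher p gives a deeper, less top-weighted evaluation.
    """
    score, residual = 0.0, 0.0
    for rank, rel in enumerate(judgments):
        weight = (1 - p) * p ** rank
        if rel is None:
            residual += weight        # unjudged document contributes score uncertainty
        else:
            score += weight * rel
    residual += p ** len(judgments)   # everything below the evaluated ranking is unjudged
    return score, residual

print(rbp([1, None, 0, 1, None], p=0.8))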
Publisher: Wiley
Date: 25-02-2004
DOI: 10.1002/ASI.20011
Publisher: Association for Computing Machinery (ACM)
Date: 22-11-2022
DOI: 10.1145/3483237
Abstract: In many search scenarios, such as exploratory, comparative, or survey-oriented search, users interact with dynamic search systems to satisfy multi-aspect information needs. These systems utilize different dynamic approaches that exploit various user feedback granularity types. Although studies have provided insights about the role of many components of these systems, they used black-box and isolated experimental setups. Therefore, the effects of these components or their interactions are still not well understood. We address this by following a methodology based on Analysis of Variance (ANOVA). We built a Grid Of Points that consists of systems based on different ways to instantiate three components: initial rankers, dynamic rerankers, and user feedback granularity. Using evaluation scores based on the TREC Dynamic Domain collections, we built several ANOVA models to estimate the effects. We found that (i) although all components significantly affect search effectiveness, the initial ranker has the largest effective size, (ii) the effect sizes of these components vary based on the length of the search session and the used effectiveness metric, and (iii) initial rankers and dynamic rerankers have more prominent effects than user feedback granularity. To improve effectiveness, we recommend improving the quality of initial rankers and dynamic rerankers. This does not require eliciting detailed user feedback, which might be expensive or invasive.
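A minimal sketch of the ANOVA-based methodology the abstract describes, assuming a grid of points over three factors (initial ranker, dynamic reranker, feedback granularity) and synthetic per-topic scores. The factor levels and scores are placeholders, and the model formula is an illustrative assumption rather than the exact models fitted in the article.

import itertools
import numpy as np
import pandas as pd
import statsmodels.api as sm
from statsmodels.formula.api import ols

rng = np.random.default_rng(0)
rows = []
for ranker, reranker, feedback in itertools.product(
        ["bm25", "lm"], ["none", "rocchio"], ["document", "passage"]):
    for topic in range(25):                       # 25 hypothetical topics
        rows.append({
            "initial_ranker": ranker,
            "reranker": reranker,
            "feedback": feedback,
            "score": rng.uniform(0.2, 0.8),       # placeholder effectiveness score
        })
df = pd.DataFrame(rows)

model = ols("score ~ C(initial_ranker) + C(reranker) + C(feedback)", data=df).fit()
print(sm.stats.anova_lm(model, typ=2))            # per-factor sums of squares and p-values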
Publisher: ACM
Date: 22-10-2015
Publisher: Springer Berlin Heidelberg
Date: 2013
Publisher: ACM
Date: 07-08-2017
Publisher: Springer Berlin Heidelberg
Date: 2013
Publisher: ACM
Date: 24-07-2011
Publisher: ACM
Date: 19-07-2009
Publisher: Springer International Publishing
Date: 2018
Publisher: ACM
Date: 05-12-2016
DOI: 10.1145/3015022
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Date: 2018
Publisher: ACM
Date: 03-07-2014
Publisher: ACM
Date: 03-07-2014
Publisher: Springer Science and Business Media LLC
Date: 06-2016
Publisher: IEEE
Date: 04-2014
Publisher: ACM
Date: 21-10-2023
Publisher: Springer International Publishing
Date: 2015
Publisher: ACM
Date: 29-10-2012
Publisher: Springer International Publishing
Date: 2016
Publisher: ACM
Date: 12-08-2012
Publisher: ACM
Date: 27-06-2018
Publisher: Springer International Publishing
Date: 2021
Publisher: ACM
Date: 24-07-2011
Publisher: Springer International Publishing
Date: 2017
Publisher: ACM
Date: 09-08-2015
Publisher: ACM
Date: 07-07-2016
DOI: 10.1145/2911451
Publisher: ACM
Date: 18-08-2010
Publisher: ACM
Date: 20-08-2017
DOI: 10.1145/3106668
Publisher: Association for Computing Machinery (ACM)
Date: 04-01-2017
DOI: 10.1145/3002172
Abstract: Magnitude estimation is a psychophysical scaling technique for the measurement of sensation, where observers assign numbers to stimuli in response to their perceived intensity. We investigate the use of magnitude estimation for judging the relevance of documents for information retrieval evaluation, carrying out a large-scale user study across 18 TREC topics and collecting over 50,000 magnitude estimation judgments using crowdsourcing. Our analysis shows that magnitude estimation judgments can be reliably collected using crowdsourcing, are competitive in terms of assessor cost, and are, on average, rank-aligned with ordinal judgments made by expert relevance assessors. We explore the application of magnitude estimation for IR evaluation, calibrating two gain-based effectiveness metrics, nDCG and ERR, directly from user-reported perceptions of relevance. A comparison of TREC system effectiveness rankings based on binary, ordinal, and magnitude estimation relevance shows substantial variation; in particular, the top systems ranked using magnitude estimation and ordinal judgments differ substantially. Analysis of the magnitude estimation scores shows that this effect is due in part to varying perceptions of relevance: different users have different perceptions of the impact of relative differences in document relevance. These results have direct implications for IR evaluation, suggesting that current assumptions about a single view of relevance being sufficient to represent a population of users are unlikely to hold.
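To make the gain-calibration idea concrete, the sketch below computes nDCG twice for the same ranking: once with the standard exponential gains derived from ordinal grades, and once with hypothetical gains of the kind that could be calibrated from magnitude-estimation scores. The calibrated values are invented for illustration only.

import math

def dcg(gains):
    """Discounted cumulative gain of a ranked list of gain values."""
    return sum(g / math.log2(rank + 2) for rank, g in enumerate(gains))

def ndcg(gains):
    ideal = dcg(sorted(gains, reverse=True))
    return dcg(gains) / ideal if ideal > 0 else 0.0

# Standard exponential gains from ordinal grades 0-3: 2^grade - 1.
grades = [3, 1, 0, 2, 1]
exponential_gains = [2 ** g - 1 for g in grades]

# Hypothetical gains derived from (invented) median magnitude-estimation scores.
calibrated_gains = [7.0, 1.4, 0.0, 3.1, 1.4]

print(ndcg(exponential_gains), ndcg(calibrated_gains))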
Publisher: ACM
Date: 20-08-2017
Publisher: Association for Computing Machinery (ACM)
Date: 05-2011
Abstract: Searchers on the Web often aim to find key resources about a topic. Finding such results is called topic distillation. Previous research has shown that the use of sources of evidence such as page indegree and URL structure can significantly improve search performance on interconnected collections such as the Web, beyond the use of simple term distribution statistics. This article presents a new approach to improve topic distillation by exploring the use of external sources of evidence: link structure, including query-dependent indegree and outdegree, and web page characteristics, such as the density of anchor links. Our experiments with the TREC .GOV collection, an 18GB crawl of the US .gov domain from 2002, show that using such evidence can significantly improve search effectiveness, with combinations of evidence leading to significant performance gains over both full-text and anchor-text baselines. Moreover, we demonstrate that, at a different scope level, both local query-dependent outdegree and query-dependent indegree outperformed their global query-independent counterparts, and at the same scope level, outdegree outperformed indegree. Adding query-dependent indegree or page characteristics to query-dependent outdegree produced a small, but not significant, improvement.
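A hedged sketch of evidence combination in this spirit: a simple linear mix of a full-text score with query-dependent link counts and anchor density. The weights and the linear form are assumptions for illustration, not the fusion actually evaluated in the article.

def distillation_score(content_score, qd_outdegree, qd_indegree, anchor_density,
                       w_content=1.0, w_out=0.3, w_in=0.2, w_anchor=0.1):
    """Combine a full-text retrieval score with link and page evidence.

    All weights, and the linear combination itself, are illustrative assumptions.
    """
    return (w_content * content_score
            + w_out * qd_outdegree
            + w_in * qd_indegree
            + w_anchor * anchor_density)

print(distillation_score(content_score=2.4, qd_outdegree=5,
                         qd_indegree=3, anchor_density=0.2))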
Publisher: Springer Berlin Heidelberg
Date: 2006
DOI: 10.1007/11880561_21
Publisher: ACM
Date: 18-07-2023
Publisher: ACM
Date: 07-08-2017
Publisher: ACM
Date: 08-12-2015
Publisher: ACM
Date: 08-12-2015
Publisher: ACM
Date: 06-11-2017
Publisher: ACM Press
Date: 2018
Publisher: IEEE
Date: 2007
DOI: 10.1109/ICIS.2007.61
Publisher: ACM
Date: 19-07-2009
Publisher: Springer Science and Business Media LLC
Date: 12-10-2010
Publisher: Association for Computing Machinery (ACM)
Date: 09-06-2016
DOI: 10.1145/2882782
Abstract: We present a study of which baseline to use when testing a new retrieval technique. In contrast to past work, we show that measuring a statistically significant improvement over a weak baseline is not a good predictor of whether a similar improvement will be measured on a strong baseline. Sometimes strong baselines are made worse when a new technique is applied. We investigate whether conducting comparisons against a range of weaker baselines can increase confidence that an observed effect will also show improvements on a stronger baseline. Our results indicate that this is not the case -- at best, testing against a range of baselines means that an experimenter can be more confident that the new technique is unlikely to significantly harm a strong baseline. Examining recent past work, we present evidence that the information retrieval (IR) community continues to test against weak baselines. This is unfortunate as, in light of our experiments, we conclude that the only way to be confident that a new technique is a contribution is to compare it against nothing less than the state of the art.
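As a small illustration of the kind of baseline comparison discussed, the snippet below runs a paired t-test over per-topic scores for a baseline and a new technique. The numbers are invented, and the choice of test is an assumption rather than the specific methodology used in the paper.

from scipy import stats

# Hypothetical per-topic effectiveness scores for a strong baseline and a new technique.
baseline = [0.31, 0.42, 0.18, 0.55, 0.27, 0.49, 0.36, 0.22, 0.41, 0.33]
new_run  = [0.35, 0.40, 0.21, 0.58, 0.25, 0.52, 0.39, 0.24, 0.44, 0.36]

t, p = stats.ttest_rel(new_run, baseline)
print(f"paired t = {t:.3f}, p = {p:.3f}")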
Publisher: ACM
Date: 28-07-2013
Publisher: Association for Computing Machinery (ACM)
Date: 29-09-2018
DOI: 10.1145/3239572
Abstract: One typical way of building test collections for offline measurement of information retrieval systems is to pool the ranked outputs of different systems down to some chosen depth d and then form relevance judgments for those documents only. Non-pooled documents—ones that did not appear in the top-d sets of any of the contributing systems—are then deemed to be non-relevant for the purposes of evaluating the relative behavior of the systems. In this article, we use RBP-derived residuals to re-examine the reliability of that process. By fitting the RBP parameter ϕ to maximize similarity between AP- and NDCG-induced system rankings, on the one hand, and RBP-induced rankings, on the other, an estimate can be made as to the potential score uncertainty associated with those two recall-based metrics. We then consider the effect that residual size—as an indicator of possible measurement uncertainty in utility-based metrics—has in connection with recall-based metrics by computing the effect of increasing pool sizes and examining the trends that arise in terms of both metric score and system separability using standard statistical tests. The experimental results show that the confidence levels expressed via the p-values generated by statistical tests are only weakly connected to the size of the residual and to the degree of measurement uncertainty caused by the presence of unjudged documents. Statistical confidence estimates are, however, largely consistent as pooling depths are altered. We therefore recommend that all such experimental results should report, in addition to the outcomes of statistical significance tests, the residual measurements generated by a suitably matched weighted-precision metric, to give a clear indication of measurement uncertainty that arises due to the presence of unjudged documents in test collections with finite pooled judgments.
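A minimal sketch of depth-d pooling as described at the start of the abstract, assuming each run is simply a ranked list of document ids. It illustrates pool formation only, not the RBP residual analysis that follows.

def pool(runs, depth):
    """Union of the top-`depth` documents contributed by each run.

    runs: iterable of ranked document-id lists, one per contributing system.
    Documents outside the pool are treated as non-relevant during evaluation.
    """
    pooled = set()
    for run in runs:
        pooled.update(run[:depth])
    return pooled

runs = [["d3", "d1", "d7", "d2"], ["d1", "d9", "d3", "d4"]]
print(sorted(pool(runs, depth=2)))   # ['d1', 'd3', 'd9']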
Publisher: ACM
Date: 06-08-2006
Publisher: ACM
Date: 07-08-2017
Publisher: ACM
Date: 26-11-2014
Publisher: ACM Press
Date: 2013
Publisher: Elsevier BV
Date: 07-2010
Publisher: ACM
Date: 20-07-2008
Publisher: Springer Berlin Heidelberg
Date: 2012
Publisher: Springer Berlin Heidelberg
Date: 2009
Publisher: Springer International Publishing
Date: 2022
Publisher: ACM
Date: 09-08-2015
Publisher: Elsevier BV
Date: 11-2021
Publisher: Springer Berlin Heidelberg
Date: 2009
Publisher: ACM
Date: 26-11-2014
Publisher: Springer International Publishing
Date: 2017
Publisher: ACM
Date: 17-10-2015
Publisher: ACM
Date: 05-12-2013
Publisher: ACM
Date: 05-12-2013
Publisher: ACM
Date: 18-07-2023
Publisher: ACM
Date: 04-11-2002
Publisher: Springer Berlin Heidelberg
Date: 2008
Publisher: ACM
Date: 07-12-2017
Publisher: ACM
Date: 20-07-2008
Publisher: Association for Computing Machinery (ACM)
Date: 21-12-2012
Abstract: INEX investigates focused retrieval from structured documents by providing large test collections of structured documents, uniform evaluation measures, and a forum for organizations to compare their results. This paper reports on the INEX'12 evaluation campaign, which consisted of five tracks: Linked Data, Relevance Feedback, Snippet Retrieval, Social Book Search, and Tweet Contextualization. INEX'12 was an exciting year for INEX, in which we joined forces with CLEF and for the first time ran our workshop as part of the CLEF labs in order to facilitate knowledge transfer between the evaluation forums.
Publisher: ACM
Date: 12-09-2016
Publisher: ACM
Date: 03-2018
DOI: 10.1145/3176349
Publisher: Springer International Publishing
Date: 2015
Publisher: Springer International Publishing
Date: 2015
Publisher: Springer International Publishing
Date: 2015
Publisher: ACM
Date: 03-11-2014
Publisher: ACM
Date: 24-10-2016
Publisher: ACM
Date: 06-11-2009
DOI: 10.1145/1651318
Publisher: ACM
Date: 26-11-2014
Publisher: Elsevier BV
Date: 2007
Publisher: ACM
Date: 07-12-2017
Start Date: 12-2018
End Date: 06-2023
Amount: $367,775.00
Funder: Australian Research Council
Start Date: 2013
End Date: 12-2016
Amount: $315,000.00
Funder: Australian Research Council
Start Date: 08-2006
End Date: 08-2009
Amount: $210,000.00
Funder: Australian Research Council
Start Date: 06-2019
End Date: 12-2024
Amount: $380,000.00
Funder: Australian Research Council
Start Date: 06-2016
End Date: 06-2019
Amount: $394,000.00
Funder: Australian Research Council
Start Date: 2014
End Date: 12-2017
Amount: $372,110.00
Funder: Australian Research Council