Towards expertise modelling for routing data cleaning tasks within a community of knowledge workers

Research output: Chapter in Book or Conference Publication/ProceedingConference Publicationpeer-review

7 Citations (Scopus)

Abstract

Applications consuming data have to deal with variety of data quality issues such as missing values, duplication, incorrect values, etc. Although automatic approaches can be utilized for data cleaning the results can remain uncertain. Therefore updates suggested by automatic data cleaning algorithms require further human verification. This paper presents an approach for generating tasks for uncertain updates and routing these tasks to appropriate workers based on their expertise. Specifically the paper tackles the problem of modelling the expertise of knowledge workers for the purpose of routing tasks within collaborative data quality management. The proposed expertise model represents the profile of a worker against a set of concepts describing the data. A simple routing algorithm is employed for leveraging the expertise profiles for matching data cleaning tasks with workers. The proposed approach is evaluated on a real world dataset using human workers. The results demonstrate the effectiveness of using concepts described the data for modelling expertise, in terms of likelihood of receiving responses to tasks routed to workers.

Original languageEnglish
Title of host publicationProceedings of ICIQ 2012
Subtitle of host publication17th International Conference on Information Quality
EditorsLaure Berti-Equille, Isabelle Comyn-Wattiau, Monica Scannapieco
PublisherMIT
Pages58-69
Number of pages12
ISBN (Electronic)9781627483964
Publication statusPublished - 2012
Event17th International Conference on Information Quality, ICIQ 2012 - Paris, France
Duration: 16 Nov 201217 Nov 2012

Publication series

NameProceedings of ICIQ 2012: 17th International Conference on Information Quality

Conference

Conference17th International Conference on Information Quality, ICIQ 2012
Country/TerritoryFrance
CityParis
Period16/11/1217/11/12

Keywords

  • Crowd sourcing
  • Data cleaning
  • Linked data
  • Web 2.0

Fingerprint

Dive into the research topics of 'Towards expertise modelling for routing data cleaning tasks within a community of knowledge workers'. Together they form a unique fingerprint.

Cite this