TY - JOUR
T1 - Towards a Window-based Diverse Entity Summarisation Engine in Publish/Subscribe Systems
AU - Pavlopoulou, Niki
AU - Curry, Edward
N1 - Publisher Copyright:
1Copyright ©2019 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
PY - 2019
Y1 - 2019
N2 - The rise of Smart Homes, Smart Cities and Internet of Things results in the creation of a wide range of entity-based data streams and users interested in the real-time analysis of these streams. These smart environments possess characteristics, like dynamism, continuity, heterogeneity and high volume of data and users. A suitable data dissemination paradigm is needed that can overcome these challenges and at the same time, provide expressive notifications to users, but not at the expense of usability or resources. Publish/Subscribe systems can efficiently realise some of these requirements; however, they need additional support when applied in smart environments to overcome assumptions related to usability and redundancy-awareness. Therefore, the key question of the paper is: Can we define an entity-centric Publish/Subscribe system that provides expressive user notifications along with high usability and limited resource usage? In this work, we explore this question and propose a Publish/Subscribe system with windowing, data fusion, and top-k diverse ranking that can result in the creation of expressive entity summaries using limited resources. Our results show that sending a top-k fused diverse summary as a notification is better than sending all the separate notifications or the fused ones without top-k filtering. Specifically, top-k fused diverse summarisation results in 50% to 80% reduction of forwarded messages and redundancy-awareness with an F-score ranging from 0.35 to 0.73 depending on the k. Nevertheless, these results are achieved at the expense of a slightly higher latency; therefore, there is some trade-off between latency, the number of forwarded messages, and expressiveness.
AB - The rise of Smart Homes, Smart Cities and Internet of Things results in the creation of a wide range of entity-based data streams and users interested in the real-time analysis of these streams. These smart environments possess characteristics, like dynamism, continuity, heterogeneity and high volume of data and users. A suitable data dissemination paradigm is needed that can overcome these challenges and at the same time, provide expressive notifications to users, but not at the expense of usability or resources. Publish/Subscribe systems can efficiently realise some of these requirements; however, they need additional support when applied in smart environments to overcome assumptions related to usability and redundancy-awareness. Therefore, the key question of the paper is: Can we define an entity-centric Publish/Subscribe system that provides expressive user notifications along with high usability and limited resource usage? In this work, we explore this question and propose a Publish/Subscribe system with windowing, data fusion, and top-k diverse ranking that can result in the creation of expressive entity summaries using limited resources. Our results show that sending a top-k fused diverse summary as a notification is better than sending all the separate notifications or the fused ones without top-k filtering. Specifically, top-k fused diverse summarisation results in 50% to 80% reduction of forwarded messages and redundancy-awareness with an F-score ranging from 0.35 to 0.73 depending on the k. Nevertheless, these results are achieved at the expense of a slightly higher latency; therefore, there is some trade-off between latency, the number of forwarded messages, and expressiveness.
KW - Data Fusion
KW - Diversity
KW - Entity Summarisation
KW - Publish/Subscribe Systems
KW - RDF Graphs
UR - http://www.scopus.com/inward/record.url?scp=85072730477&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85072730477
SN - 1613-0073
VL - 2446
SP - 32
EP - 39
JO - CEUR Workshop Proceedings
JF - CEUR Workshop Proceedings
T2 - 2nd International Workshop on EntitY REtrieval, EYRE 2019
Y2 - 3 November 2019
ER -