Using tags and clustering to identify topic-relevant blogs

Research output: Contribution to conference (Published)Paperpeer-review

29 Citations (Scopus)

Abstract

The Web has experienced an exponential growth in the use of weblogs or blogs. Blog entries are generally organised using tags, informally defined labels which are increasingly being proposed as a 'grassroots' answer to SemanticWeb standards. Despite this, tags have been shown to be weak at partitioning blog data. In this paper, we demonstrate how tags provide useful, discriminating information where the blog corpus is initially partitioned using a conventional clustering technique. Using extensive empirical evaluation we demonstrate how tag cloud information within each cluster allows us to identify the most topic-relevant blogs in the cluster. We conclude that tags have a key auxiliary role in refining and confirming the information produced using typical knowledge discovery techniques.

Original languageEnglish
Publication statusPublished - 2007
Event2007 International Conference on Weblogs and Social Media, ICWSM 2007 - Boulder, CO, United States
Duration: 26 Mar 200728 Mar 2007

Conference

Conference2007 International Conference on Weblogs and Social Media, ICWSM 2007
Country/TerritoryUnited States
CityBoulder, CO
Period26/03/0728/03/07

Keywords

  • Blog
  • Clustering
  • Relevance
  • Tag
  • Tag Cloud

Fingerprint

Dive into the research topics of 'Using tags and clustering to identify topic-relevant blogs'. Together they form a unique fingerprint.

Cite this