Abstract
As governments worldwide, especially in the European Union, increasingly adopt open data, a deeper analysis of the newly published data is becoming a necessity. Apart from analyzing the published datasets themselves, we aimed to analyze the published dataset catalogues. A dataset catalogue, or dataset metadata, contains features that describe in textual form what the data is about. We first acquire data from open data portals, select descriptive dataset catalogue features, and then construct an aggregated textual representation of each dataset. Afterwards, we enrich those textual representations using Natural Language Processing (NLP) methods to create a new, comparable data feature, “Named Entities”. By mining this new feature we are able to produce relatedness networks of datasets and of publishers. These networks are used to point out similarities between the data published across multiple open data portals. Identifying all possible collaborations for integrating and standardizing data features and types would increase the value of the data and ease its analysis.
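The last step of the pipeline described in the abstract, turning per-dataset named entities into relatedness networks, can be illustrated with a minimal sketch. It assumes the named entities have already been extracted. The Jaccard overlap used as the relatedness measure, the `0.3` threshold, the function names, and all sample identifiers and entities are illustrative assumptions, not the method or data of the paper.

```python
from itertools import combinations

# Hypothetical input: dataset id -> (publisher, named entities from the catalogue text).
# All ids, publishers, and entities here are invented for illustration.
catalogue = {
    "ds_a": ("publisher_1", {"Vienna", "Austria", "Eurostat"}),
    "ds_b": ("publisher_2", {"Vienna", "Austria", "OECD"}),
    "ds_c": ("publisher_2", {"Lisbon", "Portugal"}),
}


def jaccard(a, b):
    """Fraction of shared entities between two sets (0.0 if both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def link(profiles, threshold):
    """Return weighted edges between profiles whose overlap meets the threshold."""
    edges = {}
    for left, right in combinations(sorted(profiles), 2):
        weight = jaccard(profiles[left], profiles[right])
        if weight >= threshold:
            edges[(left, right)] = weight
    return edges


def dataset_network(catalogue, threshold=0.3):
    """Link datasets directly by their named-entity sets."""
    profiles = {ds_id: entities for ds_id, (_, entities) in catalogue.items()}
    return link(profiles, threshold)


def publisher_network(catalogue, threshold=0.3):
    """Merge the entities of each publisher's datasets, then link publishers."""
    profiles = {}
    for publisher, entities in catalogue.values():
        profiles.setdefault(publisher, set()).update(entities)
    return link(profiles, threshold)


datasets = dataset_network(catalogue)
publishers = publisher_network(catalogue)
```

In this toy input, `ds_a` and `ds_b` share two of four distinct entities, so they are linked, while `ds_c` stays isolated. Any other set or vector similarity could be substituted for `jaccard` without changing the structure of the sketch.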
| Original language | English |
|---|---|
| Title of host publication | IFIP Advances in Information and Communication Technology |
| Editors | Luis M. Camarinha-Matos, Rosanna Fornasiero, Hamideh Afsarmanesh |
| Publisher | Springer New York LLC |
| Pages | 253-264 |
| Number of pages | 12 |
| ISBN (Print) | 9783319651507 |
| Publication status | Published - 2017 |
| Externally published | Yes |
| Event | 18th IFIP WG 5.5 Working Conference on Virtual Enterprises, PRO-VE 2017 - Vicenza, Italy. Duration: 18 Sep 2017 → 20 Sep 2017 |
Publication series
| Name | IFIP Advances in Information and Communication Technology |
|---|---|
| Volume | 506 |
| ISSN (Print) | 1868-4238 |
Conference
| Conference | 18th IFIP WG 5.5 Working Conference on Virtual Enterprises, PRO-VE 2017 |
|---|---|
| Country/Territory | Italy |
| City | Vicenza |
| Period | 18/09/17 → 20/09/17 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
- SDG 16 Peace, Justice and Strong Institutions
Keywords
- Collaborative network
- Data mining
- E-government
- Open data
- Unstructured data analysis
Fingerprint
Dive into the research topics of 'Mining governmental collaboration through semantic profiling of open data catalogues and publishers'. Together they form a unique fingerprint.