Representing texts as contextualized entity-centric linked data graphs

  • Andre Freitas
  • , Sean Oriain
  • , Edward Curry
  • , Joao C.P. Da Silva
  • , Danilo S. Carvalho

Research output: Chapter in Book or Conference Publication/ProceedingConference Publicationpeer-review

7 Citations (Scopus)

Abstract

The integration of a small fraction of the information present in the Web of Documents to the Linked Data Web can provide a significant shift on the amount of information available to data consumers. However, information extracted from text does not easily fit into the usually highly normalized structure of ontology-based datasets. While the representation of structured data assumes a high level of regularity, relatively simple and consistent conceptual models, the representation of information extracted from texts need to take into account large terminological variation, complex contextual/dependency patterns, and fuzzy or conflicting semantics. This work focuses on bridging the gap between structured and unstructured data, proposing the representation of text as structured discourse graphs (SDGs), targeting an RDF representation of unstructured data. The representation focuses on a semantic best-effort information extraction scenario, where information from text is extracted under a pay-as-you-go data quality perspective, trading terminological normalization for domain-independency, context capture, wider representation scope and maximization of textual information capture.

Original languageEnglish
Title of host publicationProceedings - 24th International Workshop on Database and Expert Systems Applications, DEXA 2013
Pages133-137
Number of pages5
DOIs
Publication statusPublished - 2013
Event24th International Workshop on Database and Expert Systems Applications, DEXA 2013 - Prague, Czech Republic
Duration: 26 Aug 201329 Aug 2013

Publication series

NameProceedings - International Workshop on Database and Expert Systems Applications, DEXA
ISSN (Print)1529-4188

Conference

Conference24th International Workshop on Database and Expert Systems Applications, DEXA 2013
Country/TerritoryCzech Republic
CityPrague
Period26/08/1329/08/13

Keywords

  • Discoruse Graphs
  • Discourse Representation
  • Linked Data
  • Semantic Web

Fingerprint

Dive into the research topics of 'Representing texts as contextualized entity-centric linked data graphs'. Together they form a unique fingerprint.

Cite this