Abstract
The growing size, heterogeneity and complexity of databases demand the creation of strategies to facilitate users and systems to consume data. Ideally, query mechanisms should be schema-agnostic, i.e. they should be able to match user queries in their own vocabulary and syntax to the data, ab- stracting data consumers from the representation of the data. This work provides an information- theoretical framework to evaluate the semantic complexity involved in the query-database commu- nication, under a schema-agnostic query scenario. Different entropy measures are introduced to quantify the semantic phenomena involved in the user-database communication, including structural complexity, ambiguity, synonymy and vagueness. The entropy measures are validated using natural language queries over Semantic Web databases. The analysis of the semantic complexity is used to improve the understanding of the core semantic dimensions present at the query-data matching process, allowing the improvement of the design of schema-agnostic query mechanisms and defining measures which can be used to assess the semantic uncertainty or difficulty behind a schema-agnostic querying task.
Original language | English (Ireland) |
---|---|
Title of host publication | 11th International Conference on Computational Semantics (IWCS 2015) |
Publication status | Published - 1 Jan 2015 |
Authors (Note for portal: view the doc link for the full list of authors)
- Authors
- Freitas, Andre;Sales, Juliano Efson;Handschuh, Siegfried;Curry, Edward