TY - GEN
T1 - On the semantic representation and extraction of complex category descriptors
AU - Freitas, André
AU - Vieira, Rafael
AU - Curry, Edward
AU - Carvalho, Danilo
AU - Da Silva, João Carlos Pereira
PY - 2014
Y1 - 2014
N2 - Natural language descriptors used for categorizations are present from folksonomies to ontologies. While some descriptors are composed of simple expressions, other descriptors have complex compositional patterns (e.g. 'French Senators Of The Second Empire', 'Churches Destroyed In The Great Fire Of London And Not Rebuilt'). As conceptual models get more complex and decentralized, more content is transferred to unstructured natural language descriptors, increasing the terminological variation, reducing the conceptual integration and the structure level of the model. This work describes a representation for complex natural language category descriptors (NLCDs). In the representation, complex categories are decomposed into a graph of primitive concepts, supporting their interlinking and semantic interpretation. A category extractor is built and the quality of its extraction under the proposed representation model is evaluated.
AB - Natural language descriptors used for categorizations are present from folksonomies to ontologies. While some descriptors are composed of simple expressions, other descriptors have complex compositional patterns (e.g. 'French Senators Of The Second Empire', 'Churches Destroyed In The Great Fire Of London And Not Rebuilt'). As conceptual models get more complex and decentralized, more content is transferred to unstructured natural language descriptors, increasing the terminological variation, reducing the conceptual integration and the structure level of the model. This work describes a representation for complex natural language category descriptors (NLCDs). In the representation, complex categories are decomposed into a graph of primitive concepts, supporting their interlinking and semantic interpretation. A category extractor is built and the quality of its extraction under the proposed representation model is evaluated.
UR - https://www.scopus.com/pages/publications/84958523151
U2 - 10.1007/978-3-319-07983-7_6
DO - 10.1007/978-3-319-07983-7_6
M3 - Conference Publication
AN - SCOPUS:84958523151
SN - 9783319079820
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 45
EP - 50
BT - Natural Language Processing and Information Systems - 19th International Conference on Applications of Natural Language to Information Systems, NLDB 2014, Proceedings
PB - Springer-Verlag
T2 - 19th International Conference on Applications of Natural Language to Information Systems, NLDB 2014
Y2 - 18 June 2014 through 20 June 2014
ER -