Abstract
Grammars for Natural Language Processing (NLP) applications are generally built either by linguists - on the basis of their language competence, or by automated tools applied to existing large corpora of language data - using either supervised or unsupervised methods (or a combination of both). Domain knowledge usually played just a little role in this process. The increasing availability of extended knowledge representation systems, like taxonomies and ontologies, is giving the opportunity to consider new approaches to the (automated) generation of processing grammars, especially in the field of domain-oriented Information Extraction (IE). The reason for this being that most of the taxonomies and ontologies are equipped with natural language expressions included in ontology elements like labels, comments or definitions. These de facto established relations between (domain) knowledge and natural language expressions can be exploited for the automatic generation of domain specific NLP and IE grammars. We describe in this paper steps leading to this automation.
| Original language | English |
|---|---|
| Pages (from-to) | 23-30 |
| Number of pages | 8 |
| Journal | CEUR Workshop Proceedings |
| Volume | 925 |
| Publication status | Published - 2012 |
| Event | Workshop on Semantic Web and Information Extraction, SWAIE 2012 - Workshop in Conjunction with the 18th International Conference on Knowledge Engineering and Knowledge Management, EKAW 2012 - Galway, Ireland Duration: 9 Oct 2012 → 9 Oct 2012 |
Keywords
- Business reporting standards
- Grammar generation
- Ontology-based Information Extraction