Options
Enriching Taxonomies With Functional Domain Knowledge
Date Issued
2018-06-27
Date Available
2019-04-24T13:40:48Z
Abstract
The rising need to harvest domain specific knowledge in several applications is largely limited by the ability to dynamically grow structured knowledge representations, due to the increasing emergence of new concepts and their semantic relationships with existing ones. Such enrichment of existing hierarchical knowledge sources with new information to better model the "changing world" presents two-fold challenges: (1) Detection of previously unknown entities or concepts, and (2) Insertion of the new concepts into the knowledge structure, respecting the semantic integrity of the created relationships. To this end we propose a novel framework, ETF, to enrich large-scale, generic taxonomies with new concepts from resources such as news and research publications. Our approach learns a high-dimensional embedding for the existing concepts of the taxonomy, as well as for the new concepts. During the insertion of a new concept, this embedding is used to identify semantically similar neighborhoods within the existing taxonomy. The potential parent-child relationships linking the new concepts to the existing ones are then predicted using a set of semantic and graph features. Extensive evaluation of ETF on large, real-world taxonomies of Wikipedia and WordNet showcase more than 5% F1-score improvements compared to state-of-the-art baselines. We further demonstrate that ETF can accurately categorize newly emerging concepts and question-answer pairs across different domains.
Other Sponsorship
National Science Foundation (US)
Type of Material
Conference Publication
Start Page
745
End Page
754
Copyright (Published Version)
2018 the Authors
Web versions
Language
English
Status of Item
Not peer reviewed
Journal
41st International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2018
Conference Details
41st International ACM SIGIR Conference on Research and Development in Information Retrieval, Ann Arbor Michigan, USA. July 8-12, 2018
ISBN
978-1-4503-5657-2
This item is made available under a Creative Commons License
File(s)
Loading...
Name
ajwani_sigir18.pdf
Size
843.71 KB
Format
Adobe PDF
Checksum (MD5)
2b2b78c41c91a481b8679891c279fe94
Owning collection