EVE: explainable vector based embedding technique using Wikipedia

Files in This Item:
Access to this item has been restricted by the copyright holder until:2019-06-04
File Description SizeFormat 
insight_publication.pdf638.4 kBAdobe PDFDownload    Request a copy
Title: EVE: explainable vector based embedding technique using Wikipedia
Authors: Qureshi, M. Atif
Greene, Derek
Permanent link: http://hdl.handle.net/10197/10460
Date: 4-Jun-2018
Online since: 2019-05-15T09:26:18Z
Abstract: We present an unsupervised explainable vector embedding technique, called EVE, which is built upon the structure of Wikipedia. The proposed model defines the dimensions of a semantic vector representing a concept using human readable labels, thereby it is readily interpretable. Specifically, each vector is constructed using the Wikipedia category graph structure together with the Wikipedia article link structure. To test the effectiveness of the proposed model, we consider its usefulness in three fundamental tasks: 1) intruder detection to evaluate its ability to identify a non-coherent vector from a list of coherent vectors, 2) ability to cluster to evaluate its tendency to group related vectors together while keeping unrelated vectors in separate clusters, and 3) sorting relevant items first to evaluate its ability to rank vectors (items) relevant to the query in the top order of the result. For each task, we also propose a strategy to generate a task-specific human-interpretable explanation from the model. These demonstrate the overall effectiveness of the explainable embeddings generated by EVE. Finally, we compare EVE with the Word2Vec, FastText, and GloVe embedding techniques across the three tasks, and report improvements over the state-of-the-art.
Funding Details: Science Foundation Ireland
Type of material: Journal Article
Publisher: Springer
Journal: Journal of Intelligent Information Systems
Start page: 1
End page: 29
Copyright (published version): 2018 Springer
Keywords: Distributional semanticsUnsupervised learningWikipedia
DOI: 10.1007/s10844-018-0511-x
Language: en
Status of Item: Peer reviewed
Appears in Collections:Insight Research Collection

Show full item record

Citations 50

checked on May 17, 2019

Google ScholarTM



This item is available under the Attribution-NonCommercial-NoDerivs 3.0 Ireland. No item may be reproduced for commercial purposes. For other possible restrictions on use please refer to the publisher's URL where this is made available, or to notes contained in the item itself. Other terms may apply.