Mining the Cultural Memory of Irish Industrial Schools Using Word Embedding and Text Classification

Title: Mining the Cultural Memory of Irish Industrial Schools Using Word Embedding and Text Classification
Authors: Leavy, Susan
Keane, Mark T.
Pine, Emilie
Date: 16-Jul-2016
Online since: 2019-05-01T08:42:49Z
Abstract: The Industrial Memories project aims for new distant (i.e., text analytic) and close readings (i.e., witnessing) of the 2009 Ryan Report, the report of the Irish Government’s investigation into abuse at Irish Industrial Schools. The project has digitised the Report and used techniques such as word embedding and automated text classification using machine learning to re-present the Report’s key findings in novel ways that better convey its contents. The Ryan Report exposes the horrific details of systematic abuse of children in Irish industrial schools between 1920 and 1990. It contains 2,600 pages with over 500,000 words detailing evidence from the 9-year-long investigation. However, the Report’s narrative form and its sheer length make many of its findings quite opaque. The Industrial Memories project uses text analytics to examine the language of the Report, to identify recurring patterns and to extract key findings. The project represents the Report via an exploratory web-based interface that supports further analysis of the text. The methodology outlined is scalable and suggests new approaches to such voluminous state documents.
Funding Details: Irish Research Council
Type of material: Conference Publication
Keywords: Irish Industrial Schools; Irish Government; Child abuse; Twentieth century; Text analytics; Machine learning
Language: en
Status of Item: Not peer reviewed
Conference Details: Digital Humanities 2016 Conference, Kraków, Poland, 12-16 July 2016
Appears in Collections: Computer Science Research Collection

This item is available under the Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Ireland licence. No item may be reproduced for commercial purposes. For other possible restrictions on use, please refer to the publisher's URL where this is made available, or to notes contained in the item itself. Other terms may apply.