UCD : Diachronic Text Classification with Character, Word, and Syntactic N-grams

Files in This Item:
File Description SizeFormat 
insight_publication.pdf191.84 kBAdobe PDFDownload
Title: UCD : Diachronic Text Classification with Character, Word, and Syntactic N-grams
Authors: Szymanski, Terrence
Lynch, Gerard
Permanent link: http://hdl.handle.net/10197/6846
Date: 5-Jun-2015
Abstract: We present our submission to SemEval-2015Task 7: Diachronic Text Evaluation, in whichwe approach the task of assigning a date toa text as a multi-class classification problem.We extract n-gram features from the text atthe letter, word, and syntactic level, and usethese to train a classifier on date-labeled trainingdata. We also incorporate date probabilitiesof syntactic features as estimated from avery large external corpus of books. Our systemachieved the highest performance of allsystems on subtask 2: identifying texts by specifictime language use.
Funding Details: Enterprise Ireland
Science Foundation Ireland
Type of material: Conference Publication
Keywords: Machine Learning & Statistics;Stylistic classification
Language: en
Status of Item: Peer reviewed
Conference Details: Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015). United States
Appears in Collections:Computer Science Research Collection
Insight Research Collection

Show full item record

Download(s) 20

312
checked on May 25, 2018

Google ScholarTM

Check


This item is available under the Attribution-NonCommercial-NoDerivs 3.0 Ireland. No item may be reproduced for commercial purposes. For other possible restrictions on use please refer to the publisher's URL where this is made available, or to notes contained in the item itself. Other terms may apply.