UCD : Diachronic Text Classification with Character, Word, and Syntactic N-grams

Files in This Item:
File: insight_publication.pdf (191.84 kB, Adobe PDF)
Title: UCD : Diachronic Text Classification with Character, Word, and Syntactic N-grams
Authors: Szymanski, Terrence; Lynch, Gerard
Permanent link: http://hdl.handle.net/10197/6846
Date: 5-Jun-2015
Online since: 2015-08-26T12:03:01Z
Abstract: We present our submission to SemEval-2015 Task 7: Diachronic Text Evaluation, in which we approach the task of assigning a date to a text as a multi-class classification problem. We extract n-gram features from the text at the letter, word, and syntactic level, and use these to train a classifier on date-labeled training data. We also incorporate date probabilities of syntactic features as estimated from a very large external corpus of books. Our system achieved the highest performance of all systems on subtask 2: identifying texts by specific time language use.
Funding Details: Enterprise Ireland; Science Foundation Ireland
Type of material: Conference Publication
Keywords: Machine Learning & Statistics; Stylistic classification
Other versions: http://alt.qcri.org/semeval2015/
Language: en
Status of Item: Peer reviewed
Conference Details: Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015). United States
Appears in Collections: Computer Science Research Collection; Insight Research Collection
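
The approach summarised in the abstract above (n-gram features feeding a multi-class classifier over date labels) can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not say which classifier or n-gram sizes they used, so the multinomial naive Bayes model, the choice of character trigrams plus word unigrams and bigrams, and all names (`char_ngrams`, `word_ngrams`, `extract_features`, `NaiveBayesDater`) are assumptions made for illustration. Syntactic n-grams and the external book-corpus probabilities are left out because they would need a parser and data not described here.

```python
import math
from collections import Counter, defaultdict


def char_ngrams(text, n):
    """Return all overlapping character n-grams of the text."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def word_ngrams(text, n):
    """Return all overlapping word n-grams, lowercased, split on whitespace."""
    words = text.lower().split()
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]


def extract_features(text):
    """Count character trigrams and word uni/bigrams (sizes are an assumption)."""
    feats = Counter()
    feats.update("c3:" + g for g in char_ngrams(text.lower(), 3))
    feats.update("w1:" + g for g in word_ngrams(text, 1))
    feats.update("w2:" + g for g in word_ngrams(text, 2))
    return feats


class NaiveBayesDater:
    """Stand-in classifier: multinomial naive Bayes, one class per date label."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.feature_counts = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.feature_counts[label].update(extract_features(text))
        self.vocab = {f for c in self.feature_counts.values() for f in c}
        return self

    def predict(self, text):
        feats = extract_features(text)
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            counts = self.feature_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n_docs / total_docs)  # class prior
            for feat, k in feats.items():
                if feat in self.vocab:  # ignore features never seen in training
                    # add-one smoothed log-likelihood of the feature
                    score += k * math.log((counts[feat] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label
```

A call such as `NaiveBayesDater().fit(texts, labels).predict(new_text)` returns one of the training date labels, where `texts` is a list of strings and `labels` a parallel list of period labels; real use would need the task's date-labeled training data rather than toy inputs.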

This item is available under the Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Ireland licence. No item may be reproduced for commercial purposes. For other possible restrictions on use, please refer to the publisher's URL where this is made available, or to notes contained in the item itself. Other terms may apply.