Options
The Problem of False Positives in Automated Census Linking: Evidence from Nineteenth-Century New York's Irish Immigrants
Date Issued
2021-06
Date Available
2021-06-22T10:18:01Z
Abstract
Automated census linkage algorithms have become popular for generating longitudinal data on social mobility, especially for immigrants and their children. But what if these algorithms are particularly bad at tracking immigrants? Using nineteenth-century Irish immigrants as a test case, we examine the most popular of these algorithms—that created by Abramitzky, Boustan, Eriksson (ABE), and their collaborators. Our findings raise serious questions about the quality of automated census links. False positives range from about one-third to one-half of all links depending on the ABE variant used. These bad links lead to sizeable estimation errors when measuring Irish immigrant social mobility.
Other Sponsorship
George Washington University
CUNY Research Foundation
National Endowment for the Humanities
Type of Material
Working Paper
Publisher
University College Dublin. School of Economics
Start Page
1
End Page
55
Series
UCD Centre for Economic Research Working Paper Series
WP2021/14
Copyright (Published Version)
2021 the Authors
Classification
N21
J61
R23
Language
English
Status of Item
Peer reviewed
This item is made available under a Creative Commons License
File(s)
Loading...
Name
WP21_14.pdf
Size
2.96 MB
Format
Adobe PDF
Checksum (MD5)
677e69dd77912693b2239e6818ec4b63
Owning collection