Linked samples and measurement error in historical US census data

B-Tier
Journal: Explorations in Economic History
Year: 2024
Volume: 93
Issue: C

Authors (2)

Hwang, Sam Il Myoung (not in RePEc) Squires, Munir (University of British Columbia)

Score contribution per author:

1.005 = (α=2.01 / 2 authors) × 1.0x B-tier

α: calibrated so average coauthorship-adjusted count equals average raw count

Abstract

The quality of historical US census data is critical to the performance of linking algorithms. We use genealogical profiles to correct measurement error in census names and ages. Our findings suggest that one in every two records has an error in name or age, and human capital is correlated with lower error rates. While errors in age decline across subsequent census rounds from 1850 to 1930, errors in names do not exhibit such trends. Fixing all transcription errors, hence leaving only those errors made at the time of enumeration, would reduce error rates in names by 41 percent. Correcting all names and ages using genealogical profiles leads to 20%–36% more links and fewer false positives. Reassuringly, we find that reducing such errors has a negligible effect on estimates of intergenerational mobility.

Technical Details

RePEc Handle
repec:eee:exehis:v:93:y:2024:i:c:s0014498324000093
Journal Field
Economic History
Author Count
2
Added to Database
2026-01-29