Digitization and data frames for card index records

B-Tier
Journal: Explorations in Economic History
Year: 2023
Volume: 87
Issue: C

Authors (3)

Amujala, Someswar (not in RePEc) Vossmeyer, Angela (not in RePEc) Das, Sanjiv R. (Santa Clara University)

Score contribution per author:

0.673 = (α=2.02 / 3 authors) × 1.0x B-tier

α: calibrated so average coauthorship-adjusted count equals average raw count

Abstract

We develop a methodology for converting card index archival records into usable data frames for statistical and textual analyses. Leveraging machine learning and natural-language processing tools from Amazon Web Services (AWS), we overcome hurdles associated with character recognition, inconsistent data reporting, column misalignment, and irregular naming. In this article, we detail the step-by-step conversion process and discuss remedies for common problems and edge cases, using historical records from the Reconstruction Finance Corporation.

Technical Details

RePEc Handle
repec:eee:exehis:v:87:y:2023:i:c:s001449832200047x
Journal Field
Economic History
Author Count
3
Added to Database
2026-01-25