Optimal text-based time-series indices

B-Tier
Journal: International Journal of Forecasting
Year: 2026
Volume: 42
Issue: 1
Pages: 44-60

Authors (2)

Score contribution per author:

1.005 = (α=2.01 / 2 authors) × 1.0x B-tier

α: calibrated so average coauthorship-adjusted count equals average raw count

Abstract

We propose an approach to construct text-based time-series indices in an optimal way—typically, indices that maximize the contemporaneous relation or the predictive performance with respect to a target variable, such as inflation. Our methodology relies on binary selection matrices that, applied to the vocabulary of tokens, select the relevant texts in the corpus. Various widely known text-based indices, such as the Economic Policy Uncertainty (EPU) index, can be formulated in terms of selection matrices. We design a genetic algorithm with domain-specific knowledge featuring tailor-made crossover and mutation operations to perform the complex optimization. We illustrate our methodology with a corpus of news articles from the Wall Street Journal by optimizing text-based indices that forecast inflation at various horizons.

Technical Details

RePEc Handle
repec:eee:intfor:v:42:y:2026:i:1:p:44-60
Journal Field
Econometrics
Author Count
2
Added to Database
2026-01-24