Optimal text-based time-series indices

B-Tier

Journal: International Journal of Forecasting

Year: 2026

Volume: 42

Issue: 1

Pages: 44-60

Authors (2)

Ardia, David (HEC Montréal (École des Hautes...) Bluteau, Keven (not in RePEc)

Score contribution per author:

1.009 = (α=2.02 / 2 authors) × 1.0x B-tier

α: calibrated so average coauthorship-adjusted count equals average raw count

View Full Article View on IDEAS/RePEc

Abstract

We propose an approach to construct text-based time-series indices in an optimal way—typically, indices that maximize the contemporaneous relation or the predictive performance with respect to a target variable, such as inflation. Our methodology relies on binary selection matrices that, applied to the vocabulary of tokens, select the relevant texts in the corpus. Various widely known text-based indices, such as the Economic Policy Uncertainty (EPU) index, can be formulated in terms of selection matrices. We design a genetic algorithm with domain-specific knowledge featuring tailor-made crossover and mutation operations to perform the complex optimization. We illustrate our methodology with a corpus of news articles from the Wall Street Journal by optimizing text-based indices that forecast inflation at various horizons.

Optimal text-based time-series indices

Authors (2)

Abstract

Technical Details