Targeting predictors in random forest regression

B-Tier
Journal: International Journal of Forecasting
Year: 2023
Volume: 39
Issue: 2
Pages: 841-868

Authors (4)

Borup, Daniel (Aarhus Universitet) Christensen, Bent Jesper (Aarhus Universitet) Mühlbach, Nicolaj Søndergaard (not in RePEc) Nielsen, Mikkel Slot (not in RePEc)

Score contribution per author:

0.505 = (α=2.02 / 4 authors) × 1.0x B-tier

α: calibrated so average coauthorship-adjusted count equals average raw count

Abstract

Random forest (RF) regression is an extremely popular tool for analyzing high-dimensional data. Nonetheless, its benefits may be lessened in sparse settings due to weak predictors, and a pre-estimation dimension reduction (targeting) step is required. We show that proper targeting controls the probability of placing splits along strong predictors, thus providing an important complement to RF’s feature sampling. This is supported by simulations using finite representative samples. Moreover, we quantify the immediate gain from targeting in terms of the increased strength of individual trees. Macroeconomic and financial applications show that the bias–variance trade-off implied by targeting, due to increased correlation among trees in the forest, is balanced at a medium degree of targeting, selecting the best 5%–30% of commonly applied predictors. Improvements in the predictive accuracy of targeted RF relative to ordinary RF are considerable, up to 21%, occurring both in recessions and expansions, particularly at long horizons.

Technical Details

RePEc Handle
repec:eee:intfor:v:39:y:2023:i:2:p:841-868
Journal Field
Econometrics
Author Count
4
Added to Database
2026-01-24