Dual Interpretation of Machine Learning Forecasts

December 17, 2024 · Declared Dead · 🏛 arXiv.org

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors: Philippe Goulet Coulombe, Maximilian Goebel, Karin Klieber
arXiv ID: 2412.13076
Category: econ.EM
Cross-listed: cs.LG, stat.ML
Citations: 3
Venue: arXiv.org
Last Checked: 1 month ago

Abstract

Machine learning predictions are typically interpreted as the sum of contributions of predictors. Yet, each out-of-sample prediction can also be expressed as a linear combination of in-sample values of the predicted variable, with weights corresponding to pairwise proximity scores between current and past economic events. While this dual route leads nowhere in some contexts (e.g., large cross-sectional datasets), it provides sparser interpretations in settings with many regressors and little training data, like macroeconomic forecasting. In this case, the sequence of contributions can be visualized as a time series, allowing analysts to explain predictions as quantifiable combinations of historical analogies. Moreover, the weights can be viewed as those of a data portfolio, inspiring new diagnostic measures such as forecast concentration, short position, and turnover. We show how weights can be retrieved seamlessly for (kernel) ridge regression, random forest, boosted trees, and neural networks. Then, we apply these tools to analyze post-pandemic forecasts of inflation, GDP growth, and recession probabilities. In all cases, the approach opens the black box from a new angle and demonstrates how machine learning models leverage history partly repeating itself.
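
To make the dual reading concrete, here is a minimal sketch for the plain ridge regression case only, under stated assumptions. It relies on the standard ridge algebra (no intercept, synthetic data) and is not the authors' code, since no code is linked from this page. The function name `ridge_dual_weights` and the two diagnostics at the end are hypothetical illustrations; the paper's exact definitions of concentration and short position may differ.

```python
import numpy as np

def ridge_dual_weights(X_train, x_new, lam):
    """Return weights w such that the ridge forecast for x_new equals w @ y_train.

    Ridge (no intercept): beta = (X'X + lam*I)^{-1} X'y, so
    y_hat = x_new' beta = [X (X'X + lam*I)^{-1} x_new]' y = w' y.
    """
    p = X_train.shape[1]
    A = X_train.T @ X_train + lam * np.eye(p)
    return X_train @ np.linalg.solve(A, x_new)

# Synthetic setting with more regressors than observations (assumed sizes).
rng = np.random.default_rng(0)
n, p, lam = 40, 60, 5.0
X = rng.standard_normal((n, p))
y = rng.standard_normal(n)
x_new = rng.standard_normal(p)

# Primal route: forecast as a sum of predictor contributions.
beta = np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ y)
primal_forecast = x_new @ beta

# Dual route: forecast as a weighted sum of in-sample targets.
w = ridge_dual_weights(X, x_new, lam)
dual_forecast = w @ y

# Equivalent kernel form: w = (XX' + lam*I)^{-1} X x_new,
# where X x_new holds linear-kernel proximities to each training row.
w_kernel = np.linalg.solve(X @ X.T + lam * np.eye(n), X @ x_new)

# Illustrative portfolio-style diagnostics (assumed definitions, not the paper's).
short_position = w[w < 0].sum()                         # total negative weight
concentration = np.sum(w**2) / np.sum(np.abs(w)) ** 2   # Herfindahl-like index

print(primal_forecast, dual_forecast)
```

Both routes give the same number; the dual one returns a length-`n` weight vector that can be plotted against the training dates to see which past observations drive the forecast.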



📜 Similar Papers

In the same crypt — econ.EM

R.I.P. 👻 Ghosted

Design-based Analysis in Difference-In-Differences Settings with Staggered Adoption

Susan Athey, Guido Imbens

econ.EM 🏛 J.E 📚 731 cites 7 years ago

R.I.P. 👻 Ghosted

Machine Learning Advances for Time Series Forecasting

Ricardo P. Masini, Marcelo C. Medeiros, Eduardo F. Mendes

econ.EM 🏛 Journal of economic surveys (Print) 📚 408 cites 5 years ago

R.I.P. 👻 Ghosted

Deep Neural Networks for Estimation and Inference

Max H. Farrell, Tengyuan Liang, Sanjog Misra

econ.EM 🏛 Econometrica 📚 261 cites 7 years ago

R.I.P. 👻 Ghosted

Take a Look Around: Using Street View and Satellite Images to Estimate House Prices

Stephen Law, Brooks Paige, Chris Russell

econ.EM 🏛 ACM TIST 📚 150 cites 7 years ago

R.I.P. 👻 Ghosted

Discrete Choice and Rational Inattention: a General Equivalence Result

Mogens Fosgerau, Emerson Melo, ... (+2 more)

econ.EM 🏛 International Economic Review 📚 97 cites 8 years ago

R.I.P. 👻 Ghosted

Estimating Heterogeneous Consumer Preferences for Restaurants and Travel Time Using Mobile Location Data

Susan Athey, David Blei, ... (+3 more)

econ.EM 🏛 arXiv 📚 69 cites 8 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Language Models are Few-Shot Learners

Tom B. Brown, Benjamin Mann, ... (+29 more)

cs.CL 🏛 NeurIPS 📚 54.2K cites 5 years ago

R.I.P. 👻 Ghosted

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Adam Paszke, Sam Gross, ... (+19 more)

cs.LG 🏛 NeurIPS 📚 49.7K cites 6 years ago

R.I.P. 👻 Ghosted

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, Carlos Guestrin

cs.LG 🏛 KDD 📚 49.2K cites 10 years ago

R.I.P. 👻 Ghosted

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, Christian Szegedy

cs.LG 🏛 ICML 📚 46.0K cites 11 years ago