Can machine learning models improve traditional asset pricing strategies?
In Empirical Asset Pricing via Machine Learning, Shihao Gu, Bryan Kelly, and Dacheng Xiu test a variety of machine learning algorithms to predict stock returns and pit them against traditional linear models.
Using over 900 potential return predictors, they evaluate these methods on both cross-sectional stock selection and time-series market timing tasks and conclude:
The best-performing algorithms are nonlinear models (trees and neural nets) that capture complex predictor interactions missed by linear approaches.
Importantly, the algorithms consistently rediscover known factors like momentum, liquidity, and volatility and confirm the primacy of these traditional return drivers.
Despite the promising results, even the best ML models explain only a small fraction of return variance (monthly R² well below 1%), as most fluctuations are driven by unpredictable news.
Moreover, high-turnover strategies could in practice face liquidity constraints and trading costs that erode their excess returns, especially when involving small cap stocks. The highest Sharpe (2.45) came from an equal-weighted strategy that heavily tilts toward micro-cap stocks, which may be impossible at scale due to market impact.