Can AI-powered financial analysis be trusted?
Edward Xuejun Li, Min Shen, Zhiyuan Tu, and Dexin Zhou study this question in their paper "The Promise and Peril of Generative AI: Evidence from GPT as Sell-Side Analysts".
They analyse 7,114 earnings press releases from 1,000 firms, prompting GPT to identify key sentences, extract financial metrics, and forecast one-year-ahead earnings per share.
Their goal is to explore whether GPT produces reliable earnings forecasts and whether observable processing signals predict output quality. Their main conclusions include:
These findings highlight LLMs lower the cost of processing corporate disclosures but introduce the new challenge of evaluating the quality and reliability of AI-generated outputs.
Notably, GPT outperforms analysts in more than one-quarter of firm-quarters, suggesting meaningful potential when conditions are favourable.
Models however have limited awareness of their own informational constraints, so investors who cannot recognise when model reliability weakens risk making poorly informed decisions.
As a limitation, the study captures only one dimension of GPT's quantitative reasoning, and future refinements may reveal additional processing signals linked to accuracy.
Pre-cutoff performance may also benefit from what the authors call a "time-traveller's advantage," where GPT draws on information unavailable to other analysts at the same time.