A recent study on the preprint server arXiv confirms that training AI chatbots with too much low-quality content from social media inflicts “brain damage-like” effects on them, leading to declines in both IQ and EQ.
The research reveals a critical issue: when large language models are fed vast amounts of short, fast-paced, and sensational social media posts, their reasoning abilities are the first to collapse. The models begin to “cut corners,” skipping crucial reasoning steps, or even abandoning thinking altogether, directly outputting incorrect answers. Worse, the higher the proportion of “junk” in the data, the more pronounced this intelligence-reducing effect becomes.
To quantify the impact, the research team conducted a large-scale “AI personality test.” They trained several open-source models, including Meta’s Llama 3 and Alibaba’s Qwen, on one million posts from the X platform. The tests found that models, which originally had fairly normal personalities, saw their negative personality traits amplified under the continuous intake of “junk information,” even beginning to show signs of “psychopathic” tendencies.
Subsequent remediation experiments were also discouraging. Attempts to “cure” the models by refining instructions or mixing in high-quality data showed limited effectiveness. The models’ ingrained “bad habits” of skipping deep thinking and rushing to answers proved difficult to eradicate. This demonstrates that post-hoc fixes are far less effective than ensuring a “healthy diet” from the data source.
The core takeaway is simple: data quality is fundamental to AI. Experts emphasize that the future requires extremely strict screening and filtering of training data to block low-quality noise at the source.
Currently, platforms like LinkedIn have announced plans to use user data for AI training. This study undoubtedly serves as a wake-up call: before indiscriminately feeding data to models, have we properly sorted the “garbage”? Otherwise, what we get might not be intelligent assistants, but a cohort of AI suffering from “brain damage.”