Training a modest machine-learning model uses more carbon than the manufacturing and lifetime use of five automobiles

Seriously… At first I thought, wait I’m pretty sure we got 95% of the way there with transformer design decades or longer ago. What kind of esoteric materials and shapes are they computing together? Then I went to, “More than meets the eye, huh?”