Modern neural networks, with billions of parameters, are so overparameterized that they can "overfit" even random, structureless data. Yet when trained on datasets with structure, they learn the ...