Perfect data isn’t needed as the errors will “even out”.
That is an assumption.
I do not think that it is a correct assumption.
now there’s the problem that most new content on the Internet is low quality AI garbage.
This reminds me about a recommendation from some philosopher - I forgot who it was - he said that you should read only such books that are at least 100 years old.