This is true of pretty much every study right now as things accelerate. Even just six months can make a dramatic difference in capabilities.
For example, Meta’s Llama 3-405B has some of the strongest situational awareness of any current model, but that awareness isn’t present to anywhere near the same degree in Llama 2-70B or even 3-70B.