An alternate solution is something like a Mac mini with an m series chip and 16gb of unified memory. The neural cores on apple silicon are actually pretty impressive and since they use unified memory the models would have access to whatever the system has.
I only mention it because a Mac mini might be cheaper than GPU with tons of vram by a couple hundred bucks.
And it will sip power comparatively.
4090 with 24gb of vram is $1900 M2 Mac mini with 24gb is $1000