Favorites - For LLMs it entirely depends on what size models you want to use and how fast... - kbin.life

There have been multiple accounts created with the sole purpose of posting advertisement posts or replies containing unsolicited advertising.

Accounts which solely post advertisements, or persistently post them may be terminated.

fhein , 13 hours ago

For LLMs it entirely depends on what size models you want to use and how fast you want it to run. Since there’s diminishing returns to increasing model sizes, i.e. a 14B model isn’t twice as good as a 7B model, the best bang for the buck will be achieved with the smallest model you think has acceptable quality. And if you think generation speeds of around 1 token/second are acceptable, you’ll probably get more value for money using partial offloading.

If your answer is “I don’t know what models I want to run” then a second-hand RTX3090 is probably your best bet. If you want to run larger models, building a rig with multiple (used) RTX3090 is probably still the cheapest way to do it.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

daddy32 12 hours ago
NorthWestWind 11 hours ago
Fisch 11 hours ago
ClamDrinker 11 hours ago
Deckweiss 13 hours ago
AreaKode 9 hours ago
DaGeek247 8 hours ago
AMillionMonkeys 8 hours ago
Viper3210 6 hours ago
JackGreenEarth 24 minutes ago

Federation

Status:

/m/[email protected]

Microblog (178)

Thread

TheBigBrother

@[email protected]

Added: 16 hours ago
Views: 8
Online: -
Ratio: 0

Magazine

selfhosted

@[email protected]

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don’t control.

Rules:

Be civil: we’re here to support and learn from one another. Insults won’t be tolerated. Flame wars are frowned upon.
No spam posting.
Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it’s not obvious why your post topic revolves around selfhosting, please include details to make it clear.
Don’t duplicate the full text of your blog or github here. Just post the link for folks to click.
Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).
No trolling.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

Created: 1 year ago
Owner: r00ty
Subscribers: 1
Online: -

Threads 2968
Comments 61042
Posts 178
Replies 412
Moderators 1
Moderation log 29

Moderators

r00ty

Active people