There have been multiple accounts created with the sole purpose of posting advertisement posts or replies containing unsolicited advertising.

Accounts which solely post advertisements, or persistently post them may be terminated.

Self-Hosted AI is pretty darn cool

I don’t consider myself very technical. I’ve never taken a computer science course and don’t know python. I’ve learned some things like Linux, the command line, docker and networking/pfSense because I value my privacy. My point is that anyone can do this, even if you aren’t technical.

I tried both LM Studio and Ollama. I prefer Ollama. Then you download models and use them to have your own private, personal GPT. I access it both on my local machine through the command line but I also installed Open WebUI in a docker container so I can access it on any device on my local network (I don’t expose services to the internet).

Having a private ai/gpt is pretty cool. You can download and test new models. And it is private. Yes, there are ethical concerns about how the model got the training. I’m not minimizing those concerns. But if you want your own AI/GPT assistant, give it a try. I set it up in a couple of hours, and as I said… I’m not even that technical.

EonNShadow ,

“learned some things like Linux, command line, docker, and networking/pfsense” “I don’t consider myself technical”

Don’t sell yourself short, I work in IT and have colleagues on our helpdesk who would struggle endlessly with those concepts.

I hereby dub you a tech person, like it or not, those skills can and do pay the bills.

chagall OP ,

This made me smile. Thank you. The grass is always greener and I sometimes daydream of working in IT instead of healthcare. Maybe someday.

deuleb_biezelbob ,
@deuleb_biezelbob@programming.dev avatar

Nah dont.

ZeroHora ,
@ZeroHora@lemmy.ml avatar

hahahaha best advice ever.

GissaMittJobb ,

Healthcare is pretty rough, I’d be willing to bet that the grass actually is greener in this case.

ITGuyLevi ,

Join us. We have cookies (well at least until the end of our sessions)!

damnthefilibuster ,

Now that you’ve dubbed OP a tech person…

Hey OP, can you help me fix my printer? It’s only printing “RED RUM RED RUM” for some reason.

sugar_in_your_tea ,

Have you tried giving it red rum?

https://batchdistillery.co.uk/wp-content/uploads/2020/06/RR-70-shop.jpg

Oh, and make sure you hold it out with the insides of your arms exposed, it’ll feel less threatening that way.

webghost0101 ,

Thank you for this. I consider myself technical and those words felt like a punch in the gut.

chagall OP ,

I’m sorry if I offended. I can’t code or understand existing code and have always felt that technical people code. I guess I should expand my definition. Again, sorry that my words felt like a punch in the gut… wasn’t my intention at all.

IsoKiero ,

It depends heavily on what you do and what you’re comparing yourself against. I’ve been making a living with IT for nearly 20 years and I still don’t consider myself to be an expert on anything, but it’s a really wide field and what I’ve learned that the things I consider ‘easy’ or ‘simple’ (mostly with linux servers) are surprisingly difficult for people who’d (for example) wipe the floor with me if we competed on planning and setting up an server infrastructure or build enterprise networks.

And of course I’ve also met the other end of spectrum. People who claim to be ‘experts’ or ‘senior techs’ at something are so incompetent on their tasks or their field of knowledge is so ridiculously narrow that I wouldn’t trust them with anything above first tier helpdesk if even that. And the sad part is that those ‘experts’ often make way more money than me because they happened to score a job on some big IT company and their hours are billed accordingly.

And then there’s the whole other can of worms on a forums like this where ‘technical people’ range from someone who can install a operating system by following instructions to the guys who write assembly code to some obscure old hardware just for the fun of it.

AlexanderESmith ,

I've been in my field for 21 years. I make really good money, and have high confidence in my hard-earned skillset. I focus on Cloud Infrastructure and general Linux administration, but I code in a handful of languages to facilitate automation (because I'm lazy). I consider myself extremely well versed in technology in general.

... Then I watch a YouTube video where some guy hand-solders a motherboard for his custom-designed keyboard, and decides it's easier to - instead of adapt someone else's - write his own driver from scratch, then open-sources everything from the PCB, to the CAD files he used to print the case, to the driver itself. Then he uploads the video, titled "Quick weekend project because I couldn't find the keyboard layout I wanted on Amazon", and forgets about the whole thing.

In that moment, I feel like a complete, no-nothing idiot.

A person with a doctorate in astrophysics isn't not-a-doctor because they don't know how to do brain surgery. Hell, a doctor who is a surgeon isn't not-those-things because he isn't - specifically - a brain surgeon.

Don't sell yourself short because other people do things you don't (yet, or even ever). You're a technical person.

GBU_28 ,
scrubbles ,
@scrubbles@poptalk.scrubbles.tech avatar

I was just talking to a member of my devops team and I was talking about this exact thing and they said “I didn’t know you could attach a GPU to a container”. So, yup, just stay on top of this stuff at home and you’ll do fine

possiblylinux127 ,

Who uses GPUs for AI anyway. They cost more than a car sometimes

Flax_vert ,

This gave me confidence as well, thank you 😆

DavidGarcia ,

people need to take a step back and realize we have the capability to trap quasi-omnipotent quasi-demons in our personal computers

yeah they lie a lot and rarely do what you want them to, but that’s just what demons do

And it’s all powered by some dark crystals created with light magic that slowly poison the planet

that’s some arcane bullshit

LastoftheDinosaurs ,
@LastoftheDinosaurs@reddthat.com avatar

How long can something like that really last, though? I wish we had a better idea of the timeline, before the quasi-demons start freelancing lol

HumanPerson ,

Yeah, I like it too. My only issue is ollama’s lack of intel support. I have been looking at issue 1590 on their GitHub. For now I have a 1050ti in a cardboard box PC with other hardware being 10+ years old and a mixed set of RAM totalling 12G. It also has a 100Mbit nic, so I can’t take advantage of full internet speed when downloading models. The worst part is they can support intel, but haven’t merged the solution because of an issue with the windows intel drivers. Linux is fine but I can 't have it. I wasn’t planning to rant, but I already typed it so… enjoy?

chagall OP ,

Yeah, I have an NVDIA GPU and it is magic. The best part is when you are using Ollama, open a second terminal window and enter the command, watch -n 0.5 nvidia-smi and you can see your GPU usage go up and down in real-time as you ask the GPT questions. Pretty cool.

Hopefully they get the ARC folks up and running soon.

BlueKey ,
@BlueKey@kbin.run avatar

You can also achieve this with nvtop and have a pretty TUI (terminal UI).

klopstock , (edited )

There is ipex-llm from Intel which you can use with your intel IGPU/GPU/CPU for llms which also supports ollama.

Goodtoknow ,
@Goodtoknow@lemmy.ca avatar

Have you found much practical use for small models yet? I love the idea that even the 1.1B tinyllama model can run on my phone, but haven’t found much real world use for it yet. Llama3 8b feels better, but not much better for even emails as it’s a bit dumb

chagall OP ,

I use my phone all the time, but I just use a wireguard VPN to tunnel into my home container of Open WebUI. Then I can interact with my desktop machine using a NVIDIA gpu. I’m currently testing mistral-nemo. It’s pretty great but it gets a bit verbose sometimes.

kureta ,

I am also using open webui. Most LLMs are too verbose for me, so I created a model in open-webui with system prompt “Do not repeat the questions. Avoid giving lists as answers. Do not summarize the answer at the end. If asked a follow-up question, respond with only new information, do not repeat previously stated information.” and named it No Nonsense.

chagall OP ,

That’s really smart. I just found out about fabric yesterday and it is helping me with things like what you stated. Prompt engineering is a huge thing.

coffee_with_cream ,

Uncensored models are so much better, too. chatGPT is like one of those plastic children’s toy hammers vs real models are titanium hammers

BlackLaZoR ,
@BlackLaZoR@kbin.run avatar

I access it both on my local machine through the command line

You really don't have to - There's GPT4ALL designed for normal users with very simple GUI

Also, with minimal command line knowledge you can install InvokeAI - probably the best UX for image generating AI on the market. Works both on Linux and Windows

chagall OP ,

It’s so great that there is so much ongoing development of these types of tools out there. I’m currently using openweb ui as my GUI but I’ll give your suggestion a try next week. I haven’t figured out a use case for stable diffusion except for creating new content for the shitposting community on lemmy lol. But if you have any ideas, please let me know… I’d love to test it out if I have a good use case.

BlackLaZoR ,
@BlackLaZoR@kbin.run avatar

But if you have any ideas

Both my avatar and channel cover are made with AI models - so this is a good start.

IMO the biggest potential is indie game dev - AI image generation is amazing for static backgrounds, character design, and with certain loras it absolutely shreds pixelart - I even saw entire workflows for building pixelart animations (I think it was for ComfyUi tho).

Also local image models are uncensored so... porn XD

Zos_Kia ,
@Zos_Kia@lemmynsfw.com avatar

If you like to write, I find that story boarding with stable diffusion is definitely an improvement. The quality of the images is what it is, but they can help you map out scenes and locations, and spot visual details and cues to include in your writing.

possiblylinux127 ,

I switched from OpenWebUI to Alpaca as I had no use for multi accounts.

chagall OP ,

Open WebUI now has a docker environment variable so you can, by default, turn off the login page. You just declare it when you’re spinning up the container and you’re good to go.

  • All
  • Subscribed
  • Moderated
  • Favorites
  • [email protected]
  • random
  • lifeLocal
  • goranko
  • All magazines