When I first got into local LLMs nearly 3 years ago, in mid 2023, the frontier closed models were of course impressively capable.
I then tried my hand at running 7b-size local models, primarily one called Zephyr-7b (what happened to these models?? Dolphin anyone??), on my gaming PC with an 8GB AMD RX 580 GPU. Fair to say it was just a curiosity exercise (in terms of model performance).
Fast forward to this month, and I'm revisiting local LLMs. (Although I no longer have the gaming PC, cost-of-living crisis anyone 😫 )
And the 31b-size models now look more than sufficient. #Qwen has taken the helm in this class. It's still quite expensive to set up locally, though within grasp.
I'm rooting for the edge-computing models now - the ~2b-size models. Due to their low footprint, they're practical for many people to run 24/7 on an SBC at home.
But these edge models are in the 'curiosity category' now.


hey, thanks for your response… yeah that's what I meant, the 2b models aren't usable in their current state, but they'd be more practical for everyday use if they work out…
I actually meant the 31b models are useful for my purposes. I don't do full-on agentic coding, just interactive chat/prompting. For example, I make good use of them for writing Linux shell scripts (as I don't know how to write them myself). Currently I use qwen3.5-flash via the cloud. It's as good as the frontier models from back then, if not better…
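To give a flavour of what I mean by shell-script help: this is a hypothetical sketch (not an actual script from this thread) of the sort of small one-off task I'd ask the model to write for me, here archiving all the `.log` files in a directory into a dated tarball.

```shell
#!/bin/sh
# archive_logs DIR - hypothetical example of a model-written helper script:
# bundles every .log file in DIR into a tarball named logs-YYYY-MM-DD.tar.gz.
archive_logs() {
  dir="$1"
  stamp="$(date +%Y-%m-%d)"
  ( cd "$dir" || return 1
    set -- ./*.log                 # expand matches into positional params
    [ -e "$1" ] || { echo "no .log files in $dir"; return 0; }
    tar -czf "logs-$stamp.tar.gz" "$@"
    echo "archived $# file(s) into logs-$stamp.tar.gz" )
}
```

Nothing fancy, but for someone who can't write shell themselves, having the model handle the glob/edge-case details (like the no-matches case above) is exactly the value I get out of it.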
[deleted by user]
I wanted to use smaller models, but then do more work on the "thinking" process. I didn't get far, because it gets so slow on normal hardware and too expensive on dedicated hardware. Time-consuming (I'm also not a programmer) but a fun project; in the end I just decided to satisfy the privacy angle with Proton's AI, Lumo.
Proton has AI? Damn, that’s gotta be bleeding their coffers
[deleted by user]
They have been working on this. Only 3 months ago it was pretty terrible. Today it's almost on par with ChatGPT. A bit worse at RAG, slower… good enough for normal use.
[deleted by user]