AI lab
Had a quick squiz at it, and if it meets your needs, I’d just wait, unless you want to get into the lower levels of things (and run linux), llama-swap is just an inference server, you’ll need something like Open WebUI for chat as well etc.
AI lab
Had a quick squiz at it, and if it meets your needs, I’d just wait, unless you want to get into the lower levels of things (and run linux), llama-swap is just an inference server, you’ll need something like Open WebUI for chat as well etc.
Not sure about AI lab (although I also use podman, prefer Llama-swap), but pretty much everything uses llama-cpp under the hood, which usually takes a day or three to setup for a new architecture. Although I seem to recall them being ready for Qwen3.5 day one due to collaboration.
I find giving it a week or so for the dust to settle (even if it ‘works’, best parameters, quantization bugs etc take a while to shake out) unless there’s a huge motivation.
Also benches are more like guidelines than actual rules, best to do your own on your own use cases.
Devs are reverse centaurs now.
Lines of code was never a good metric, but it looks like productivity to the C-suite. This will bite them (and everyone who uses the code) in the ass. After some spectacular fails it will be judgement that a Dev is most prized for, meanwhile, this.
Still, eight to ten productive hours a day in any sustained fashion is bullshit, more like 3-4 with a bunch of meetings, learning, deciphering etc. filling out the day.