CEOs are quietly realizing the AI replacement plan has a problem. Two problems, actually.
One: the token costs for running AI agents are now exceeding what they were paying the employees they fired.
Two: when the tokens run out, the AI stops. Just stops. No continuity. No workaround. Just a spinning wheel where your workforce used to be.
You fired humans to save money and bought a subscription that bills you into a corner.
The employees you let go knew what to do when things broke. The AI just invoices you for the outage.
And then thereโs the permission problem nobody wants to talk about.
To do its job, the AI agent needs access. Full access. Your systems, your patents, your contracts, your future plans. Everything you spent years building, handed over to a process that has no loyalty, no discretion, and no skin in the game.
You didnโt hire a replacement. You gave a stranger with no soul the keys to everything you own.
Enjoy.
Ollama + zed = OP hybrid system
Llama dot cpp squeezes even more performance out of gpus with limited vram. I was using ollama for the past three or so years locally on a quadro rtx5000. It works great but putting llama dot cpp on my server made it way faster. It's a little more difficult to setup though.
Funny you mention that - Iโm new to local llms but I started with lm studio for convenience, then moved to ollama for a bit of a speed boost. And migrated from cursor to zed for another bit of speed boost. Now working on migrating from ollama to llama.cpp for yet another speed boost, each time trading convenience for performance lol
Although, I just realized that ollama is just a wrapper for llama.cpp - but with some added steps
Yep you are right about the wrapper. One really nice thing about llama dot cpp is that you will be able to use gguf format which blows wide open your options for models. Hugging face dot co if you don't already know.
Massive improvement! Thanks for the advice - itโs significantly snappier.
https://files.catbox.moe/cyg0nm.mov
You got local linux ai hardware recommendations? I never figured out GPUs on loonix
Yes it will run on any modern nvidia card. The more vram the better. The drivers are out there and actively being updated.