Stop Using Ollama

(sleepingrobots.com)

454 points | by Zetaphor 5 hours ago

51 comments

u1hcw9nx 8 minutes ago
Two Views of MIT-Style Licenses:
1. MIT-style licenses are "do what you want" as long as you provide a single line of attribution. Including building big closed source business around it.
2. MIT-style licenses are "do what you want" under the law, but they carry moral, GPL-like obligations to think about the "community."
To my knowledge Georgi Gerganov, the creator of llama.cpp, has only complained about attribution when it was missing. As an open-source developer, he selected a permissive license and has not complained about other issues, only the lack of credit. It seems he treats the MIT license as the first kind.
The article has other good points not related to licensing that are good to know. Like performance issues and simplicity that makes me consider llama.cpp.
cientifico 2 hours ago
For most users that wanted to run LLM locally, ollama solved the UX problem.
One command, and you are running the models even with the rocm drivers without knowing.
If llama provides such UX, they failed terrible at communicating that. Starting with the name. Llama.cpp: that's a cpp library! Ollama is the wrapper. That's the mental model. I don't want to build my own program! I just want to have fun :-P
[-]
- anakaine 1 hour ago
  Llama.cpp now has a gui installed by default. It previously lacked this. Times have changed.
  [-]
  - nikodunk 1 hour ago
    Having read above article, I just gave llama.cpp a shot. It is as easy as the author says now, though definitely not documented quite as well. My quickstart:
    brew install llama.cpp
    llama-server -hf ggml-org/gemma-4-E4B-it-GGUF --port 8000
    Go to localhost:8000 for the Web UI. On Linux it accelerates correctly on my AMD GPU, which Ollama failed to do, though of course everyone's mileage seems to vary on this.
    [-]
    - teekert 1 hour ago
      Was hoping it was so easy :) But I probably need to look into it some more.
      llama_model_load: error loading model: error loading model architecture: unknown model architecture: 'gemma4' llama_model_load_from_file_impl: failed to load model
      Edit: @below, I used `nix-shell -p llama-cpp` so not brew related. Could indeed be an older version indeed! I'll check.
      [-]
      - roosgit 20 minutes ago
        I just hit that error a few minutes ago. I build my llama.cpp from source because I use CUDA on Linux. So I made the mistake of trying to run Gemma4 on an older version I had and I got the same error. It’s possible brew installs an older version which doens’t support Gemma4 yet.
        [-]
        teekert 10 minutes ago
        Ah it was indeed just that!
        I'm now on:
        $ llama --version version: 8770 (82764d8) built with GNU 15.2.0 for Linux x86_64
        And this works as advertised, nice chat interface, but no openai API I guess, so no opencode...
        zozbot234 15 minutes ago
        And that's exactly why llama.cpp is not usable by casual users. They follow the "move fast and break things" model. With ollama, you just have to make sure you're getting/building the latest version.
  - OtherShrezzing 1 hour ago
    While that might be true, for as long as its name is “.cpp”, people are going to think it’s a C++ library and avoid it.
    [-]
    - eterm 1 hour ago
      This is the first I'm learning that it isn't just a C++ library.
      In fact the first line of the wikipedia article is:
      > llama.cpp is an open source software library
      [-]
    - RobotToaster 1 hour ago
      It would make sense to just make the GUI a separate project, they could call it llama.gui.
    - figassis 1 hour ago
      This is correct, and I avoided it for this reason, did not have the bandwidth to get into any cpp rabbit hole so just used whatever seemed to abstract it away.
  - mijoharas 1 hour ago
    Frankly I think the cli UX and documentation is still much better for ollama.
    It makes a bunch of decisions for you so you don't have to think much to get a model up and running.
- samus 22 minutes ago
  How about kobold.cpp then? Or LMStudio (I know it's not open source, but at least they give proper credit to llama.cpp)?
  Re curation: they should strive to not integrate broken support for models and avoid uploading broken GGUFs.
- ekianjo 19 minutes ago
  > For most users that wanted to run LLM locally, ollama solved the UX problem
  This does not absolve them from the license violation
- well_ackshually 58 minutes ago
  >solved the UX problem.
  >One command
  Notwithstanding the fact that there's about zero difference between `ollama run model-name` and `llama-cpp -hf model-name`, and that running things in the terminal is already a gigantic UX blocker (Ollama's popularity comes from the fact that it has a GUI), why are you putting the blame back on an open source project that owes you approximately zero communication ?
  [-]
  - zozbot234 56 minutes ago
    > Ollama's popularity comes from the fact that it has a GUI
    It's not the GUI, it's the curated model hosting platform. Way easier to use than HF for casual users.
    [-]
    - kgwgk 1 minute ago
      It also made easy for casual users to think they were running deepseek.
- amelius 1 hour ago
  Whip that llama! Oh wait, that's a different program.
  [-]
  - mech422 49 minutes ago
    LOL
    https://www.youtube.com/watch?v=HaF-nRS_CWM
- FrozenSynapse 1 hour ago
  but if ollama is much slower, that's cutting on your fun and you'll be having better fun with a faster GUI
  [-]
  - UqWBcuFx6NV4r 13 minutes ago
    You’ve completely missed the point.
flux3125 3 minutes ago
I stopped using Ollama a couple of months ago. Not out of frustration, but because llama.cpp has improved a lot recently with router mode, hot-swapping, a modern and simple web UI, MCP support and lots of other improvements.
FeepingCreature 5 minutes ago
I always avoided Ollama because it smelled like a project that was trying so desperately to own the entire workflow. I guess I dodged a bigger bullet than I knew.
0xbadcafebee 2 hours ago
No mention of the fact that Ollama is about 1000x easier to use. Llama.cpp is a great project, but it's also one of the least user friendly pieces of software I've used. I don't think anyone in the project cares about normal users.
I started with Ollama, and it was great. But I moved to llama.cpp to have more up-to-date fixes. I still use Ollama to pull and list my models because it's so easy. I then built my own set of scripts to populate a separate cache directory of hardlinks so llama-swap can load the gguf's into llama.cpp.
[-]
- AndroTux 1 hour ago
  Exactly. The blog post states that the alternatives listed are similarly intuitive. They are not. If you just need a chat app, then sure, there’s plenty of options. But if you want an OpenAI compatible API with model management, accessibility breaks down fast.
  I’m open to suggestions, but the alternatives outlined in the blog post ain’t it.
  [-]
  - mentalgear 1 hour ago
    The reported alternatives seem pretty User-Friendly to me:
    > LM Studio gives you a GUI if that’s what you want. It uses llama.cpp under the hood, exposes all the knobs, and supports any GGUF model without lock-in.
    > Jan(https://www.jan.ai/) is another open-source desktop app with a clean chat interface and local-first design.
    > Msty(https://msty.ai/) offers a polished GUI with multi-model support and built-in RAG. koboldcpp is another option with a web UI and extensive configuration options.
    API wise: LM Studio has REST, OpenAI and more API Compatibilities.
  - homarp 1 hour ago
    like someone said above: brew install llama.cpp
    llama-server -hf ggml-org/gemma-4-E4B-it-GGUF --port 8000 (with MCP support and web chat interface)
    and you have OpenAI API on the same 8000 port. (https://github.com/ggml-org/llama.cpp/tree/master/tools/serv... lists the endpoints)
- flux3125 55 minutes ago
  > so llama-swap can load
  Just in case you haven't seen it yet, llama.cpp now has a router mode that lets you hot-swap models. I've switched over from llama-swap and have been happy with it.
- throw9393rj 1 hour ago
  I spend like 2 hours trying to get vulkan acceleration working with ollama, no luck (half models are not supported and crash it). With llama.cpp podman container starts and works in 5 minutes.
- BrissyCoder 1 hour ago
  > No mention of the fact that Ollama is about 1000x easier to use.
  Easier than what?
  I came across LM Studio (mentioned in the post) about 3 years ago before I even knew what Ollama as. It was far better even then.
tosh 34 minutes ago
This is a bit like saying stop using Ubuntu, use Debian instead.
Both llama.cpp and ollama are great and focused on different things and yet complement each other (both can be true at the same time!)
Ollama has great ux and also supports inference via mlx, which has better performance on apple silicon than llama.cpp
I'm using llama.cpp, ollama, lm studio, mlx etc etc depending on what is most convenient for me at the time to get done what I want to get done (e.g. a specific model config to run, mcp, just try a prompt quickly, …)
[-]
- matja 21 minutes ago
  > This is a bit like saying stop using Ubuntu, use Debian instead.
  Not really, because Ubuntu has always acknowledged Debian and explicitly documented the dependency:
  > Debian is the rock on which Ubuntu is built.
  > Ubuntu builds on the Debian architecture and infrastructure and collaborates widely with Debian developers, but there are important differences. Ubuntu has a distinctive user interface, a separate developer community (though many developers participate in both projects) and a different release process.
  Source: https://ubuntu.com/community/docs/governance/debian
  Ollama never has for llama.cpp. That's all that's being asked for, a credit.
  [-]
  - UqWBcuFx6NV4r 11 minutes ago
    OK. That says absolutely nothing about actual UX or anything that matters to most actual users (as opposed to argumentative HN ideologues).
- oefrha 29 minutes ago
  > Both llama.cpp and ollama are great and focused on different things and yet complement each other
  According to the article, ollama is not great (that’s an understatement), focused on making money for the company, stealing clout and nothing else, and hardly complements llama.cpp at all since not long after the initial launch. All of these are backed by evidence.
  You may disagree, but then you need to refute OP’s points, not try to handwave them away with a BS analogy that’s nothing like the original.
- operatingthetan 27 minutes ago
  I guess read the article before commenting?
- damnitbuilds 32 minutes ago
  The author points out that the Ollama people are evil.
  So it is more like saying "Stop using SCO Unix, use Linux instead".
  [-]
  - yuppiepuppie 27 minutes ago
    Where do they use the term "evil"?
    [-]
    - cadamsdotcom 23 minutes ago
      In the gaps between the tops of the lines and the bottoms of the other lines ;)
Zetaphor 5 hours ago
I got tired of repeating the same points and having to dig up sources every time, so here's the timeline (as I know it) in one place with sources.
[-]
- brabel 2 hours ago
  Thanks for writing this, I hope people here will actually read this and not assume this is some unfounded hit piece. I was involved a little bit in llama.cpp and knew most of what you wrote and it’s just disgusting how ollama founders behaved! For people looking for alternatives, I would also recommend llama-file, it’s a one file executable for any OS that includes your chosen model: https://github.com/mozilla-ai/llamafile?tab=readme-ov-file
  It’s truly open source, backed by Mozilla, openly uses llama.cpp and was created by wizard Justine Tunney of CosmopolitanC fame.
  [-]
  - cachius 1 hour ago
    I also thought llamafile deserves a mention. Once you have all model params and tunings done bakes 'em into a single portable binary!
- Mario9382 2 hours ago
  Really nice. I wasn't aware of any of this.
- kelsolaar 2 hours ago
  Great writing, thanks for the summary and timeline.
- robot-wrangler 2 hours ago
  Thanks, did not know any of this.
denismi 1 hour ago
Hmm..
```
  pacman -Ss ollama | wc -l                                                                                                              
  16
  pacman -Ss llama.cpp | wc -l
  0
  pacman -Ss lmstudio | wc -l
  0
```
Maybe some day.
[-]
- mongrelion 53 minutes ago
  llama.cpp moves too quickly to be added as a stable package. Instead, you can get it directly from AUR: https://aur.archlinux.org/packages?O=0&K=llama.cpp
  There are packages for Vulkan, ROCm and CUDA. They all work.
- FlyingSnake 19 minutes ago
  yay -S llama.cpp
  I just installed llama.cpp on CachyOS after reading this article. It’s much faster and better than Ollama.
song 22 minutes ago
So, on a mac, what good alternative to ollama supports mlx for acceleration? My main use case is that I have an old m1 max macbook pro with 64 gb ram that I use as a model server.
zxcholmes 1 hour ago
The name "llama.cpp" doesn't seem very friendly anymore nowadays... Back then, "llama" probably referred to those models from Facebook, and now those Llama series models clearly can't represent the strongest open-source models anymore...
[-]
- kgwgk 1 hour ago
  Doesn't the "llama" in "ollama" present exactly the same issue?
  Edit: or maybe that was your point. I guess that for historical reasons this is a kind of generic name for local deployments now (see https://www.reddit.com/r/LocalLLaMA) just like people will call anything ChatGPT.
zarzavat 25 minutes ago
It's as if Ollama is trying to create a walled garden, but the garden is outside of their property, so all it achieves is walling themselves in.
alfiedotwtf 1 minute ago
I'm a llama.cpp user, but apart from the MIT licensing issue, I personally don't see what's the problem here is? Sure Ollama could have advertised better that llama.cpp was it's original backend, but were they obligated to? It's no different to Docker or VMWare that hitch a ride on kernel primitives etc.
rothific 11 minutes ago
I've been experimenting with running Gemma with MLX directly within my own harness: https://github.com/cjroth/mlx-harness
usernomdeguerre 3 hours ago
Do they still not let you change the default model folder? You had to go through this whole song and dance to manually register a model via a pointless dockerfile wannabe that then seemed to copy the original model into their hash storage (again, unable to change where that storage lived).
At the time I dropped it for LMStudio, which to be fair was not fully open source either, but at least exposed the model folder and integrated with HF rather than a proprietary model garden for no good reason.
[-]
- zozbot234 1 hour ago
  > Do they still not let you change the default model folder?
  Actually they do. It's environment variable OLLAMA_MODELS in the server configuration file.
- andreidbr 2 hours ago
  This also annoyed me a lot. I was running it before upgrading the SSD storage and I wanted to compare with LM Studio. Figured it would be good to have both interfaces use the same models downloaded from HF.
  Had to go down the same rabbit hole of finding where things are, how they're sorted/separated/etc. It was unnecessarily painful
opem 8 minutes ago
There is also lemonade-server from AMD. Although I am not sure if that is any better.
fy20 2 hours ago
It feels like a bit of history is missing... If ollama was founded 3 years before llama.cpp was released, what engine did they use then? When did they transition?
[-]
- wolvoleo 1 hour ago
  I don't think that is the case. Llama.cpp appeared within weeks after meta released llama to select researchers (which then made it out to the public). 3 years before that nobody knew of the name llama. I'm sure that llama.cpp existed first
- Maxious 1 hour ago
  They spent several years in stealth mode but the initial release was llama.cpp.
  Ollama v0.0.1 "Fast inference server written in Go, powered by llama.cpp" https://github.com/ollama/ollama/tree/v0.0.1
  [-]
  - em-bee 41 minutes ago
    They spent several years in stealth mode
    doing what?
    trying to build themselves what llama.cpp ended up doing for them?
    [-]
    - saghul 22 minutes ago
      I asked myself the same question. Some other commenter mentioned above they started with some Kubernetes infrastructure thing and they pivoted later.
endymion-light 14 minutes ago
I'm sorry, on a mac, Ollama just works. It lets me use a model and test it quickly. This is like saying stop using google drive, upload everything to s3 instead!
When i'm using Ollama - I honeslty don't care about performance, I'm looking to try out a model and then if it seems good, place it onto a most dedicated stack specifically for it.
[-]
- brabel 4 minutes ago
  Ollama is a bit easier to use, you’re right. But the point of the article is the way they just disregarded the license of llama.cpp, moved away from open source while still claiming to be open source and pivoted to cloud offerings when the whole point was to run local models all while without contributing anything back to the big open source projects it owns its existence to. Maybe you don’t care about performance (weird given performance is the main blocker for local LLMs) but you should care about the ethics of companies making the product you use?
  And anyway this thread has lots of alternatives that are even easier to use and don’t shit on the open source community making things happen.
speedgoose 2 hours ago
I prefer Ollama over the suggested alternatives.
I will switch once we have good user experience on simple features.
A new model is released on HF or the Ollama registry? One `ollama pull` and it's available. It's underwhelming? `ollama rm`.
[-]
- kennywinker 2 hours ago
  > This creates a recurring pattern on r/LocalLLaMA: new model launches, people try it through Ollama, it’s broken or slow or has botched chat templates, and the model gets blamed instead of the runtime.
  Seems like maybe, at least some of the time, you’re being underwhelmed my ollama not the model.
  The better performance point alone seems worth switching away
  [-]
  - speedgoose 2 hours ago
    I follow the llama.cpp runtime improvements and it’s also true for this project. They may rush a bit less but you also have to wait for a few days after a model release to get a working runtime with most features.
    [-]
    - Maxious 1 hour ago
      Model authors are welcome to add support to llama.cpp before release like IBM did for granite 4 https://github.com/ggml-org/llama.cpp/pull/13550
- derrikcurran 50 minutes ago
  `wget https://huggingface.co/[USER]/[REPO]/resolve/main/[FILE_NAME...`
  `rm [FILE_NAME]`
  With Ollama, the initial one-time setup is a little easier, and the CLI is useful, but is it worth dysfunctional templates, worse performance, and the other issues? Not to me.
  Jinja templates are very common, and Jinja is not always losslessly convertible to the Go template syntax expected by Ollama. This means that some models simply cannot work correctly with Ollama. Sometimes the effects of this incompatibility are subtle and unpredictable.
- pheggs 2 hours ago
  you can pull directly from huggingface with llama.cpp, and it also has a decent web chat included
  [-]
  - speedgoose 2 hours ago
    Does it have a model registry with an API and hot swapping or you still have to use sometime like llama swap as suggested in the article ? Or is it CLI?
    [-]
    - dminik 1 hour ago
      You can have multiple models served now with loading/unloading with just the server binary.
      https://github.com/ggml-org/llama.cpp/blob/master/tools/serv...
      [-]
      - speedgoose 26 minutes ago
        It only lacks the automatic FIFO loading/unloading then. Maybe it will be there in a few weeks.
renierbotha 8 minutes ago
Thank you, I needed to read this.
iib 38 minutes ago
Has anybody figured some of the best flags to compile llama.cpp for rocm? I'm using the framework desktop and the Vulkan backend, because it was easier to compile out of the box, but I feel there's large peformance gains on the table by swtiching to rocm. Not sure if installing with brew on ubuntu would be easier.
TomGarden 2 hours ago
The performance issues are crazy. Thanks for sharing this
osmsucks 2 hours ago
I noticed the performance issues too. I started using Jan recently and tried running the same model via llama.cpp vs local ollama, and the llama.cpp one was noticeably faster.
aquir 25 minutes ago
I did not know! Shady :(
I was using LM Studio since I've moved to MacOS so that's fine I guess
utopiah 2 hours ago
Not sure why VLC doesn't do that.
It's a joke... but also not really? I mean VLC is "just" an interface to play videos. Videos are content files one "interact" with, mostly play/pause and few other functions like seeking. Because there are different video formats VLC relies on codecs to decode the videos, so basically delegating the "hard" part to codecs.
Now... what's the difference here? A model is a codec, the interactions are sending text/image/etc to it, output is text/image/etc out. It's not even radically bigger in size as videos can be huge, like models.
I'm confused as why this isn't a solved problem, especially (and yes I'm being a big sarcastic here, can't help myself) in a time where "AI" supposedly made all smart wise developers who rely on it 10x or even 1000x more productive.
Weird.
[-]
- sudb 2 hours ago
  What problem is it that you are confused isn't solved?
  I think the codec analogy is neat but isn't the codec here llama.cpp, and the models are content files? Then the equivalent of VLC are things like LMStudio etc. which use llama.cpp to let you run models locally?
  I'd guess one reason we haven't solved the "codec" layer is that there doesn't seem to be a standard that open model trainers have converged on yet?
  [-]
  - imtringued 46 minutes ago
    llama.cpp is the ffmpeg/libavcodec equivalent in this story.
tyfon 2 hours ago
I think the biggest advantage for me with ollama is the ability to "hotswap" models with different utility instead of restarting the server with different models combined with the simple "ollama pull model". In other words, it has been quite convenient.
Due to this post I had to search a bit and it seems that llama.cpp recently got router support[1], so I need to have a look at this.
My main use for this is a discord bot where I have different models for different features like replying to messages with images/video or pure text, and non reply generation of sentiment and image descriptions. These all perform best with different models and it has been very convenient for the server to just swap in and out models on request.
[1] https://huggingface.co/blog/ggml-org/model-management-in-lla...
[-]
- majorchord 2 hours ago
  > the ability to "hotswap" models with different utility instead of restarting the server
  The article mentions llama-swap does this
- hacker_homie 1 hour ago
  Llama.cpp added the ability load/switch models on demand with the max-models and models preset flags.
- segmondy 2 hours ago
  You can do that with llama-server
asim 31 minutes ago
Ah man the VC death trap. It's ok. I don't mean it like that but this is classic. It's unavoidable. They gotta make money. They took money, they gotta make money. It's not easy. Everyone has principles, developers more than anyone. They are developers, they are people like you and me. They didn't even start as ollama. They started as a kubernetes infra project in YC and pivoted. Listen don't be hard on these guys. It's hard enough. Trust me I did it. And not as well them.
This is the game. We shouldn't delude ourselves into thinking there are alternative ways to become profitable around open source, there aren't. You effectively end up in this trap and there's no escape and then you have to compromise on everything to build the company, return the money, make a profit. You took people's money, now you have to make good, there's no choice. And anyone who thinks differently is deluded. Open source only goes one way. To the enterprise. Everything else is burning money and wasting time. Look at Docker. Textbook example of the enormous struggle to capture the value of a project that had so much potential, defined an industry and ultimately failed. Even the reboot failed. Sorry. It did.
This stuff is messy. Give them some credit. They give you an epic open source project. Be grateful for that. And now if you want to move on, move on. They don't need a hard time. They're already having a hard time. These guys are probably sweating bullets trying to make it work while their investors breathe down their necks waiting for the payoff. Let them breathe.
Good luck to you ollama guys!
WhereIsTheTruth 10 minutes ago
The state of LLM as a service is just depressing
It is a parasitic stack that redirects investment into service wrappers while leaving core infrastructure underfunded
san_tekart 1 hour ago
The CLI is great locally, but the architecture fights you in production. Putting a stateful daemon that manages its own blob storage inside a container is a classic anti-pattern. I ended up moving to a proper stateless binary like llama-server for k8s.
dragochat 45 minutes ago
how about the others:
- vLLM https://vllm.ai/ ?
- oMLX https://github.com/jundot/omlx ?

mrkeen 55 minutes ago

> Red Hat’s ramalama is worth a look too, a container-native model runner that explicitly credits its upstream dependencies front and center. Exactly what Ollama should have done from the start.

  % ramalama run qwen3.5-9b
  Error: Manifest for qwen3.5-9b:latest was not found in the Ollama registry

[-]

mrkeen 46 minutes ago

I've now given ramalama a look:

  % ramalama run qwen3.5   
   > hi

  Server or container exited. Shutting down client.

  % ramalama run gemma4:e2b
   > hello

  Server or container exited. Shutting down client.

dhruv3006 1 hour ago
ollama is pretty intuitive to use still - dont see why will stop.
thot_experiment 1 hour ago
I was pretty big on ollama, it seemed like a great default solution. I had alpha that it was a trash organization but I didn't listen because I just liked having a reliable inference backend that didn't require me to install torch. I switched to llama.cpp for everything maybe 6 months ago because of how fucking frustrating every one of my interactions with ollama (the organization) were. I wanna publicly apologize to everyone who's concerns I brushed off. Ollama is a vampire on the culture and their demise cannot come soon enough.
FWIW llama.cpp does almost everything ollama does better than ollama with the exception of model management, but like, be real, you can just ask it to write an API of your preferred shape and qwen will handle it without issue.

mentalgear 1 hour ago

> Ollama is a Y Combinator-backed (W21) startup, founded by engineers who previously built a Docker GUI that was acquired by Docker Inc. The playbook is familiar: wrap an existing open-source project in a user-friendly interface, build a user base, raise money, then figure out monetization.

    The progression follows the pattern cleanly:

    1. Launch on open source, build on llama.cpp, gain community trust
    2. Minimize attribution, make the product look self-sufficient to investors
    3. Create lock-in, proprietary model registry format, hashed filenames that don’t work with other tools
    4. Launch closed-source components, the GUI app
    5. Add cloud services, the monetization vector

Havoc 1 hour ago
Alas people want convenience and don’t care about this sort of stuff.
DeathArrow 1 hour ago
I see no mention of vLLM in the article.
[-]
- StrauXX 47 minutes ago
  vLLM isn't suitable for people running LLMs side-by-side with regular applications on their PC. It is very good at hosting LLMs for production on dedicated servers. For the prod usecase ollama/llamacpp are practically useless (but that's ok - it's not the projects goal to be).
NamlchakKhandro 1 hour ago
LM Studio is 1000x easier to use than ollama btw
damnitbuilds 34 minutes ago
I am trying to run models that are on the edge of what my hardware can support. I guess many people are.
So given, as the author states, Ollama runs the LLMs inefficiently, what is the tool that runs them most efficiently on limited hardware ?
yokoprime 2 hours ago
i had no idea about all this. especially the performance and bugs. thanks for informing me!
sminchev 49 minutes ago
With such concurrency in the market, it is unforgivable to manage a product that way. The concurrency will kill you.
Clients get disappointed, alternatives have better services, and more are popping out monthly. If they continue that way, nothing good will happen, unfortunately :(
dnnddidiej 2 hours ago
On a practical note if fumbles connection handling as to be unusable to download anything.
NamlchakKhandro 1 hour ago
drop ollama in the bin, no one needs it.
eternaut 1 hour ago
the article nails it!
stuaxo 43 minutes ago
Way too much text - feels LLM written.
At the top could have been a link to equivalent llamacpp workflows to ollamas.
I wish the op had gone back and written this as a human, I agree with not using Ollama but don't like reading slop.
[-]
- IshKebab 11 minutes ago
  Yeah my thoughts exactly. Definitely slop. I have no objection to using AI to help writing. I just don't want to read the same sloppy cliches again and again and again. The short sentences. The Bigger Picture. Here's the rub. It's not just A, it's B.
  It's like those cliche titles - for fun and profit, the unreasonable effectiveness of, all you need is, etc. etc. but throughout the prose. Stop it guys!
goodpoint 1 hour ago
The missing attribution pattern is nasty.
dackdel 2 hours ago
i use goose by block
[-]
- sudb 2 hours ago
  seems pretty unrelated to the post?
  also you might be the only person in the wild I've seen admit to this
arcza 1 hour ago
I find the style of writing incredibly annoying (it doesn't make the point, full of hyperbole) and the website has the standard slopsite black background and glowing CSS.
[-]
- Karuma 1 hour ago
  That's because it was fully written by an LLM, as usual lately with all the articles on the front page of HN.
  No wonder I get downvoted to hell every time I mention this... People here can't even tell anymore. They just find this horrible slop completely normal. HN is just another dead website filled with slop articles, time to move on to some smaller reddit communities...
  [-]
  - arcza 14 minutes ago
    As Claude would say: You're absolutely right!
_bobm 1 hour ago
amen
paganel 1 hour ago
Another scummy YCombinator project, one of many lately. Looks like no-one is left at the wheel, at least as long as the valuations (and hence money) keep coming in.
eddie-wang 1 hour ago
[dead]
ipeev 1 hour ago
[dead]
holliplex 30 minutes ago
Please, please, please stop getting Claude to write blogposts.