14 comments

  • stanpinte 7 minutes ago
    We are developing many applications in my company, some of them safety critical. A natural routing way could happen for certain phases of development, and interfaces via git. One agent works on branch a and is responsible for brainstorm planning specs, and the other is responsible code and tests. The first agent creates tickets for the second one and the second one consumes these. This works with today’s standard harness.
  • _pdp_ 14 minutes ago
    There are so many proxies like this now but I can tell you from first hand experience this is not going to work. You cannot just route away from a situation at such a high level especially when we are talking about models that are quite different in behaviour, with different context windows and tuned to different tool uses. The harness is doing all kind of funky things to compensate for issues (like tool call truncation) that a proxy that routes dynamically like this will work against the very same strategies that make the harness work.

    Interesting concept, work in theory, but I cannot see this being part of larger system.

  • josalhor 1 hour ago
    We need LLM query routing at the OS level like Mobile data. I know it will sound crazy but hear me out. I think about this AI inference as infrastructure. I do not want to pay for it on every app I use it on. I do not think "I have to pay the mobile data of youtube, and the mobile data of whatsapp etc.". I pay Mobile data infrastructure and let my device route it appropiately. In fact, if we ever go the local llm route, you could have LLM capabilities without having access to the internet (or local LAN), and your OS/computer is the only one capable of doing that routing for you.
    • solenoid0937 28 minutes ago
      It doesn't sound crazy at all, this seems almost obvious. The OS should provide a chat completions server and the user should be able to select the underlying LLM's server. This should be just like selecting a default search engine or browser.

      Hopefully the EU forces US tech giants to do this. God knows Apple and Google won't do this on their own. They gotta get that sweet default provider revenue.

  • dd8601fn 2 hours ago
    It's funny how much that first paragraph is Claude's voice. I don't know how it got trained so hard to use, "the shape of" for everything.
    • paradox460 1 hour ago
      Loads of ed sheeran in the training data?
  • JSR_FDED 1 hour ago
    Slight tangent, but “Wayfinder sits behind whatever OpenAI-compatible client you already use” reminds me that descriptions of where proxies sit in the information flow always seem so arbitrary to me:

      - “after the client”
      - “reverse proxy” (in front  of servers)
      - “proxy” (in front of client)
    
    I always have to look this up, surely there must be a standardized way to describe this?
    • parasti 1 hour ago
      "after the client" and "in front of client" can mean the same thing depending on your viewpoint.
      • JSR_FDED 11 minutes ago
        Exactly, that’s my point
  • try-working 4 hours ago
    Love to see local/cloud routing explicitly supported.

    I'm building another router for routing between local and remote models, ShowHN coming up later today. Here's a sneak preview of the github: https://github.com/try-works/role-model

  • ListeningPie 1 hour ago
    can you send to multiple LLMs to compare responses? From that create a heuristic of which LLM gets what.
  • throwawayk7h 2 hours ago
    It'd be nice to just have a command prefix e.g.

    /local fix my typo

    • girvo 1 hour ago
      That’s what I did with Pi, super simple :)
  • quijoteuniv 2 hours ago
    This is the way!
  • terekhindc 14 minutes ago
    [dead]
  • tcballard 3 hours ago
    [dead]
  • niemandhier 2 hours ago
    [dead]
  • kevinten10 1 hour ago
    [dead]
  • tcballard 3 hours ago
    [flagged]