jakozaur 14 minutes ago
It looks like much more context is required to decide on the best model (e.g., summarizing logs might use a cheap model, whereas you likely want Opus/Mythos/GPT 5.6 to debug multithreading logic). In an agentic system, a decision about the model may be embedded in the decision to orchestrate the model.
g00k an hour ago
stpedgwdgfhgdd 2 hours ago
How does this router translate to $$$ when developing?
peterbell_nyc 31 minutes ago
I'm just trying to figure out why on the fly routing would beat testing and tuning and locking models and versions for each class of call, with evals and auto tunes running to explore more possible models for commonly run classes of prompt over time . . .
spqw an hour ago
As prices increase we will see more of these tools to optimise and make the best use of token budget
k9294 an hour ago
alansaber an hour ago
suyash an hour ago
gautam_io an hour ago
Will this use my Claude Pro/Max subscription? Or will it always use the API billing "pay as you go"?
_pdp_ an hour ago
Also, small LLMs are prone to stop before completion, throw errors and produce loops. Is this factored in the design of the tool? I am not sure.
edit: spellcheck
mkagenius 40 minutes ago
1. https://github.com/instavm/murmur - Murmur
debarshri an hour ago
arendtio an hour ago
emilio_srg2 an hour ago
slopinthebag an hour ago
This is probably not a very effective way of marketing imo. At least, it turns me completely off.
ai_slop_hater an hour ago
randomuser558 a few seconds ago
iluvcommunism 38 minutes ago
gmziven an hour ago
bijowo1676 15 minutes ago
Do people voluntarily use these proxies/routers, knowing their prompts, outputs and code will be seen by other people ?
I get it might be ok for personal projects, but for anything that makes money and is a part of business... this must be big no-no ?