vineyardmike 2 hours ago
Anyways, so excited for an open-weight model and I hope it performs well. I’ll be testing this ASAP.
xnx 2 hours ago
beklein 2 hours ago
minimaxir 3 hours ago
kkukshtel 2 hours ago
rvz 3 hours ago
Then you will be able to achieve Jevons Paradox and enjoy the same “productivity gains” without paying for these extortionate token prices by closed model providers or have it as cheap as possible.
And especially, no silent nerfing of the model.
hmate9 2 hours ago
The bidirectionality could be a big deal: being able to refine a sentence with both left and right context feels closer to how editing/thinking actually works than committing to each token forever.
Maybe the current models aren’t good enough yet, but the direction feels important.