gandreani 37 minutes ago
"We're also launching GPT‑5.6 Sol on Cerebras at up to 750 tokens per second in July, bringing frontier intelligence to customers at unprecedented speed. Access will initially be limited to select customers as we expand capacity."
750 tokens/s on a frontier model is going to be extremely interesting. I doubt this new version is anything but a version bump in terms of capabilities but if we can start getting these answers back faster, they end up being more useful.
Just off the top of my head, I can think of the tedious task of finding certain functionality within a codebase. I usually can't beat an AI agent harness at this task today. If the AI model is 3x faster, I have even less of a chance.
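To put rough numbers on the speed point: a minimal sketch, assuming generation time is simply output tokens divided by throughput (it ignores time-to-first-token, network, and tool-call latency). The 750 tokens/s figure comes from the quoted announcement; the baseline is only the "3x slower" rate implied by the comment, and the 3,000-token response size is a made-up illustration.

```python
def generation_seconds(output_tokens: int, tokens_per_second: float) -> float:
    """Time to stream a response, assuming constant throughput."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return output_tokens / tokens_per_second


ANNOUNCED_TPS = 750.0                    # "up to" figure from the announcement
ASSUMED_BASELINE_TPS = ANNOUNCED_TPS / 3  # assumption: the comment's "3x faster"
RESPONSE_TOKENS = 3000                    # hypothetical response size

fast = generation_seconds(RESPONSE_TOKENS, ANNOUNCED_TPS)
slow = generation_seconds(RESPONSE_TOKENS, ASSUMED_BASELINE_TPS)
print(f"{RESPONSE_TOKENS} tokens: {slow:.1f}s at baseline vs {fast:.1f}s at 750 tok/s")
```

An agent that chains many such responses multiplies the gap, which is where the difference would be felt most.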
razighter777 an hour ago
It's worrying that, with no formal and transparent policy framework, the government will be picking winners and losers and stifling innovation.
There's been no public policy, executive order, legislation, or anything else on this. I wonder if anyone has filed FOIA requests for these decisions or the conversations between the Executive Branch and AI companies.
Fraterkes 2 hours ago
This amount of courting the current administration is pretty scary imo.
HyperL0gi an hour ago
- GPT-5 mini costs $0.25/$2 and will be discontinued in December.
- GPT-5.4 mini costs $0.75/$4.5 and is supposed to be the replacement.
- GPT-5.4 nano costs $0.2/$1.25 and, while it ranks better in benchmarks than GPT-5 mini, it's not even close when you test it in real scenarios.
So you're left being forced to go to GPT 5.4 mini if you use 5 mini today.
The same thing is happening here, as their "Luna" model will cost $1/$6.
Can't we just stay with the models we actually want? I don't need GPT 5.4 mini. GPT-5 does the job.
Maybe it’s the realization that it was never that cheap in the first place and they're forcing us to upgrade in a slow and painful way.
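The size of the forced price jump depends on the input/output mix. A rough sketch, assuming the figures above are USD per 1M input/output tokens (the comment gives bare numbers) and using a hypothetical monthly workload of 10M input and 2M output tokens:

```python
# USD per 1M tokens as (input, output), copied from the comment above.
# The per-1M unit is an assumption: the comment does not state it.
PRICES = {
    "GPT-5 mini": (0.25, 2.00),
    "GPT-5.4 nano": (0.20, 1.25),
    "GPT-5.4 mini": (0.75, 4.50),
    "Luna": (1.00, 6.00),
}


def monthly_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Total cost in USD for the given token counts."""
    in_price, out_price = PRICES[model]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


# Hypothetical workload, chosen only for illustration.
INPUT_TOKENS = 10_000_000
OUTPUT_TOKENS = 2_000_000

baseline = monthly_cost("GPT-5 mini", INPUT_TOKENS, OUTPUT_TOKENS)
for model in PRICES:
    cost = monthly_cost(model, INPUT_TOKENS, OUTPUT_TOKENS)
    print(f"{model:<13} ${cost:6.2f}  ({cost / baseline:.2f}x GPT-5 mini)")
```

With this made-up 5:1 mix, moving from GPT-5 mini to GPT-5.4 mini comes out to roughly 2.5x the bill, and Luna to about 3.4x; a different mix shifts the ratios.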
impulser_ an hour ago
These models aren't even that smart and they are already trying to control them and lock them down to a handful of people.
Then these executives and VCs wonder why people hate AI and are against them.
Because the future is heading toward intelligence for the rich, and you're stuck with whatever model they want you to have.
The next step is banning open source models.
The future is not looking so bright if these models are already being locked down to whoever the government wants to have them.
This is no different than the government banning books because they don't want you to learn.
sim04ful 2 minutes ago
This seems like it would be the largest, and the first closed-source, model Cerebras has offered to date.
jdw64 an hour ago
Recently, I went head-to-head with GPT on nearly 2,000 lines of code, and GPT's solution was superior and faster. I even referenced multiple codebases on GitHub while trying, but they were incomparable to GPT.
So using GPT brings both fear and excitement.
The fear comes from realizing that this level of code is now the average for most people. The excitement comes from knowing that I can now study and learn at this level too.
I'm really looking forward to seeing how much more advanced the code will be with the upgrade to 5.6.
pixelpoet 2 hours ago
mohsen1 an hour ago
I'm curious how this works. Do the subagents also get to use the same tools? Will the client be flooded with tool calls? Why extra pricing for a new "model" when the same thing can happen in the client with more controls?
And if it's an army of subagents, why do they compare it to Fable and Mythos? Those models with similar harness would probably bench better I'm guessing
mccoyb 2 hours ago
ddp26 2 hours ago
supermdguy 43 minutes ago
This is really exciting. I work on voice AI, and we're still using 4.1/4.1 mini since none of the frontier models come close on latency. I'm excited to be able to have more interactive experiences, I think it'll unlock new ways of working with these models.
loufe 2 hours ago
If it was the next generation, why isn't it a major version change..?
HarHarVeryFunny an hour ago
Multiple weeks!
Not just 5 work days, but at least 10!
modeless an hour ago
I'm very glad to see them say this explicitly and prominently.
type4 2 hours ago
firasd an hour ago
Agent Arena (Dynamic ranking of models on how well they orchestrate tools for real-world agentic tasks, based on signals like tool reliability, task completion, and steerability.)
Top 10, Highest rank to lowest
Claude Fable 5 (High), Claude Opus 4.8 (Thinking), GPT 5.5 (xHigh), Claude Opus 4.7 (Thinking), GPT 5.5 (High), Claude Opus 4.7, Claude Opus 4.6, GPT 5.5, GPT 5.4 (High), GLM 5.2 (Max)
Text Arena (Overall rankings across various AI models in text-to-text tasks across math, coding, creative writing, and other open-ended domains.)
Top 10, Highest rank to lowest
claude-fable-5, claude-opus-4-6-thinking, claude-opus-4-7-thinking, claude-opus-4-6, claude-opus-4-7, muse-spark, gemini-3.1-pro-preview, gemini-3-pro, claude-opus-4-8-thinking, gpt-5.5-high
osti 2 hours ago
m3h 44 minutes ago
Who knows what they will fix, block or change in the model between the preview and GA time. Open models can't arrive soon enough.
GodelNumbering 17 minutes ago
zkmon 13 minutes ago
mekpro an hour ago
jimmydoe an hour ago
If this is the new norm, we as workers should all start looking for jobs in those companies.
nopakos 39 minutes ago
scrlk 27 minutes ago
So the next naming scheme might be FTX, Madoff and Enron? :^)
woeirua an hour ago
ChrisLTD 2 hours ago
jansenmac an hour ago
vatsachak an hour ago
But GPT-5.5 is as useful as an LLM can be; it has solved lemmas I've thought about for a year, it can implement typed STLCs in Rust when I give it a formal grammar, it can help me analyze Postgres planner dumps.
LLMs are great at tasks that have short solutions, but
- they cannot learn based on a project
- their long term planning capabilities are worse than worms
- they are unconfident in decision making
- their internal representations are disgusting compared to JEPA
- they don't have any "system clock" like humans and computers do
- LLM architecture is not modular like computer architecture or human brain architecture
There are so many issues with LLMs. I wish companies would start working on the next generation of architectures before the bubble pops.
bluepeter an hour ago
corygarms an hour ago
tomComb an hour ago
Really glad to see some reasonably prominent pushback against this government overreach.
The Information has been reporting that the government wants to individually approve which companies get access and when.
Imagine the wonderful opportunities for corruption and influence peddling, not to mention excluding any companies that don’t support Trump.
rappatic an hour ago
leumon 2 hours ago
I hope this means Fable will also get released again.
smeeth an hour ago
micimize 16 minutes ago
Doesn't that undermine all good-faith discourse on cybersecurity safeguards, controlled usage etc? Or is that overstating the case (I'm not a security researcher myself so kinda parroting).
swe_dima 37 minutes ago
OsrsNeedsf2P an hour ago
bijowo1676 2 hours ago
mikkelam an hour ago
hereme888 22 minutes ago
Is it just me, or does it seem like Anthropic has been more of a pioneer the past few years, and OpenAI tries to copy features they like?
low_tech_punk 2 hours ago
duggan an hour ago
The clowns in the US administration can barely remain coherent from one sentence to the next.
Having them be the gatekeepers of technological progress in 2026 is fucking lame.
ddwrll an hour ago
I'm looking at you Codex.
masonwan 14 minutes ago
johnnyApplePRNG an hour ago
nsingh2 2 hours ago
KronisLV an hour ago
simianwords 33 minutes ago
IAmGraydon an hour ago
kristofferR an hour ago
transcriptase an hour ago
andrewlin247 an hour ago
ChrisArchitect 2 hours ago
thesurlydev an hour ago
Anyone know the latest around Fable being re-released after gov smackdown?
simianwords an hour ago
1. Naming convention is copied from Anthropic and honestly is more catchy than a number (amongst normal people)
2. How in the world did Anthropic have to do all the theatrics about Mythos just to have OpenAI release an equivalent or stronger model a month later without any drama???
3. Cheaper models just don’t fit any use case imo, and OpenAI knows it, so they keep raising the floor - I’m still convinced task per capability is reduced with each release
4. How in the world would open source models keep up with the multi layer security? Either this security is all theater or we will finally see a ceiling in open source models because by definition they can’t have those protections
5. Cybersecurity things are boring to me because it’s all zero sum cat and mouse games
submeta an hour ago
I mean, if they deem Fable 5 too powerful to share with the rest of the world, what's left for us?
da_grift_shift an hour ago
Flagged activity can also trigger account-level review across relevant conversations and risk signals, consistent with our terms and policies around content retention and review. Looking beyond a single conversation helps our systems distinguish persistent malicious behavior from legitimate dual-use security work, where similar technical concepts may appear in very different contexts.
Fascinating! Every conversation you have with these "more capable" models will be monitored and joined up and then your entire account might one day be tagged as Distiller or Cyber Threat Actor or whatnot. When combined with identity verification (which isn't discussed in this press release), expect people to be falsely flagged and banned from ever using OpenAI models again.
Wish I could find the thread from last week where discussions of exactly this kind of thing were dismissed as daft and outlandish.
arendtio an hour ago
I mean, you can read them even without the colors, but who on earth thought that those are a good set of colors? Oh, I forgot it was probably someone on 'Sol'.
kmeisthax an hour ago
My brother in Christ, then why did you (and your competitors) spend years telling the government you needed them to tie your hands behind your back? Did you really think they'd just give you a crown that says "Gatekeeper Of All Neural Networks"?
meetpateltech an hour ago
Sol Ultra ≈ Pro
Sol ≈ Standard
Terra ≈ Mini
Luna ≈ Nano
ALittleLight an hour ago
randomuser558 3 minutes ago
gck1 an hour ago
w4yai an hour ago
nakedrobot2 2 hours ago
Beam me up Scotty. No intelligent life forms on this planet.
rvz 2 hours ago
> GPT‑5.6 is priced per 1M tokens across three model sizes:
> Sol is $5 input / $30 output;
> Terra is $2.50 input / $15 output
> Luna is $1 input / $6 output.
The OpenAI casino has never been more ready to take your money on gambling even more tokens.