Grok 4.3

Posted by simianwords |4 hours ago |51 comments

khalic 3 hours ago[1 more]

This project is a gigantic waste of resources, it’s fine tuned on politics of the CEO, was used for CSAM generation and just sucks overall

artdigital 3 hours ago[3 more]

Grok is my favorite model for chatting, and my favorite voice mode. It seems to be the only voice mode that isn't routing to a extremely cheap model (like Haiku), and has been the highest quality out of all the frontier ones. When you subscribe to SuperGrok you can also create a "council" of agents, each with their own system prompt and when you ask something, they will all get asked in parallel to come to a conclusion. Good stuff!

Just wish they would finally put some work into their apps, it's the only thing keeping me from actually subscribing to SuperGrok:

- No MCP / connected apps support. It's been teased but here we are, still not available. I can't connect Grok to anything, so I can't use it for serious work

- Projects are still not available in the app so as soon as you move something into a project, it's gone from all the native apps

- No way to add artifacts (like generated markdown docs) directly to a project, we have to export to PDF/markdown and re-import. And there isn't even a way to export artifacts. This makes serious project work hard because we can't dynamically evolve projects with new information

- No memory, no ability to look up other chats, each chat is completely new

- No voice mode in projects at all

If someone from xAI is reading this, please consider adding some of these.

sundarurfriend 3 hours ago[1 more]

As an English-as-second-language speaker and writer, one thing Grok really shines at is capturing the tone and level of "formality" of a piece of text and the replicating it correctly. It seems to understand the little human subtleties of language in a way the other major providers don't. Chatgpt goes overly stiff and formal sounding, or ends up in a weird "aye guvnor" type informal language (Claude is sometimes better but not always).

Grok seems in general better at being "human" in ways that are hard to define: for eg. if I ask it "does this message roughly convey things correctly, to the level it can given this length", it will likely answer like a human would (either a yes or a change suggestion that sticks to the tone and length), while Chatgpt would write a dissertation on the message that still doesn't clear anything up.

Recently I've noticed that Grok seems to have gotten really good at dictation too (that feature where you click the mic to ask it something). Chatgpt has like 90-95% accuracy with my accent, the speech input on Android's Gboard something like 75%, Grok surprisingly gets something like 98% of my words correct.

tornikeo 3 hours ago[3 more]

So, we have: - claude for corps and gov - codex for devs - grok for what, roleplay, racism? Those are the two things I've ever heard grok associated with around me.

maz1b 4 hours ago

I still wish they named it something else, but congratulations to the team on what seems to be a good release!

Pricing is also quite surprising, compared to comparable competitors. I guess they have tons of capacity or really want to bring over more people.

netdur 4 hours ago

In court vs openai, Musk said Grok is partly trained on openai models, so it should be somehow similar to Chinese models in terms of performance and cost!

OtherShrezzing 3 hours ago

The tok/s stat is interesting. Since the dominant constraint on inference speed is hardware, it suggests X purchased far more compute than was really needed to serve the demand for their models.

Expensive miscalculation.

mythz 4 hours ago[2 more]

Ok speed (202.7 tok/s) and value (1.25 -> 2.50) look great, with pretty decent intelligence.

alyxya 4 hours ago

Despite their attrition, this combined with their cursor partnership is likely going to make them competitive in coding agents soon.

ragchronos 4 hours ago[1 more]

When looking at the benchmarks, this model seems to be really close to Kimi K2.6 in terms of intelligence and pricing, hitting that sweet spot. It does also have a higher AA-Omniscience index, which is something kimi and other open models lack in. Curious to see how pleasant it is to use.

BoredPositron 3 hours ago

Yay, free tokens. I don't know why but grok always seems good fast in the free token phase and after that degrades.

simianwords 4 hours ago[5 more]

https://artificialanalysis.ai/models/grok-4-3

Imustaskforhelp 4 hours ago[1 more]

Pelican riding a bike here: https://gist.github.com/SerJaimeLannister/f6de26bd0d0817e056...

(ran this on arena.ai direct chat and also tried to write this gist inspired by how simon writes his gists about pelicans)

Edit: just realized that I made pelican riding a bike instead of bicycle, which now makes sense as to why it hardened the bicycle to look tankier, going to compare this with pelican riding a bicycle if anybody else shares the pelican riding a bicycle.

alfiedotwtf 3 hours ago[1 more]

If there was any model I wouldn’t trust, it wouldn’t be the ones from China, it would be the one from Elon Musk

happosai 3 hours ago[1 more]

I lost the trust in them when they added the racist "what about killing of Boers in south Africa" thing to their system prompt.

No way am I going to use a model where the backing has such blatantly obvious brain washing goals.

th3b0tk1ll3r 3 hours ago[1 more]

Auuuwh no. That's not supposed to be a joke? Grok? not close to what models can do.