Show HN: A benchmark where LLMs make memes from current news

Posted by max-azendorf |3 hours ago |1 comments

vintagedave 2 hours ago

I was a bit skeptical but this is actually really neat. Sometimes they nail it. The situations where both are bad are common, and seeing which LLM produced them lines up quite well with how good I think the same models are.

max-azendorf 3 hours ago

Comment deleted