Show HN: A benchmark where LLMs make memes from current news
Posted by max-azendorf | 3 hours ago | 1 comment
vintagedave 2 hours ago
I was a bit skeptical, but this is actually really neat. Sometimes they nail it. Cases where both are bad are common, and which LLM produced them lines up quite well with how good I think those models are.