Modelwire
Subscribe

Qwen3.6-35B-A3B on my laptop drew me a better pelican than Claude Opus 4.7

Illustration accompanying: Qwen3.6-35B-A3B on my laptop drew me a better pelican than Claude Opus 4.7

Simon Willison compared Qwen3.6-35B-A3B and Claude Opus 4.7 using his informal "pelican riding a bicycle" benchmark, finding Alibaba's model produced superior image generation on a MacBook Pro M5 despite being smaller and quantized.

Modelwire context

Skeptical read

The 'pelican riding a bicycle' test is Willison's personal aesthetic preference, not a standardized benchmark, and the comparison pits a quantized local model against a cloud API model at very different cost and latency profiles. The result may say more about Claude Opus 4.7's SVG or image-rendering quirks than about general capability.

Anthropic is clearly under competitive pressure from multiple directions right now. TechCrunch reported on April 17 that Anthropic just launched Claude Design specifically to help non-designers generate visuals quickly, which suggests the company is aware its image-adjacent capabilities need a dedicated product push. Meanwhile, OpenAI's upgraded Codex (covered here from The Verge and TechCrunch on April 16) is attacking Anthropic on the coding side simultaneously. A hobbyist benchmark showing a smaller Alibaba model outperforming Claude on a creative visual task fits that broader pattern of Anthropic being squeezed, but one informal test from one developer is thin evidence on its own.

If Willison or other independent evaluators run the same Qwen3 model against Claude Design outputs on a wider set of visual tasks and the gap holds, that would be meaningful. If the advantage disappears outside this single prompt, it's a quantization artifact or a prompt-sensitivity issue, not a capability story.

This analysis is generated by Modelwire’s editorial layer from our archive and the summary above. It is not a substitute for the original reporting. How we write it.

MentionsQwen3.6-35B-A3B · Claude Opus 4.7 · Alibaba · Anthropic · Simon Willison · Unsloth

Modelwire summarizes — we don’t republish. The full article lives on simonwillison.net. If you’re a publisher and want a different summarization policy for your work, see our takedown page.

Qwen3.6-35B-A3B on my laptop drew me a better pelican than Claude Opus 4.7 · Modelwire