๐Ÿฆ† Pelican Benchmark Overview

AI Models Drawing "A Pelican Riding a Bicycle"

Based on the benchmark by Simon Willison ยท View all posts
This page was AI-generated using Claude Code

60
Models
15
Providers
Nov 12, 2024 - Nov 24, 2025
Date Range
Claude Opus 4.5
Claude Opus 4.5
Anthropic
๐Ÿ“… Nov 24, 2025
๐Ÿ“ Read Blog Post โ†’
Gemini 3 Pro
Gemini 3 Pro
Google
๐Ÿ“… Nov 18, 2025
๐Ÿ“ Read Blog Post โ†’
Kimi K2 Thinking
Kimi K2 Thinking
Moonshot AI
๐Ÿ“… Nov 6, 2025
๐Ÿ“ Read Blog Post โ†’
Composer 1
Composer 1
Cursor
๐Ÿ“… Oct 29, 2025
๐Ÿ“ Read Blog Post โ†’
Claude Haiku 4.5
Claude Haiku 4.5
Anthropic
๐Ÿ“… Oct 15, 2025
๐Ÿ“ Read Blog Post โ†’
GPT-5 Pro
GPT-5 Pro
OpenAI
๐Ÿ“… Oct 6, 2025
๐Ÿ“ Read Blog Post โ†’
DeepSeek-V3.2-Exp
DeepSeek-V3.2-Exp
DeepSeek
๐Ÿ“… Oct 1, 2025
๐Ÿ“ Read Blog Post โ†’
GLM-4.6
GLM-4.6
Z.ai
๐Ÿ“… Oct 1, 2025
๐Ÿ“ Read Blog Post โ†’
Claude Sonnet 4.5
Claude Sonnet 4.5
Anthropic
๐Ÿ“… Sep 29, 2025
๐Ÿ“ Read Blog Post โ†’
Gemini 2.5 Flash (thinking mode)
Gemini 2.5 Flash (thinking mode)
Google
๐Ÿ“… Sep 25, 2025
๐Ÿ“ Read Blog Post โ†’
Gemini 2.5 Flash (non-thinking)
Gemini 2.5 Flash (non-thinking)
Google
๐Ÿ“… Sep 25, 2025
๐Ÿ“ Read Blog Post โ†’
Gemini 2.5 Flash Lite (thinking mode)
Gemini 2.5 Flash Lite (thinking mode)
Google
๐Ÿ“… Sep 25, 2025
๐Ÿ“ Read Blog Post โ†’
Gemini 2.5 Flash Lite (non-thinking)
Gemini 2.5 Flash Lite (non-thinking)
Google
๐Ÿ“… Sep 25, 2025
๐Ÿ“ Read Blog Post โ†’
GPT-5-Codex
GPT-5-Codex
OpenAI
๐Ÿ“… Sep 23, 2025
๐Ÿ“ Read Blog Post โ†’
Grok 4 Fast (non-reasoning)
Grok 4 Fast (non-reasoning)
xAI
๐Ÿ“… Sep 20, 2025
๐Ÿ“ Read Blog Post โ†’
Grok 4 Fast (reasoning)
Grok 4 Fast (reasoning)
xAI
๐Ÿ“… Sep 20, 2025
๐Ÿ“ Read Blog Post โ†’
Qwen3-Next-80B-A3B-Thinking
Qwen3-Next-80B-A3B-Thinking
Qwen
๐Ÿ“… Sep 12, 2025
๐Ÿ“ Read Blog Post โ†’
Qwen3-Next-80B-A3B-Instruct
Qwen3-Next-80B-A3B-Instruct
Qwen
๐Ÿ“… Sep 12, 2025
๐Ÿ“ Read Blog Post โ†’
Kimi-K2-Instruct-0905
Kimi-K2-Instruct-0905
Moonshot AI
๐Ÿ“… Sep 6, 2025
๐Ÿ“ Read Blog Post โ†’
DeepSeek-V3.1
DeepSeek-V3.1
DeepSeek
๐Ÿ“… Aug 22, 2025
๐Ÿ“ Read Blog Post โ†’
Qwen3-4B-Thinking-2507
Qwen3-4B-Thinking-2507
Qwen
๐Ÿ“… Aug 10, 2025
๐Ÿ“ Read Blog Post โ†’
Qwen3-4B-Instruct-2507
Qwen3-4B-Instruct-2507
Qwen
๐Ÿ“… Aug 10, 2025
๐Ÿ“ Read Blog Post โ†’
GPT-5
GPT-5
OpenAI
๐Ÿ“… Aug 7, 2025
๐Ÿ“ Read Blog Post โ†’
GPT-5 Mini
GPT-5 Mini
OpenAI
๐Ÿ“… Aug 7, 2025
๐Ÿ“ Read Blog Post โ†’
GPT-5 Nano
GPT-5 Nano
OpenAI
๐Ÿ“… Aug 7, 2025
๐Ÿ“ Read Blog Post โ†’
gpt-oss-20b (reasoning=high)
gpt-oss-20b (reasoning=high)
OpenAI
๐Ÿ“… Aug 5, 2025
๐Ÿ“ Read Blog Post โ†’
Claude Opus 4.1
Claude Opus 4.1
Anthropic
๐Ÿ“… Aug 5, 2025
๐Ÿ“ Read Blog Post โ†’
Claude Opus 4
Claude Opus 4
Anthropic
๐Ÿ“… Aug 5, 2025
๐Ÿ“ Read Blog Post โ†’
XBai o4
XBai o4
MetaStone AI
๐Ÿ“… Aug 3, 2025
๐Ÿ“ Read Blog Post โ†’
Qwen3-Coder-30B-A3B-Instruct
Qwen3-Coder-30B-A3B-Instruct
Alibaba
๐Ÿ“… Jul 31, 2025
๐Ÿ“ Read Blog Post โ†’
Qwen3-30B-A3B-Thinking-2507
Qwen3-30B-A3B-Thinking-2507
Qwen
๐Ÿ“… Jul 30, 2025
๐Ÿ“ Read Blog Post โ†’
GLM-4.5 Air
GLM-4.5 Air
Z.ai
๐Ÿ“… Jul 29, 2025
๐Ÿ“ Read Blog Post โ†’
Qwen3-235B-A22B-Thinking-2507
Qwen3-235B-A22B-Thinking-2507
Qwen
๐Ÿ“… Jul 25, 2025
๐Ÿ“ Read Blog Post โ†’
Kimi K2
Kimi K2
Moonshot AI
๐Ÿ“… Jul 11, 2025
๐Ÿ“ Read Blog Post โ†’
Mistral-Small-3.2-24B-Instruct-2506
Mistral-Small-3.2-24B-Instruct-2506
Mistral
๐Ÿ“… Jun 20, 2025
๐Ÿ“ Read Blog Post โ†’
Gemini 2.5 Pro
Gemini 2.5 Pro
Google
๐Ÿ“… Jun 17, 2025
๐Ÿ“ Read Blog Post โ†’
Gemini 2.5 Flash
Gemini 2.5 Flash
Google
๐Ÿ“… Jun 17, 2025
๐Ÿ“ Read Blog Post โ†’
Gemini 2.5 Flash Lite Preview
Gemini 2.5 Flash Lite Preview
Google
๐Ÿ“… Jun 17, 2025
๐Ÿ“ Read Blog Post โ†’
Gemini 2.5 Flash Preview (with thinking)
Gemini 2.5 Flash Preview (with thinking)
Google
๐Ÿ“… May 20, 2025
๐Ÿ“ Read Blog Post โ†’
Gemini 2.5 Flash Preview (without thinking)
Gemini 2.5 Flash Preview (without thinking)
Google
๐Ÿ“… May 20, 2025
๐Ÿ“ Read Blog Post โ†’
Gemini 2.5 Pro Preview
Gemini 2.5 Pro Preview
Google
๐Ÿ“… May 6, 2025
๐Ÿ“ Read Blog Post โ†’
Qwen 3 32B
Qwen 3 32B
Alibaba
๐Ÿ“… Apr 29, 2025
๐Ÿ“ Read Blog Post โ†’
Gemini 2.5 Flash (preview-04-17, low thinking)
Gemini 2.5 Flash (preview-04-17, low thinking)
Google
๐Ÿ“… Apr 17, 2025
๐Ÿ“ Read Blog Post โ†’
Gemini 2.5 Flash (preview-04-17, medium thinking)
Gemini 2.5 Flash (preview-04-17, medium thinking)
Google
๐Ÿ“… Apr 17, 2025
๐Ÿ“ Read Blog Post โ†’
Gemini 2.5 Flash (preview-04-17, high thinking)
Gemini 2.5 Flash (preview-04-17, high thinking)
Google
๐Ÿ“… Apr 17, 2025
๐Ÿ“ Read Blog Post โ†’
Gemini 2.5 Pro
Gemini 2.5 Pro
Google
๐Ÿ“… Mar 25, 2025
๐Ÿ“ Read Blog Post โ†’
DeepSeek-V3-0324
DeepSeek-V3-0324
DeepSeek
๐Ÿ“… Mar 24, 2025
๐Ÿ“ Read Blog Post โ†’
o1-pro (high reasoning)
o1-pro (high reasoning)
OpenAI
๐Ÿ“… Mar 19, 2025
๐Ÿ“ Read Blog Post โ†’
o1-pro (standard reasoning)
o1-pro (standard reasoning)
OpenAI
๐Ÿ“… Mar 19, 2025
๐Ÿ“ Read Blog Post โ†’
OLMo 2 32B
OLMo 2 32B
Allen AI
๐Ÿ“… Mar 16, 2025
๐Ÿ“ Read Blog Post โ†’
Gemma 3 27B
Gemma 3 27B
Google
๐Ÿ“… Mar 12, 2025
๐Ÿ“ Read Blog Post โ†’
GPT-4.5-preview
GPT-4.5-preview
OpenAI
๐Ÿ“… Feb 27, 2025
๐Ÿ“ Read Blog Post โ†’
Claude 3.7 Sonnet
Claude 3.7 Sonnet
Anthropic
๐Ÿ“… Feb 24, 2025
๐Ÿ“ Read Blog Post โ†’
Gemini 2.0 Flash
Gemini 2.0 Flash
Google
๐Ÿ“… Feb 5, 2025
๐Ÿ“ Read Blog Post โ†’
DeepSeek-R1-Distill-Llama-8B
DeepSeek-R1-Distill-Llama-8B
unsloth
๐Ÿ“… Jan 20, 2025
๐Ÿ“ Read Blog Post โ†’
Phi-4
Phi-4
Microsoft
๐Ÿ“… Jan 8, 2025
๐Ÿ“ Read Blog Post โ†’
Gemini-exp-1206
Gemini-exp-1206
Google
๐Ÿ“… Dec 6, 2024
๐Ÿ“ Read Blog Post โ†’
Gemini-exp-1206 (animated)
Gemini-exp-1206 (animated)
Google
๐Ÿ“… Dec 6, 2024
๐Ÿ“ Read Blog Post โ†’
QwQ-32B Preview
QwQ-32B Preview
Alibaba
๐Ÿ“… Nov 27, 2024
๐Ÿ“ Read Blog Post โ†’
Qwen2.5-Coder-32B-Instruct
Qwen2.5-Coder-32B-Instruct
Alibaba
๐Ÿ“… Nov 12, 2024
๐Ÿ“ Read Blog Post โ†’