GPT-4o vs Claude 4 Sonnet vs Gemini 2.5 Pro
Complete comparison (June 2026): benchmarks, pricing, features, and recommendations.
📅Last updated: June 6, 2026
📋 Overview
| Feature | GPT-4o | Claude 4 Sonnet | Gemini 2.5 Pro |
|---|---|---|---|
| Release Date | May 2024 | Jun 2025 | Mar 2025 |
| Parameters | Undisclosed | Undisclosed | Undisclosed |
| Context Window | 128K tokens | 200K tokens | 1M tokens |
| License | Proprietary | Proprietary | Proprietary |
| Multimodal | ✅ Text, Image, Audio | ✅ Text, Image | ✅ Text, Image, Audio, Video |
| Function Calling | ✅ | ✅ | ✅ |
| Streaming | ✅ | ✅ | ✅ |
📊 Benchmark Comparison
| Benchmark | GPT-4o | Claude 4 Sonnet | Gemini 2.5 Pro |
|---|---|---|---|
| MMLU | 88.7 | 89 | 90 ✓ |
| HumanEval | 90.2 | 92.1 ✓ | 88.4 |
| Chatbot Arena ELO | 1387 ✓ | 1385 | 1380 |
| MATH | 76.6 | 78.3 | 83 ✓ |
| GPQA Diamond | 53.6 | 59.4 | 65 ✓ |
💰 Pricing
| Price Point | GPT-4o | Claude 4 Sonnet | Gemini 2.5 Pro |
|---|---|---|---|
| Input / 1M tokens | $2.50 | $3.00 | $1.25 |
| Output / 1M tokens | $10.00 | $15.00 | $10.00 |
| Batch Input / 1M | $1.25 | $1.50 | $0.625 |
| Batch Output / 1M | $5.00 | $7.50 | $5.00 |
| Free Tier | ✅ ChatGPT Free | ❌ | ✅ Gemini Free |
🎯 Which One Should You Use?
General Chat & Writing
GPT-4oBest balance of quality, speed, and cost. Huge ecosystem of integrations.
Code Generation
Claude 4 SonnetHighest HumanEval score. Excels at complex multi-file code tasks.
Long Documents
Gemini 2.5 Pro1M context window handles massive documents. Strong on GPQA.
Best Value
DeepSeek V3Not on this list, but at $0.27/$1.10 per 1M tokens, it crushes on price.
Multimodal (Audio/Video)
Gemini 2.5 ProOnly model that natively handles video input alongside text and audio.