AI is scaling. Watch it happen.

| Model | Released | p50 Horizon | % Tasks Solved |
|---|---|---|---|
| Claude Opus 4.6 | Feb 2026 | 14.5 hours | 79% |
| GPT-5.2 | Dec 2025 | 6.6 hours | 75% |
| GPT-5.3 Codex | Feb 2026 | 6.5 hours | 75% |
| Claude Opus 4.5 | Nov 2025 | 5.3 hours | 73% |
| Gemini 3 Pro | Nov 2025 | 3.9 hours | 71% |
| GPT-5 | Aug 2025 | 3.6 hours | 69% |
| o3 | Apr 2025 | 2 hours | 64% |
| o1 | Dec 2024 | 38 min | 51% |
| Claude 3.5 Sonnet | Oct 2024 | 20 min | 45% |
| GPT-4 (Mar 2023) | Mar 2023 | 3.5 min | 29% |
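
One way to put a number on the trend is a rough doubling time for the p50 horizon. The sketch below uses only the oldest and newest rows of the table, and it assumes the horizon grows exponentially and that month-level release dates are precise enough. The helper names `months_between` and `doubling_time_months` are illustrative, not part of any published methodology, and a two-point estimate like this ignores every row in between.

```python
import math

# Oldest and newest rows from the table: (model, (year, month), p50 horizon in minutes)
first = ("GPT-4 (Mar 2023)", (2023, 3), 3.5)
last = ("Claude Opus 4.6", (2026, 2), 14.5 * 60)  # 14.5 hours -> 870 minutes


def months_between(start, end):
    """Whole months between two (year, month) pairs."""
    return (end[0] - start[0]) * 12 + (end[1] - start[1])


def doubling_time_months(h_start, h_end, months):
    """Months per doubling, assuming purely exponential growth (an assumption)."""
    return months / math.log2(h_end / h_start)


elapsed = months_between(first[1], last[1])
doubling = doubling_time_months(first[2], last[2], elapsed)

print(f"{first[0]} -> {last[0]}: {elapsed} months elapsed")
print(f"Estimated doubling time: {doubling:.1f} months")
```

On these two rows the estimate comes out to roughly 4.4 months per doubling. Fitting a line to the log of the horizon across all rows would be a sturdier estimate than using the endpoints alone.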