Cost and Context Window Comparison
Comparison of context window and cost per 1M tokens.
Models |
Context Window |
Input Cost / 1M tokens |
Output Cost / 1M tokens |
GPT-4 |
8,000 |
$30.00 |
$60.00 |
GPT-4-32k |
32,000 |
$60.00 |
$120.00 |
GPT-4 Turbo |
128,000 |
$10.00 |
$30.00 |
GPT-3.5 Turbo |
16,000 |
$0.5 |
$1.5 |
GPT-3.5 Turbo Instruct |
4,000 |
$1.5 |
$2.00 |
Gemini Pro |
32,000 |
$0.125 |
$0.375 |
Gemini 1.5 Pro |
128,000 |
$7 |
$21 |
Mistral Small |
16,000 |
$2.00 |
$6.00 |
Mistral Medium |
32,000 |
$2.7 |
$8.1 |
Mistral Large |
32,000 |
$8.00 |
$24.00 |
Claude 3 Opus |
200,000 |
$15.00 |
$75.00 |
Claude 3 Sonnet |
200,000 |
$3.00 |
$15.00 |
Claude 3 Haiku |
200,000 |
$0.25 |
$1.25 |
GPT4o |
128,000 |
$5 |
$15 |
Gemini 1.5 Flash |
1,000,000 |
$0.35 |
$0.70 |
Claude 3.5 Sonnet |
200,000 |
$3 |
$15 |
GPT-4o mini |
128,000 |
$0.15 |
$0.60 |
Claude 3.5 Haiku |
200,000 |
$0.80 |
$4 |
AWS Nova Pro |
300,000 |
$0.0008 |
$0.0032 |
AWS Nova Lite |
300,000 |
$0.00006 |
$0.00024 |
AWS Nova Micro |
300,000 |
$0.000035 |
$0.00014 |
Gemini 2.0 Flash (Exp) |
1,000,000 |
- |
- |