COMPARE LLM MODELS

Claude 3.5 Sonnet

vs

Llama 3.1 405b

Compare these models on reasoning, tool use, math, and coding tasks. Check their pricing, speed, and overall performance side by side.
Basic Comparison
Model
Context size
Cutoff date
Input/Output cost
Max output tokens
Latency (TTFT)
Throughput
Claude 3.5 Sonnet
200,000
Apr 2024
$3
/
$15
4096
1.22s
78 t/s
Llama 3.1 405b
128,000
Dec 2023
$2.7
/
$2.7
4096
0.59s
27 t/s
Standard Benchmarks
Dynamic Chart