Lodestar
Live · Tracking 308 LLM models · 7 GPU types · updated 2026-06-30 13:26 UTC · auto-refreshed every 4 hours

AI infrastructure price compass

The price-performance compass for AI infrastructure — live LLM API costs, the daily Lodestar Index, and the cheapest GPU rentals. Updated automatically.

Lodestar Index / 1M: $0.084
Cheapest blended: $0.015
Models tracked: 308
GPU types: 7
Lodestar Index — USD / 1M tokens · production-grade (q≥80)
$0.084
cheapest model at quality ≥ 80: qwen/qwen3-30b-a3b-instruct-2507
history builds daily — the trend line appears after a few runs.
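As a minimal sketch of how the index value could be derived: take every model whose quality score meets the threshold and pick the one with the lowest blended price. The `Model` type and `lodestar_index` function are hypothetical names for illustration. The sample data reuses the five price and quality pairs quoted in the price-performance frontier section of this page. The site's actual implementation is not shown here.

```python
from typing import List, NamedTuple, Optional


class Model(NamedTuple):
    name: str
    blended_usd_per_1m: float  # blended price, USD per 1M tokens
    quality: int               # quality score as shown on the page (q)


def lodestar_index(models: List[Model], min_quality: int = 80) -> Optional[Model]:
    """Return the cheapest model whose quality meets the threshold, or None."""
    eligible = [m for m in models if m.quality >= min_quality]
    if not eligible:
        return None
    return min(eligible, key=lambda m: m.blended_usd_per_1m)


# Price/quality pairs quoted in the frontier section of this page.
MODELS = [
    Model("meta-llama/llama-3.1-8b-instruct", 0.022, 74),
    Model("qwen/qwen-2.5-7b-instruct", 0.055, 78),
    Model("qwen/qwen3-30b-a3b-instruct-2507", 0.084, 84),
    Model("deepseek/deepseek-v3.2", 0.257, 86),
    Model("openai/gpt-5.4-nano", 0.463, 95),
]

if __name__ == "__main__":
    best = lodestar_index(MODELS)
    if best is not None:
        print(f"${best.blended_usd_per_1m:.3f} / 1M tokens ({best.name})")
```

With this sample, the two cheaper models fall below q80 and are skipped, so the result is the qwen3-30b model at $0.084.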

Cheapest blended LLM today: inclusionai/ling-2.6-flash at $0.015 / 1M tokens (blend 75% in / 25% out · 308 models)
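The 75% in / 25% out blend is a weighted average of the input and output prices. A minimal sketch, where `blended_price` is a hypothetical helper name rather than the site's own code:

```python
def blended_price(in_price: float, out_price: float, in_weight: float = 0.75) -> float:
    """Weighted average of input and output prices (USD per 1M tokens).

    in_weight is the share of input tokens; the remainder is output.
    """
    if not 0.0 <= in_weight <= 1.0:
        raise ValueError("in_weight must be between 0 and 1")
    return in_weight * in_price + (1.0 - in_weight) * out_price


if __name__ == "__main__":
    # inclusionai/ling-2.6-flash: $0.01 in, $0.03 out per 1M tokens
    print(f"{blended_price(0.01, 0.03):.3f}")
```

For inclusionai/ling-2.6-flash this gives 0.75 × 0.01 + 0.25 × 0.03 = 0.015, matching the figure above. Some rows in the ranking will not reproduce exactly from the displayed prices (for example 0.05 in and 0.19 out gives 0.085, not 0.084), presumably because the displayed in and out prices are rounded to two decimals.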

Price movers (since last update)

No price changes since the last build. Movers light up the moment a provider repricing is detected.

LLM API price ranking (blended $/1M tokens)

Showing top 30 of 308 models by blended price.

# | Model | Provider | In $/1M | Out $/1M | Blended $/1M | Context (tokens)
1 | inclusionai/ling-2.6-flash | Inclusionai | 0.01 | 0.03 | 0.015 | 262,144
2 | meta-llama/llama-3.1-8b-instruct | Meta-Llama | 0.02 | 0.03 | 0.022 | 131,072
3 | mistralai/mistral-nemo | Mistralai | 0.02 | 0.03 | 0.022 | 131,072
4 | ibm-granite/granite-4.0-h-micro | Ibm-Granite | 0.02 | 0.11 | 0.041 | 131,000
5 | sao10k/l3-lunaris-8b | Sao10K | 0.04 | 0.05 | 0.043 | 8,192
6 | liquid/lfm-2-24b-a2b | Liquid | 0.03 | 0.12 | 0.052 | 128,000
7 | qwen/qwen-2.5-7b-instruct | Qwen | 0.04 | 0.10 | 0.055 | 131,072
8 | openai/gpt-oss-20b | Openai | 0.03 | 0.14 | 0.057 | 131,072
9 | mistralai/mistral-small-24b-instruct-2501 | Mistralai | 0.05 | 0.08 | 0.058 | 32,768
10 | openai/gpt-oss-120b | Openai | 0.03 | 0.15 | 0.060 | 131,072
11 | gryphe/mythomax-l2-13b | Gryphe | 0.06 | 0.06 | 0.060 | 4,096
12 | amazon/nova-micro-v1 | Amazon | 0.04 | 0.14 | 0.061 | 128,000
13 | ibm-granite/granite-4.1-8b | Ibm-Granite | 0.05 | 0.10 | 0.062 | 131,072
14 | google/gemma-3-4b-it | Google | 0.05 | 0.10 | 0.062 | 131,072
15 | cohere/command-r7b-12-2024 | Cohere | 0.04 | 0.15 | 0.066 | 128,000
16 | meta-llama/llama-3.2-1b-instruct | Meta-Llama | 0.03 | 0.20 | 0.070 | 131,072
17 | arcee-ai/trinity-mini | Arcee-Ai | 0.04 | 0.15 | 0.071 | 131,072
18 | google/gemma-3n-e4b-it | Google | 0.06 | 0.12 | 0.075 | 32,768
19 | google/gemma-3-12b-it | Google | 0.05 | 0.15 | 0.075 | 131,072
20 | qwen/qwen3-30b-a3b-instruct-2507 | Qwen | 0.05 | 0.19 | 0.084 | 131,072
21 | nvidia/nemotron-3-nano-30b-a3b | Nvidia | 0.05 | 0.20 | 0.087 | 262,144
22 | microsoft/phi-4 | Microsoft | 0.07 | 0.14 | 0.087 | 16,384
23 | qwen/qwen3-235b-a22b-2507 | Qwen | 0.09 | 0.10 | 0.092 | 262,144
24 | tencent/hy3-preview | Tencent | 0.06 | 0.21 | 0.100 | 262,144
25 | rekaai/reka-edge | Rekaai | 0.10 | 0.10 | 0.100 | 16,384
26 | mistralai/ministral-3b-2512 | Mistralai | 0.10 | 0.10 | 0.100 | 131,072
27 | qwen/qwen3-235b-a22b-thinking-2507 | Qwen | 0.10 | 0.10 | 0.100 | 262,144
28 | google/gemma-3-27b-it | Google | 0.08 | 0.16 | 0.100 | 131,072
29 | amazon/nova-lite-v1 | Amazon | 0.06 | 0.24 | 0.105 | 300,000
30 | mistralai/mistral-small-3.2-24b-instruct | Mistralai | 0.07 | 0.20 | 0.106 | 128,000

Price-performance frontier

Best value at each quality tier:
meta-llama/llama-3.1-8b-instruct ($0.022, q74) · qwen/qwen-2.5-7b-instruct ($0.055, q78) · qwen/qwen3-30b-a3b-instruct-2507 ($0.084, q84) · deepseek/deepseek-v3.2 ($0.257, q86) · openai/gpt-5.4-nano ($0.463, q95)
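One common way to build a frontier like this is a Pareto filter: sort by price and keep a model only if it beats the quality of everything cheaper. The sketch below assumes that approach; the page does not say how its frontier is computed. `Point` and `frontier` are hypothetical names, and the two `example/...` entries are made-up dominated points added only to show the filtering.

```python
from typing import List, NamedTuple


class Point(NamedTuple):
    name: str
    price: float   # blended USD per 1M tokens
    quality: int   # quality score (q)


def frontier(points: List[Point]) -> List[Point]:
    """Keep models that no cheaper-or-equal model matches or beats on quality."""
    kept: List[Point] = []
    best_quality = float("-inf")
    # Cheapest first; at equal price, the higher-quality model comes first.
    for p in sorted(points, key=lambda p: (p.price, -p.quality)):
        if p.quality > best_quality:
            kept.append(p)
            best_quality = p.quality
    return kept


POINTS = [
    Point("meta-llama/llama-3.1-8b-instruct", 0.022, 74),
    Point("qwen/qwen-2.5-7b-instruct", 0.055, 78),
    Point("qwen/qwen3-30b-a3b-instruct-2507", 0.084, 84),
    Point("deepseek/deepseek-v3.2", 0.257, 86),
    Point("openai/gpt-5.4-nano", 0.463, 95),
    Point("example/dominated-a", 0.070, 70),  # hypothetical: pricier and worse than q78
    Point("example/dominated-b", 0.300, 85),  # hypothetical: pricier and worse than q86
]

if __name__ == "__main__":
    for p in frontier(POINTS):
        print(f"{p.name} (${p.price:.3f}, q{p.quality})")
```

Both hypothetical entries are dropped because a cheaper model already offers equal or higher quality, leaving the five models listed above.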

What a real job costs

Summarize 1,000 documents
cheapest: inclusionai/ling-2.6-flash $0.05 · priciest: openai/o1-pro $840.00 (16,800x spread)
1M-token RAG pipeline
cheapest: inclusionai/ling-2.6-flash $0.01 · priciest: openai/o1-pro $240.00 (24,000x spread)
10,000 chat turns
cheapest: inclusionai/ling-2.6-flash $0.22 · priciest: openai/o1-pro $3,900.00 (17,727.3x spread)
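The job figures above follow from two pieces of arithmetic: a job's cost is its token counts multiplied by the per-1M prices, and the spread is the priciest cost divided by the cheapest. A rough sketch, with hypothetical function names; the per-turn token counts are an assumed workload shape, since the page does not state the sizes it uses:

```python
def job_cost(in_tokens: int, out_tokens: int, in_price: float, out_price: float) -> float:
    """Cost in USD for a job, given prices in USD per 1M tokens."""
    return (in_tokens * in_price + out_tokens * out_price) / 1_000_000


def spread(priciest_usd: float, cheapest_usd: float) -> float:
    """How many times more the priciest option costs than the cheapest."""
    if cheapest_usd <= 0:
        raise ValueError("cheapest_usd must be positive")
    return priciest_usd / cheapest_usd


if __name__ == "__main__":
    # Assumed workload: 10,000 chat turns at 1,000 input and 400 output tokens each.
    turns, in_per_turn, out_per_turn = 10_000, 1_000, 400
    # inclusionai/ling-2.6-flash prices from the ranking: $0.01 in, $0.03 out.
    cost = job_cost(turns * in_per_turn, turns * out_per_turn, 0.01, 0.03)
    print(f"chat job on ling-2.6-flash: ${cost:.2f}")

    # Spreads recomputed from the dollar figures quoted on the page.
    for label, hi, lo in [
        ("summarize", 840.00, 0.05),
        ("rag", 240.00, 0.01),
        ("chat", 3900.00, 0.22),
    ]:
        print(f"{label}: {spread(hi, lo):,.1f}x")
```

Because the quoted costs are rounded to cents, spreads recomputed this way can differ slightly from ones computed on unrounded prices.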