Now live: DeepSeek R1-0528 reasoning · UltraFast LPU inference available in all plans

Token Plan · CLI tokens, simple pricing

Tokens for Claude Code, Gemini CLI, and OpenAI CLI.

A flat monthly fee, a 5-hour token window, and the latest frontier models in your terminal. One API key, one shared wallet — no idle GPU cost, no surprise overage, no API rate gymnastics.

Claude Code: Opus 4.8 · Sonnet 4.6 · Haiku 4.5
Gemini CLI: 2.5 Pro · Flash · Flash-Lite
OpenAI CLI: GPT-5.5 · 5 mini · 5 nano
Billing: Monthly · Annual (2 months free)

/ Plans

Pick the plan that matches your CLI workload.

Test

Try every CLI tool, no card

$3 / one-time

1M · no expiry

1M tokens, valid across Claude Code, Gemini CLI, and OpenAI CLI. Never expires.

API key: 1 shared
Token limit: 1M one-time
CLI tools: All 3
Models:
Claude: Opus 4.8 · Sonnet 4.6 · Haiku 4.5
Gemini: 2.5 Pro · Flash · Flash-Lite
OpenAI: GPT-5.5 · 5 mini · 5 nano
Priority queue: Not included
Concurrent sessions: 1
Analytics: Not included
Support: Community

Pro

For solo builders

$25 / month
$250 / year

2 months free · save $50

3M · 5-hour window

3M tokens every 5-hour window, with all the latest models across every supported CLI.

API key: 1 shared
Token limit: 3M / 5h
CLI tools: All 3
Models:
Claude: Opus 4.8 · Sonnet 4.6 · Haiku 4.5
Gemini: 2.5 Pro · Flash · Flash-Lite
OpenAI: GPT-5.5 · 5 mini · 5 nano
Priority queue: Not included
Concurrent sessions: 3
Analytics: Basic
Support: Email · 24h

Ultra

For heavy production teams

$80 / month
$800 / year

2 months free · save $160

20M · 5-hour window

20M tokens every 5-hour window, for teams running CLIs around the clock.

API key: 1 shared
Token limit: 20M / 5h
CLI tools: All 3
Models:
Claude: Opus 4.8 · Sonnet 4.6 · Haiku 4.5
Gemini: 2.5 Pro · Flash · Flash-Lite
OpenAI: GPT-5.5 · 5 mini · 5 nano
Priority queue: Included
Concurrent sessions: 10
Analytics: Advanced + Slack
Support: Email · 4h

/ Compare

All features, side by side.

Pick the plan that matches your CLI workload. Upgrade or downgrade any time.

Tokens per 5-hour window (Test is one-time; the others reset on a rolling 5-hour window)
Test: 1M (one-time) | Pro: 3M | Max: 5M | Ultra: 20M
Monthly billing
Test: $3 one-time | Pro: $25 / mo | Ultra: $80 / mo
Annual billing (pay for 10 months, get 12; 2 months free)
Pro: $250 / yr, save $50 | Max: $350 / yr, save $70 | Ultra: $800 / yr, save $160
CLI tools supported (Claude Code, Gemini CLI, OpenAI CLI)
All 3 on every plan
API keys included (one key works across all 3 CLIs; no per-tool key needed)
1 on every plan
Top 3 frontier models per provider (same on every plan)
Claude: Opus 4.8 · Sonnet 4.6 · Haiku 4.5  |  Gemini: 2.5 Pro · Flash · Flash-Lite  |  OpenAI: GPT-5.5 · 5 mini · 5 nano
Priority queue
Ultra: Included
Concurrent CLI sessions
Test: 1 | Pro: 3 | Ultra: 10
Usage analytics & alerts
Pro: Basic | Ultra: Advanced + Slack
Support
Test: Community | Pro: Email · 24h | Ultra: Email · 4h
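
To make "pick the plan that matches your CLI workload" concrete, here is a minimal sketch that picks the cheapest windowed plan covering a given workload. The figures are copied from the Pro and Ultra plan cards above; Max is left out because this page does not list its concurrent-session limit. The `PLANS` table and the `cheapest_plan` helper are hypothetical names used only for illustration, not part of any Token Factory tooling.

```python
# Plan figures copied from the Pro and Ultra cards on this page.
# Max is omitted: the page does not list its concurrent-session limit.
PLANS = [
    {"name": "Pro", "monthly_usd": 25, "tokens_per_5h": 3_000_000, "sessions": 3},
    {"name": "Ultra", "monthly_usd": 80, "tokens_per_5h": 20_000_000, "sessions": 10},
]


def cheapest_plan(tokens_per_5h, sessions):
    """Return the name of the cheapest plan that covers the workload, or None."""
    candidates = [
        plan for plan in PLANS
        if plan["tokens_per_5h"] >= tokens_per_5h and plan["sessions"] >= sessions
    ]
    if not candidates:
        return None
    return min(candidates, key=lambda plan: plan["monthly_usd"])["name"]


light = cheapest_plan(2_000_000, 2)       # fits Pro
parallel = cheapest_plan(2_000_000, 5)    # needs more than 3 sessions
too_big = cheapest_plan(25_000_000, 1)    # exceeds every listed window
```

With these inputs, `light` is `"Pro"`, `parallel` is `"Ultra"`, and `too_big` is `None`, since no listed plan offers more than 20M tokens per window.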

/ FAQ

Common questions.

Everything you need to know about Token Plan pricing and CLI access.

Which CLI tools are supported?
Every plan includes access to Claude Code, Gemini CLI, and OpenAI CLI. All three tools share a single token wallet and a single API key, so you can jump between them mid-session without buying a separate plan or rotating keys.

Which models are included in every plan?
All three CLIs expose the top 3 frontier models from their provider. Claude Code ships with Opus 4.8 (flagship reasoning), Sonnet 4.6 (mid-tier), and Haiku 4.5 (fast/small). Gemini CLI ships with 2.5 Pro (flagship), Flash (fast), and Flash-Lite (cheapest). OpenAI CLI ships with GPT-5.5 (flagship), GPT-5 mini, and GPT-5 nano. Test, Pro, Max, and Ultra all include the same model set. What changes between plans is the token allowance, along with priority queue access, concurrent sessions, analytics, and support level.

How many API keys do I get?
One. Every plan — Test, Pro, Max, and Ultra — ships with a single API key that works across Claude Code, Gemini CLI, and OpenAI CLI. There is no per-tool key and no extra key on Ultra. Use the same key in your ANTHROPIC_API_KEY, GEMINI_API_KEY, and OPENAI_API_KEY env vars and Token Factory routes each request to the right model automatically.
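
As a rough illustration of the single-key setup, the sketch below builds an environment in which all three variables named above hold the same key. The helper name `shared_key_env` and the placeholder key string are hypothetical; the page does not say whether any other settings are required, so treat this as a starting point rather than a complete configuration.

```python
import os

# The three variable names are the ones listed in the FAQ answer above.
ENV_VARS = ("ANTHROPIC_API_KEY", "GEMINI_API_KEY", "OPENAI_API_KEY")


def shared_key_env(api_key, base_env=None):
    """Return a copy of the environment with one shared key set for all three CLIs.

    base_env defaults to the current process environment; pass a dict to
    start from something else (useful in tests).
    """
    if not api_key:
        raise ValueError("api_key must be a non-empty string")
    env = dict(os.environ if base_env is None else base_env)
    for name in ENV_VARS:
        env[name] = api_key
    return env


# "YOUR_TOKEN_PLAN_KEY" is a placeholder, not a real key format.
env = shared_key_env("YOUR_TOKEN_PLAN_KEY", base_env={})
```

The resulting dictionary can be passed as the `env` argument of `subprocess.run` when launching a CLI from a script, so the key never has to be written into a shell profile.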

How does the 5-hour token window work?
On Pro, Max, and Ultra, your token allowance refills on a rolling 5-hour window. Use up to 3M (Pro), 5M (Max), or 20M (Ultra) tokens in any 5-hour period; the count replenishes continuously, with no manual reset and no monthly bill shock. The Test plan's 1M tokens are a one-time allowance and do not refill.
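
One way to picture a rolling window is as a log of recent usage where anything older than 5 hours stops counting. The sketch below is an assumption about how such metering could work, not a description of Token Factory's actual accounting; the `RollingWindow` class and its method names are hypothetical. Timestamps are plain seconds so the behavior is easy to follow.

```python
from collections import deque

WINDOW_SECONDS = 5 * 60 * 60  # the 5-hour window from the plan description


class RollingWindow:
    """Sliding-log token counter: usage older than the window no longer counts."""

    def __init__(self, limit_tokens, window_seconds=WINDOW_SECONDS):
        self.limit = limit_tokens
        self.window = window_seconds
        self.events = deque()  # (timestamp_seconds, tokens), oldest first
        self.used = 0

    def _expire(self, now):
        # Drop every event that is at least one full window old.
        while self.events and self.events[0][0] <= now - self.window:
            _, tokens = self.events.popleft()
            self.used -= tokens

    def remaining(self, now):
        """Tokens still available at time `now`."""
        self._expire(now)
        return self.limit - self.used

    def record(self, now, tokens):
        """Record usage if it fits in the window; return True on success."""
        self._expire(now)
        if tokens > self.limit - self.used:
            return False
        self.events.append((now, tokens))
        self.used += tokens
        return True


pro = RollingWindow(3_000_000)               # Pro allowance: 3M per window
first = pro.record(0, 2_000_000)             # accepted
second = pro.record(3600, 1_500_000)         # rejected: only 1M left
left_after_5h = pro.remaining(5 * 3600)      # the first request has aged out
```

Here `first` is `True`, `second` is `False`, and `left_after_5h` is back to the full 3,000,000 because the only recorded usage is now a full window old.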

What happens if I exceed my 5-hour window?
Requests are throttled to a fair-use floor until enough earlier usage drops out of the rolling 5-hour window. You'll see live usage in your dashboard and can set alerts at 50%, 80%, and 100% of your window. There is no overage charge: you either wait for the allowance to replenish or upgrade.
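
The alert levels mentioned above can be sketched as a simple threshold check. This is only an illustration of the arithmetic, assuming an alert fires the moment usage crosses a level; the function name `crossed_alerts` is hypothetical and the dashboard's real behavior may differ.

```python
# Alert levels (percent of the window allowance) named in the FAQ answer above.
ALERT_THRESHOLDS = (50, 80, 100)


def crossed_alerts(used_before, used_after, limit):
    """Return the alert levels crossed when usage rises from used_before to used_after."""
    crossed = []
    for pct in ALERT_THRESHOLDS:
        mark = limit * pct / 100
        # A level is crossed if usage was below it before and is at or above it now.
        if used_before < mark <= used_after:
            crossed.append(pct)
    return crossed


# Pro plan (3M per window): usage jumps from 1M to 2.5M tokens.
fired = crossed_alerts(1_000_000, 2_500_000, 3_000_000)
```

In this example `fired` is `[50, 80]`: the jump passes 1.5M (50%) and 2.4M (80%) but stays below the 3M (100%) mark.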

Can I switch between CLI tools mid-session?
Yes. All three CLIs share the same wallet, so you can move from Claude Code to Gemini CLI to OpenAI CLI in the same project. Tokens consumed count against your 5-hour window regardless of which CLI used them.

Can I change plans at any time?
Yes. Upgrades take effect immediately and we prorate the difference. Downgrades take effect at the start of your next billing cycle, so you keep your current token allowance until the current billing period ends.
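
The page does not spell out the proration formula. The sketch below assumes the simplest common scheme, a linear charge based on the days left in the billing cycle, purely to show what "prorate the difference" could look like in numbers. The function name `upgrade_charge` and the day-based formula are assumptions, not Token Factory's documented billing logic.

```python
def upgrade_charge(old_price, new_price, days_left, days_in_cycle):
    """Assumed linear proration: pay the price difference for the days remaining."""
    if days_in_cycle <= 0 or not 0 <= days_left <= days_in_cycle:
        raise ValueError("days_left must be between 0 and days_in_cycle")
    return round((new_price - old_price) * days_left / days_in_cycle, 2)


# Pro ($25 / month) to Ultra ($80 / month) halfway through a 30-day cycle.
charge = upgrade_charge(25, 80, days_left=15, days_in_cycle=30)
```

Under this assumption `charge` comes to 27.5, which is half of the $55 monthly price difference.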

What payment methods do you accept?
All major credit cards via Stripe, with line-item invoices for every charge. Annual billing gives you 2 months free — pay for 10 months upfront and use the full 12. Contact us for ACH or wire transfer on annual plans.

How does the annual 2-months-free discount work?
Switch the billing toggle to Annual on any paid plan and you’ll be charged once for 10 months of access, then receive the full 12 months of service. Pro is $250/yr (save $50), Max is $350/yr (save $70), and Ultra is $800/yr (save $160). The Test plan stays a one-time $3 with no annual option.
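
The annual figures follow directly from the "pay for 10 months, get 12" rule, and the short sketch below reproduces them. The function names are hypothetical. The Max monthly price is not listed on this page, so the last line only derives what the stated $350/yr would imply under the same rule.

```python
PAID_MONTHS = 10    # months charged on annual billing
TOTAL_MONTHS = 12   # months of service received


def annual_price(monthly_usd):
    """Annual price when 10 months are charged for 12 months of service."""
    return monthly_usd * PAID_MONTHS


def annual_saving(monthly_usd):
    """Saving compared with paying month to month for a full year."""
    return monthly_usd * TOTAL_MONTHS - annual_price(monthly_usd)


pro = (annual_price(25), annual_saving(25))      # (250, 50)
ultra = (annual_price(80), annual_saving(80))    # (800, 160)

# Max: only the annual price ($350) appears on this page, so the monthly
# figure here is derived from the 10-month rule, not quoted from the page.
max_implied_monthly = 350 / PAID_MONTHS          # 35.0
```

The implied $35 per month for Max is consistent with the stated $70 saving, which equals two months at that rate.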

Is there a free tier?
There is no free tier. The lowest-cost option is the Test plan ($3, one-time), which gives you 1M tokens to try every supported CLI. It's the cheapest way to validate Token Factory for your terminal workflow before committing to a monthly plan.