Xai Plans Pricing
Pricing for the xAI API is usage-based per token (input/output, with cached-input discounts) per model, plus per-call fees for built-in tools and discounts for the Batch API. Exact per-model rates have not been reconciled in this artifact; see the xAI Console Models page for current rates.
Plans
Standard API access billed per token consumed, per model, with prepaid or invoice-based credits. Billing covers input tokens, output tokens, and (where applicable) cached-input tokens, plus per-call tool-invocation fees.
- Chat Completions
- Responses
- Embeddings
- Images
- Video
- Voice
- Live Search
Discounted asynchronous processing for non-real-time workloads. Batch requests are 20-50% cheaper than the equivalent synchronous calls and do not count toward rate limits.
- Batch Chat Completions
- Batch Embeddings
Volume commitments, custom capacity, dedicated support, and negotiated pricing for enterprise customers. Contact xAI sales.
- Custom Volume Pricing
- Dedicated Capacity