From prototype to production scale.
Pay-as-you-go from $10. Committed plans from $499/mo. Dedicated GPU instances from $5,000/mo. All 200+ models, one API.
Provision dedicated GPU instances for inference, fine-tuning, and training. Available on Dedicated and Enterprise tiers.
| GPU | VRAM | Price / hr | Best For |
|---|---|---|---|
| NVIDIA H100 SXM | 80 GB | $3.49/hr | Large models, fine-tuning, training |
| NVIDIA A100 SXM | 80 GB | $2.09/hr | Training, high-throughput inference |
| NVIDIA L40S | 48 GB | $1.19/hr | Inference, cost-efficient production |
| NVIDIA A10G | 24 GB | $0.75/hr | Lightweight inference, experimentation |
GPU instances billed per hour. Minimum 1-hour commitment. Multi-GPU clusters available on Enterprise. Pricing may vary by availability and region.
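As a back-of-envelope illustration, the hourly rates above translate into monthly costs as follows. This is a sketch only: it assumes ~730 hours in a month and uses the list prices, which (per the note above) may vary by availability and region.

```python
# Rough monthly cost estimate for dedicated GPU instances, using the
# hourly list rates from the table above (illustrative only; actual
# pricing varies by availability and region).

GPU_HOURLY_USD = {
    "H100": 3.49,
    "A100": 2.09,
    "L40S": 1.19,
    "A10G": 0.75,
}

def monthly_gpu_cost(gpu: str, count: int = 1, hours: float = 730) -> float:
    """Cost in USD of running `count` GPUs for `hours` (~730 hours/month)."""
    return round(GPU_HOURLY_USD[gpu] * count * hours, 2)

# A single H100 running continuously for a month:
print(monthly_gpu_cost("H100"))
# A 4x L40S inference cluster:
print(monthly_gpu_cost("L40S", count=4))
```

Note that these figures exceed the $5,000/mo Dedicated tier floor quickly once clusters grow, which is why exact pricing goes through sales.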
From shared multi-tenant pools to fully isolated on-premises deployments. Pick the tenancy model that matches your compliance and performance needs.
Every model has clear input and output pricing. Use our Vedika models for faith-domain tasks, or route to any open-source model at competitive rates.
| Model | Provider | Input (per 1M tokens) | Output (per 1M tokens) | Context |
|---|---|---|---|---|
| Vedika Standard | XALEN | $0.60 | $1.80 | 128K |
| Vedika Fast | XALEN | $0.10 | $0.30 | 128K |
| Vedika Voice | XALEN | $0.02/sec | — | 31 languages |
| Llama 3.1 405B | Meta | $0.88 | $2.64 | 128K |
| Llama 3.1 70B | Meta | $0.54 | $1.62 | 128K |
| Mixtral 8x22B | Mistral AI | $0.60 | $1.80 | 65K |
| Qwen 2.5 72B | Alibaba | $0.54 | $1.62 | 128K |
| DeepSeek V3 | DeepSeek | $0.27 | $0.81 | 128K |
| Gemma 2 27B | Google | $0.20 | $0.60 | 8K |
| Command R+ | Cohere | $2.50 | $7.50 | 128K |
| +190 more models | | Full pricing in docs | | |
Batch processing pricing is 50% of the rates shown above. All prices in USD. Custom pricing available for Enterprise plans.
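A quick sketch of how the per-million-token rates above turn into a request cost, including the 50% batch discount. The rates passed in are taken from the table; the token counts are illustrative.

```python
# Estimate request cost from the per-1M-token rates in the table above.
# Batch processing is billed at 50% of the listed rates.

def token_cost(input_tokens: int, output_tokens: int,
               input_rate: float, output_rate: float,
               batch: bool = False) -> float:
    """Cost in USD; rates are per 1M tokens, as in the pricing table."""
    cost = (input_tokens / 1e6) * input_rate + (output_tokens / 1e6) * output_rate
    return cost * 0.5 if batch else cost

# Vedika Standard: $0.60 input / $1.80 output per 1M tokens
print(token_cost(100_000, 20_000, 0.60, 1.80))              # real-time
print(token_cost(100_000, 20_000, 0.60, 1.80, batch=True))  # batch, half price
```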
Add a minimum of $10 to your wallet via Razorpay (UPI, cards, net banking). Use any of the 200+ models and pay per token consumed. Credits are valid for 1 year from purchase. No monthly fees, no commitments. When your balance hits zero, API requests return 402 until you top up.
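Clients on pay-as-you-go should treat the 402 as a distinct, recoverable condition rather than a generic failure. A minimal sketch of that handling, where `call` stands in for whatever function performs the HTTP request and returns a `(status, body)` pair (the request itself is not shown here):

```python
# Minimal sketch of client-side handling for the zero-balance case.
# The platform returns HTTP 402 when pay-as-you-go credits run out;
# `call` is a placeholder for the actual request function.

class WalletEmpty(Exception):
    """Raised when the API returns 402 (pay-as-you-go balance at zero)."""

def guarded_call(call):
    status, body = call()
    if status == 402:
        raise WalletEmpty("Balance is zero; top up (min $10) to resume requests.")
    if status != 200:
        raise RuntimeError(f"API error {status}")
    return body

# Usage with a stand-in callable simulating an empty wallet:
try:
    guarded_call(lambda: (402, None))
except WalletEmpty as e:
    print(e)
```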
Growth ($499/mo) gives you 300 req/min, 500K tokens/min, 5 API keys, priority support, and usage analytics. Scale ($2,499/mo) upgrades to 1,000 req/min, 2M tokens/min, 10 API keys, priority inference queue, dedicated account manager, and 99.9% SLA. Both tiers include all 200+ models at standard token pricing.
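To decide between tiers, it helps to check a workload against both limits at once, since either the request ceiling or the token ceiling can be the binding constraint. A sketch using the Growth and Scale figures above:

```python
# Check whether a workload fits a tier's rate limits, using the
# Growth (300 req/min, 500K tokens/min) and Scale (1,000 req/min,
# 2M tokens/min) figures quoted above.

TIER_LIMITS = {
    "growth": {"req_per_min": 300, "tokens_per_min": 500_000},
    "scale":  {"req_per_min": 1_000, "tokens_per_min": 2_000_000},
}

def fits_tier(tier: str, req_per_min: float, avg_tokens_per_req: float) -> bool:
    """True if both the request and token throughput fit within the tier."""
    limits = TIER_LIMITS[tier]
    return (req_per_min <= limits["req_per_min"]
            and req_per_min * avg_tokens_per_req <= limits["tokens_per_min"])

# 250 requests/min at ~1,500 tokens each -> 375K tokens/min, fits Growth:
print(fits_tier("growth", 250, 1_500))
```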
Dedicated tier starts at $5,000/mo. You get isolated GPU instances billed per GPU-hour: H100 at ~$3.49/hr, A100 at ~$2.09/hr, L40S at ~$1.19/hr. Includes custom fine-tuning, 99.99% SLA, single-tenant isolation, and private endpoints. Contact sales for exact pricing based on your configuration.
Upgrades are immediate — your new rate limits apply instantly. Downgrades take effect at the end of your current billing cycle. You can always fall back to Pay As You Go with no penalty. Contact billing@xalen.io for tier changes.
Yes. Verified religious organizations and registered nonprofits receive 30% off all token pricing on any tier. Contact enterprise@xalen.io with your organization verification documents and we will apply the discount within 48 hours.
Enterprise tier includes SOC 2 compliance and HIPAA support for health-faith applications. We also offer SSO/SAML integration, data residency options, and custom compliance documentation. Contact our enterprise team for specific certification requirements.
Processing millions of tokens monthly? Need dedicated infrastructure, custom SLAs, or on-premise deployment? Let us build a plan for your organization.
From pay-as-you-go prototyping to dedicated GPU clusters. Pick the tier that fits your stage.