#
Subscription Plans & Features
#
Evaluation
- Access to all available models at no cost
- Very limited throughput and no SLA
- 1M tokens for 30 Days
- Access all LLMs at FP16 quality
- Lowest latency, fast responses every time
- Pay-as-you-go inference — tokens only used during runs
#
Exploratory
- Billed in CHF per 1 million input and output tokens, by model, at prevailing market rates
- Access all LLMs at FP16 quality
- Lowest latency, fast responses every time
- Pay-as-you-go inference — tokens only used during runs
- 100% Swiss sovereign — no data sharing or leakage
- Auto-renewed monthly
#
Standard
- Billed in CHF per 1 million input and output tokens, by model, at prevailing market rates
- Access all LLMs at FP16 quality
- Lowest latency, fast responses every time
- Pay-as-you-go inference — tokens only used during runs
- 100% Swiss sovereign — no data sharing or leakage
- Auto-renewed monthly
#
Scale-Up
- Billed in CHF per 1 million input and output tokens, by model, at prevailing market rates
- Access all LLMs at FP16 quality
- Lowest latency, fast responses every time
- Pay-as-you-go inference — tokens only used during runs
- 100% Swiss sovereign — no data sharing or leakage
- Auto-renewed monthly
🚀 Coming Soon: MaaS Private
This feature is in progress and will be documented after the next release.
- Contact us with the model you'd like to deploy (e.g., from Hugging Face)
- Depending on GPU requirements, you may need to purchase multiple MaaS Private plan subscriptions