Platform
Private AI infrastructure — from compute to agent.
Target Groups
For Enterprise, SMBs and private developers.
Knowledge & Support
Everything you need to work successfully with Mycelis.
Mycelis routes every request to the cheapest model capable of solving it — local GPUs, DeepSeek, Claude, GPT and Gemini behind one OpenAI-compatible endpoint.
~60%
lower API cost
< 60s
to first request
0
code changes needed
Drop-in replacement
Point your existing OpenAI SDK at Mycelis. Routing, cost optimization, and model selection happen automatically.
Unified Agent Runtime Architecture
Cost-aware routing
Every request is evaluated against your routing rules. Only when complexity, context length, or tool requirements demand it does Mycelis escalate to a frontier model.
One endpoint, transparent logic
Your app always calls the same URL. Routing decisions are logged and visible — no black box.
Rules you control
Define routing by token count, keywords, tool calls, or regex. Or use the auto-router and let Mycelis decide.
Quality is preserved
Complex tasks still go to the best model. You never trade quality for cost — only unnecessary spend is cut.
Typical cost reduction for a mixed coding workload (80% routine tasks, 20% complex).
Comparison
LiteLLM is a great library. Mycelis is a managed platform. Here's what you get beyond what you'd build yourself.
| Feature | Mycelis | LiteLLM | DIY / CCR |
|---|---|---|---|
| Dedicated GPU provisioning | — | — | |
| Managed frontier API keys | — | — | |
| Local + cloud hybrid routing | partial | partial | |
| Hosted OpenWebUI (chat UI) | — | — | |
| One unified OpenAI-compatible endpoint | |||
| Cost-aware auto routing | limited | limited | |
| MCP tool integration | — | — | |
| Knowledge bases / RAG | — | — | |
| Zero infrastructure to manage | — | — |
Getting started
Start with a single coding agent preset. Advanced routing, MCP tools, and RAG can be added later — when you actually need them.
Point any OpenAI-compatible tool at mycelis.ai/api/proxy/v1. Works with Cursor, Claude Code, OpenCode, or any OpenAI SDK.
The default coding agent preset already routes simple tasks to cheap models and escalates automatically. Costs drop immediately, no configuration needed.
Once you see your routing traces and cost breakdown, tune rules, add MCP tools, attach knowledge bases, or fine-tune a local model.
When you need more
These features are ready when your use case demands them. Start simple, add layers as you grow.
Run DeepSeek, Gemma, Qwen or custom models on your own GPU — fully private, low latency, billed per minute.
Attach tools to your agents — web search, file access, APIs — without changing your client code.
Upload docs, codebases, or PDFs. Mycelis retrieves relevant context before each inference call automatically.
Train domain-specific adapters on your data. Deploy a fine-tuned model that routes to the right base automatically.
One-click chat interface for your workspace. Share with teammates, auto-billed per minute of usage.
Skip identical or near-identical requests. Cache semantically equivalent prompts to cut costs further.
Privacy-first routing
Route private workloads to self-hosted models. Use Claude or GPT only when necessary — and only for requests that don't contain sensitive IP.
Classify by sensitivity
Route requests with proprietary code patterns to local GPU instances. External providers never see that data.
EU-hosted infrastructure
All Mycelis infrastructure runs in the EU. Your data stays in jurisdiction without any enterprise contract.
BYOK or managed keys
Use your own Claude/OpenAI API keys or let Mycelis manage them. No lock-in, your choice.
PRIVACY ROUTING EXAMPLE
Rules defined by you. Evaluated per request, before any external call is made.
Transparent Pricing
No base fee. No minimum term. Transparent billing for all services.
Open Source Models
Scalable & Dynamic
Deploy models like DeepSeek or Gemma 4. Scale up or down instantly. Billed per hour of runtime.
Commercial Models
0% Markup
Managed or BYOK. Connect Claude 4.7, GPT-4o or Gemini with 0% markup on provider prices.
Knowledge Bases
$0.05 / h + $0.01 / GB / h
Secure RAG for your data. Fixed base price plus storage-based billing.
OpenWebUI
$0.05 / h
The world's best interface for your team. Fully managed and connected to all your models.
100k coding tokens / day · mixed complexity
Same workflow. Same endpoint. Lower cost.
~60%
cost reduction
Calculated from the workload example on the left: $420 → $169 per month.
Common Questions
Start for free with Managed Keys — or deploy your first open-source model in under 60 seconds. No base fee.