Plattform
Private KI-Infrastruktur — von Compute bis Agent.
Zielgruppen
Für Enterprise, KMU und private Entwickler.
Wissen & Support
Alles was du brauchst, um erfolgreich mit Mycelis zu arbeiten.
Mycelis routes every request to the cheapest model capable of solving it — local GPUs, DeepSeek, Claude, GPT and Gemini behind one OpenAI-compatible endpoint.
~60%
lower API cost
< 60s
to first request
0
code changes needed
Drop-in replacement
Point your existing OpenAI SDK at Mycelis. Routing, cost optimization, and model selection happen automatically.
Einheitliche Agent-Runtime-Architektur
Cost-aware routing
Every request is evaluated against your routing rules. Only when complexity, context length, or tool requirements demand it does Mycelis escalate to a frontier model.
One endpoint, transparent logic
Your app always calls the same URL. Routing decisions are logged and visible — no black box.
Rules you control
Define routing by token count, keywords, tool calls, or regex. Or use the auto-router and let Mycelis decide.
Quality is preserved
Complex tasks still go to the best model. You never trade quality for cost — only unnecessary spend is cut.
Typical cost reduction for a mixed coding workload (80% routine tasks, 20% complex).
Comparison
LiteLLM is a great library. Mycelis is a managed platform. Here's what you get beyond what you'd build yourself.
| Feature | Mycelis | LiteLLM | DIY / CCR |
|---|---|---|---|
| Dedicated GPU provisioning | — | — | |
| Managed frontier API keys | — | — | |
| Local + cloud hybrid routing | partial | partial | |
| Hosted OpenWebUI (chat UI) | — | — | |
| One unified OpenAI-compatible endpoint | |||
| Cost-aware auto routing | limited | limited | |
| MCP tool integration | — | — | |
| Knowledge bases / RAG | — | — | |
| Zero infrastructure to manage | — | — |
Getting started
Start with a single coding agent preset. Advanced routing, MCP tools, and RAG can be added later — when you actually need them.
Point any OpenAI-compatible tool at mycelis.ai/api/proxy/v1. Works with Cursor, Claude Code, OpenCode, or any OpenAI SDK.
The default coding agent preset already routes simple tasks to cheap models and escalates automatically. Costs drop immediately, no configuration needed.
Once you see your routing traces and cost breakdown, tune rules, add MCP tools, attach knowledge bases, or fine-tune a local model.
When you need more
These features are ready when your use case demands them. Start simple, add layers as you grow.
Run DeepSeek, Gemma, Qwen or custom models on your own GPU — fully private, low latency, billed per minute.
Attach tools to your agents — web search, file access, APIs — without changing your client code.
Upload docs, codebases, or PDFs. Mycelis retrieves relevant context before each inference call automatically.
Train domain-specific adapters on your data. Deploy a fine-tuned model that routes to the right base automatically.
One-click chat interface for your workspace. Share with teammates, auto-billed per minute of usage.
Skip identical or near-identical requests. Cache semantically equivalent prompts to cut costs further.
Privacy-first routing
Route private workloads to self-hosted models. Use Claude or GPT only when necessary — and only for requests that don't contain sensitive IP.
Classify by sensitivity
Route requests with proprietary code patterns to local GPU instances. External providers never see that data.
EU-hosted infrastructure
All Mycelis infrastructure runs in the EU. Your data stays in jurisdiction without any enterprise contract.
BYOK or managed keys
Use your own Claude/OpenAI API keys or let Mycelis manage them. No lock-in, your choice.
PRIVACY ROUTING EXAMPLE
Rules defined by you. Evaluated per request, before any external call is made.
Transparente Preise
Keine Grundgebühr. Keine Mindestlaufzeit. Transparente Abrechnung für alle Dienste.
Open Source Modelle
Skalierbar & Dynamisch
DeepSeek, Gemma 4 & mehr. Skalierung und Laufzeit im Wizard einstellbar. Stündliche Abrechnung.
Kommerzielle Modelle
0 % Aufschlag
Managed oder BYOK. Verbinde Claude 4.7, GPT-4o & Co. mit 0 % Aufschlag auf die Anbieterpreise.
Knowledge Bases
0,05 € / h + 0,01 € / GB / h
Sicheres RAG für deine Daten. Fester Basispreis plus speicherbasierte Abrechnung.
OpenWebUI
0,05 € / h
Das beste Interface für dein Team. Vollständig gemanagt und mit allen Modellen verbunden.
100k coding tokens / day · mixed complexity
Same workflow. Same endpoint. Lower cost.
~60%
cost reduction
Calculated from the workload example on the left: $420 → $169 per month.
Häufige Fragen
Starte kostenlos mit Managed Keys — oder deploye dein erstes Open-Source-Modell in unter 60 Sekunden. Keine Grundgebühr.