ModelPiper pricing
Everything Ollama does, free — model downloads, the native llama.cpp engine, multi-model, the local OpenAI-compatible API. No account, no caps. Paid tiers cover what no model runner does: dictation, speech, vision, system control, and developer tools.
Explore plans
Free for everyone. Pro for the full toolkit. Team for your whole office on one Mac. Studio and Max coming soon.
Free
Everything Ollama does, free. No account, no caps.
- Native llama.cpp engine — run any GGUF model
- Unlimited model downloads, multi-model switching
- Local OpenAI-compatible API + embeddings
- MCP server with all 300+ tools
- Transcription (STT) and chat with bundled model
- Visual pipeline builder
- Connect your own API keys (OpenAI, Anthropic, Groq)
- Free companion apps (VisionPiper, AudioPiper, MediaPiper)
Pro
What no model runner does, at any price.
- Everything in Free
- Push-to-talk dictation, anywhere on your Mac
- Text-to-speech — three engines, eight voices
- Apple Intelligence on the Neural Engine
- Local RAG over your files
- All 9 inference backends
Studio
For content creators and marketers.
- Everything in Pro
- Image upscaling (ANE-native)
- Video upscaling (60fps real-time)
- Video editing pipeline
- Pose detection (60fps streaming)
- Outreach toolkit (queue, posting, Firehose)
Max
For developers and QA engineers.
- Everything in Studio
- CodePiper (IDE AI extension)
- PiperTest (self-healing browser tests)
- Full browser automation (CDP + AX)
- API discovery toolkit
- Priority support
Team
One Mac runs your AI. Your whole team uses it — governed, audited, private.
- Everything in Max, on the deployment
- Unlimited named member tokens — no per-seat pricing
- Per-user attributed audit trail with pull export
- Remote tool governance from modelpiper.com
- Clients on any OS via piper-bridge
- Priced per deployment — add Macs as you grow
Add-ons
Optional. Available on any plan.
Devices add-on
PiperMesh
Your Mac runs the AI. Your phone, watch, and PC just talk to it.
- iPhone, iPad, and Apple Watch apps
- Windows + Linux desktop client
- Unlimited paired devices on your network
- End-to-end encrypted
- Works on any tier, Free included
Compare plans
Everything that ships with each tier.
| Feature | Free | Pro | Studio coming soon | Max coming soon |
|---|---|---|---|---|
| Inference & Models | ||||
| Native llama.cpp engine (Metal GPU) | ||||
| Model downloads (GGUF / MLX) | Unlimited | Unlimited | Unlimited | Unlimited |
| Multi-model switching | ||||
| Local OpenAI-compatible API | ||||
| Local embeddings | ||||
| Local chat (bundled model) | ||||
| Cloud BYOK (OpenAI, Anthropic, Groq) | ||||
| Apple Intelligence | ||||
| Voice | ||||
| Transcription (STT) | ||||
| Text-to-speech (TTS) | ||||
| Push-to-talk dictation | ||||
| Push-to-talk AI commands | ||||
| Pipeline Builder | ||||
| Visual pipeline builder | ||||
| Local RAG over files | ||||
| Tools & Automation | ||||
| MCP server (stdio + HTTP) | ||||
| MCP tools | over 300 tools | over 300 tools | over 300 tools | over 300 tools |
| Browser automation | ||||
| System actions | 26 domains | 26 domains | 26 domains | 26 domains |
| Clipboard integration | ||||
| Companion Apps | ||||
| VisionPiper (screen capture & streaming) | ||||
| AudioPiper (per-app audio capture) | ||||
| MediaPiper (browser extension) | ||||
| Content & Media | ||||
| Image upscaling (ANE-native) | ||||
| Video upscaling (60fps real-time) | ||||
| Video editing pipeline | ||||
| Pose detection (60fps streaming) | ||||
| Outreach toolkit (queue, posting) | ||||
| Developer Tools | ||||
| CodePiper (IDE AI extension) | ||||
| PiperTest (self-healing browser tests) | ||||
| Full browser automation (CDP + AX) | ||||
| API discovery toolkit | ||||
| Priority support | ||||
The free engine is llama.cpp, embedded directly — currently llama-server b9533. The version ships in the app's About panel and updates with every upstream bump.