ModelPiper pricing

Everything Ollama does, free — model downloads, the native llama.cpp engine, multi-model, the local OpenAI-compatible API. No account, no caps. Paid tiers cover what no model runner does: dictation, speech, vision, system control, and developer tools.

Explore plans

Free for everyone. Pro for the full toolkit. Team for your whole office on one Mac. Studio and Max coming soon.

Free

Everything Ollama does, free. No account, no caps.

$0

  • Native llama.cpp engine — run any GGUF model
  • Unlimited model downloads, multi-model switching
  • Local OpenAI-compatible API + embeddings
  • MCP server with all 300+ tools
  • Transcription (STT) and chat with bundled model
  • Visual pipeline builder
  • Connect your own API keys (OpenAI, Anthropic, Groq)
  • Free companion apps (VisionPiper, AudioPiper, MediaPiper)
Most popular

Pro

What no model runner does, at any price.

$10/mo

  • Everything in Free
  • Push-to-talk dictation, anywhere on your Mac
  • Text-to-speech — three engines, eight voices
  • Apple Intelligence on the Neural Engine
  • Local RAG over your files
  • All 9 inference backends
Coming soon

Studio

For content creators and marketers.

$29/mo

  • Everything in Pro
  • Image upscaling (ANE-native)
  • Video upscaling (60fps real-time)
  • Video editing pipeline
  • Pose detection (60fps streaming)
  • Outreach toolkit (queue, posting, Firehose)
Coming soon

Max

For developers and QA engineers.

$49/mo

  • Everything in Studio
  • CodePiper (IDE AI extension)
  • PiperTest (self-healing browser tests)
  • Full browser automation (CDP + AX)
  • API discovery toolkit
  • Priority support
For teams

Team

One Mac runs your AI. Your whole team uses it — governed, audited, private.

$99/mo per deployment

  • Everything in Max, on the deployment
  • Unlimited named member tokens — no per-seat pricing
  • Per-user attributed audit trail with pull export
  • Remote tool governance from modelpiper.com
  • Clients on any OS via piper-bridge
  • Priced per deployment — add Macs as you grow

Add-ons

Optional. Available on any plan.

Coming soon

Devices add-on

PiperMesh

Your Mac runs the AI. Your phone, watch, and PC just talk to it.

+$7/mo

  • iPhone, iPad, and Apple Watch apps
  • Windows + Linux desktop client
  • Unlimited paired devices on your network
  • End-to-end encrypted
  • Works on any tier, Free included
Learn more

Compare plans

Everything that ships with each tier.

Feature Free Pro Studio coming soon Max coming soon
Inference & Models
Native llama.cpp engine (Metal GPU)
Model downloads (GGUF / MLX)UnlimitedUnlimitedUnlimitedUnlimited
Multi-model switching
Local OpenAI-compatible API
Local embeddings
Local chat (bundled model)
Cloud BYOK (OpenAI, Anthropic, Groq)
Apple Intelligence
Voice
Transcription (STT)
Text-to-speech (TTS)
Push-to-talk dictation
Push-to-talk AI commands
Pipeline Builder
Visual pipeline builder
Local RAG over files
Tools & Automation
MCP server (stdio + HTTP)
MCP toolsover 300 toolsover 300 toolsover 300 toolsover 300 tools
Browser automation
System actions26 domains26 domains26 domains26 domains
Clipboard integration
Companion Apps
VisionPiper (screen capture & streaming)
AudioPiper (per-app audio capture)
MediaPiper (browser extension)
Content & Media
Image upscaling (ANE-native)
Video upscaling (60fps real-time)
Video editing pipeline
Pose detection (60fps streaming)
Outreach toolkit (queue, posting)
Developer Tools
CodePiper (IDE AI extension)
PiperTest (self-healing browser tests)
Full browser automation (CDP + AX)
API discovery toolkit
Priority support

The free engine is llama.cpp, embedded directly — currently llama-server b9533. The version ships in the app's About panel and updates with every upstream bump.