ToolPiper API

Offline

ToolPiper is the local AI server that powers ModelPiper. It runs on your Mac, manages inference engines, and exposes an OpenAI-compatible API at localhost:9998.

Base URLhttp://localhost:9998/v1SpecGET /v1/openapi.json

How It Works

ModelPiper is the web app you're looking at right now. It talks to ToolPiper, a native macOS app running in the background. ToolPiper manages the actual AI engines — llama.cpp for LLMs, FluidAudio for speech, CoreML for images, and more.

When you create a Provider in ModelPiper, you're setting up a configuration that pairs an AI model with a specific engine. For example: "Use the Llama 3.2 3B model via llama.cpp" or "Use Parakeet for speech-to-text via FluidAudio." Each provider becomes a usable endpoint on the ToolPiper API.

ModelPiper uses an internal session key to stay connected to ToolPiper — you don't need to think about that. But if you want to build your own app on top of ToolPiper, you'll need a Developer Token.

Developer Tokens Pro

A dev token lets you use ToolPiper from your own code, just like an OpenAI API key. Create one, drop it into any OpenAI-compatible SDK, and point the base URL at localhost:9998/v1. That's it.

ToolPiper is offline. Start it to manage tokens.

Quick Start

Drop-in replacement for the OpenAI SDK — just change the base URL and API key. Questions? @ModelPiper on X.

curl http://localhost:9998/v1/chat/completions \
  -H "Authorization: Bearer tp_dev_YOUR_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llama-3.2-3b",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

MCP Server 104 tools

ToolPiper is also an MCP server. Install categories individually to control which tools your AI client sees — saves context tokens.

core11LLM inference, TTS, STT, embeddings, OCR
analysis8Image/text analysis, RAG, upscaling
browser17Browser automation, scraping, assertions
testing6PiperTest CRUD and execution
motion5Pose estimation, stream processing
outreach12GitHub, HN, Reddit, X, content queue
system29macOS system actions (ActionPiper)
video12Video creator pipeline
oauth4OAuth connection management

See MCP docs for install commands, profiles, and full tool reference.

Endpoints

Inference

2OpenAI-compatible chat completions and model listing
GET/v1/modelsList available models
POST/v1/chat/completionsCreate chat completion (streaming supported)

Audio

2Speech-to-text and text-to-speech
POST/v1/audio/transcriptionsTranscribe audio (STT)
POST/v1/audio/speechSynthesize speech (TTS), supports voice cloning

Image & Video

7CoreML-powered image and video upscaling (PiperSR)
POST/v1/images/upscaleUpscale image 2x/4x via CoreML (returns PNG)
POST/v1/video/upscaleUpscale video 2x via PiperSR (base64 in/out)
POST/v1/video/upscale/urlUpscale video 2x from URL
POST/v1/video/upscale/fileUpscale video 2x using file paths (no base64 overhead)
GET/v1/video/upscale/{id}Check upscale job status
GET/v1/video/upscale/{id}/resultDownload upscaled video result (streaming)
POST/v1/benchmark/upscaleRun PiperSR benchmark suite (A–G)

Pose Estimation

8Body keypoint detection for ControlNet, AnimateDiff, DWPose

Stream Processing

3Real-time frame processing from WebSocket sources

RAG

11Retrieval-Augmented Generation

Cloud Proxy

4Keychain-backed cloud API proxy

Models

12Model management and HuggingFace integration

Engine

4Inference engine control

Auth

3Developer tokens (Pro)

Apple Vision

13On-device image analysis via Apple Vision framework

Apple NLP

6On-device text analysis via Apple Natural Language framework

Scrape

4CDP-based web scraping with framework-aware readiness — up to 7 output formats from a single page load

Browser (CDP)

54Chrome DevTools Protocol automation — AX selectors, self-healing, mutation diffing

Testing (PiperTest)

13Visual test session management, execution, and coverage

Video Creator

17Screenplays, recording, rendering, and narration

Conversations

6Chat conversation persistence

OAuth

5OAuth 2.1 connection management for third-party services

Outreach

23GitHub monitoring, content queue, HN, Reddit, X/Twitter, mentions

ActionPiper

9macOS system automation proxy (26 action domains)

API Connections

4Manage external API connections (OpenAI, Anthropic, Google, etc.)

Tool Permissions

2MCP tool access control

Configurations

4Endpoint-level inference configurations

Recording

3Unified browser + native app recording

Workflow Templates

1Curated pipeline workflow templates

System

11Health, resources, events, and logging