Blog

Technical papers, product updates, and press releases from the ModelPiper team.

RSS
105 articles
LatestArticle8 min read

Tool Routing for Local AI on Mac: How Small Models Use 303 Tools

Small local models can't fit a full MCP catalog in 8K context. ToolPiper's on-device retrieval model picks the right tools and shrinks schemas to fit.

Article9 min readby Ben Racicot

How PiperMesh Works: Direct on LAN, Encrypted Everywhere Else

PiperMesh tries three paths to connect your devices: direct on LAN, NAT-traversed peer-to-peer, then encrypted relay. All three use Noise encryption end-to-end.

PiperMeshPrivacyNetworkingEnd-to-end EncryptionmacOSiOSCross-platform
Article6 min readby Ben Racicot

Conversational AI for Mac: Beyond Dictation, Into Action

macOS has had dictation since 2012. ToolPiper adds the layer that was always missing: a local LLM that interprets what you said and does something about it.

VoiceSpeech to TextText GenerationText to SpeechPrivacymacOSApple SiliconProductivity
Article5 min readby Ben Racicot

Wispr Flow Alternative for Mac: More Features for Less Money

ToolPiper Pro is $10/month vs Wispr Flow's $12 (at the time of this writing). It runs entirely on your Mac, never sends audio to the cloud, and does far more than dictation.

VoiceDictationSpeech to TextPrivacymacOSProductivity
Article6 min readby Ben Racicot

Is ToolPiper Safe? What You Can Verify and How

ToolPiper's local features process everything on your Mac. No audio, prompts, or documents travel over the network. Here's exactly what stays local, what doesn't, and how to confirm it yourself.

PrivacymacOSLocal AIApple SiliconSecurity
Article7 min readby Ben Racicot

Token Usage and MCP Tools With ToolPiper

ToolPiper exposes 303 MCP tools to Claude Code. Three layers keep your context window, and your token bill, under control: deferred loading, user permissions, and ToolGate.

MCPClaude CodeDeveloper ToolsmacOSAI CodingToken OptimizationPrivacyApple Silicon
Article6 min readby Ben Racicot

ToolPiper vs Wispr Flow: Local Voice AI or Cloud Dictation

Wispr Flow costs $12/month (at the time of this writing), ToolPiper Pro $10/month - and both do voice dictation on Mac. The difference is architectural: Wispr Flow processes audio in the cloud, ToolPiper on your Neural Engine. Here's the full comparison.

VoiceSpeech to TextPrivacymacOSApple SiliconCompetitor Analysis
Article5 min readby Ben Racicot

Wispr Flow's Privacy Incident: What Happened, What Changed, and What It Means

In 2025, Wispr Flow users discovered the app was sending audio and screenshots to cloud servers without clear disclosure. Here's what happened, how the company responded, and what the incident reveals about cloud voice AI.

VoicePrivacymacOSSpeech to TextCompetitor Analysis
Article5 min readby Ben Racicot

$273 Million in Settlements: The Real Cost of Trusting Cloud Voice AI

Amazon, Apple, and Google have paid over $273 million in legal settlements for voice AI privacy violations. This is the complete record - what happened, what each company paid, and what the pattern reveals.

PrivacyVoiceLocal AImacOSSecurityApple Silicon
Paper11 min readby PiperKit

Why PiperKit Exists: Local AI Is All That's Left

PiperKit LLC builds local AI for Apple Silicon. Cloud AI lost the trust argument, open source closed the quality gap, the next chips finish the job.

Local AIApple SiliconOpen SourcePrivacymacOSCloud AI
Article4 min readby Ben Racicot

Zoom Tried to Train AI on Your Meetings. Here's What Happened.

In 2023, Zoom updated its terms to grant a perpetual license to use meeting content for AI training. The backlash was swift. The terms were revised. But the underlying question remains: what happens to your meetings in the cloud?

PrivacyVoiceLocal AISecurity
Article4 min readby Ben Racicot

AI Meeting Recorders and the Consent Problem Nobody Talks About

Otter.ai faces a class-action lawsuit for recording meetings without proper consent from all participants - including a journalist's interview with a Uyghur activist. Here's what AI meeting tools actually do with your audio.

PrivacyVoiceLocal AISecuritymacOS
Article4 min readby Ben Racicot

Amazon's Alexa Listening Program: What Bloomberg Found and What Amazon Paid

In 2019, Bloomberg revealed that Amazon employed thousands of workers to listen to Alexa recordings - including private medical conversations and what appeared to be a sexual assault. Amazon confirmed it. The FTC settlement was $25 million.

PrivacyVoiceLocal AISecurity
Article4 min readby Ben Racicot

Apple's $95 Million Siri Settlement: What Happened and What It Means

Apple settled a class-action lawsuit for $95 million in 2025 over Siri privacy violations. Contractors were listening to recordings including medical conversations, business negotiations, and intimate moments. Here's the full account.

PrivacyVoicemacOSSpeech to TextSecurity
Article4 min readby Ben Racicot

Google's $68 Million Voice Privacy Settlement: What the Whistleblower Found

Google settled a class-action lawsuit for $68 million in 2026 over Google Assistant recordings. A whistleblower leaked over 1,000 recordings to Belgian media - many captured without wake-word activation. Here's what happened.

PrivacyVoiceLocal AImacOSSecurity
Article7 min readby Ben Racicot

Ollama KV Cache Quantization: Fit Longer Contexts in Less Memory

KV cache is the hidden memory hog in Ollama. Quantizing it from FP16 to q8_0 or q4_0 cuts context memory by 2-4x. Here's how to enable it and what you lose.

OllamaText GenerationPrivacymacOSApple SiliconPerformance
Article6 min readby Ben Racicot

Ollama vs ToolPiper: Model Runner or Model Platform

Ollama is a model runner. ToolPiper is a model platform. Same models, different scope. Here's when each makes sense and how to use both.

OllamaChatText GenerationDeveloperPrivacymacOSApple Silicon
Article7 min readby Ben Racicot

The One-Time License in AI Software Is a Promise Nobody Can Keep

AI tools selling lifetime licenses face a math problem: models change weekly, backends break monthly, but revenue per user is zero after day one. We document who is making this promise today.

ComparisonPricingAI ToolsPrivacymacOS
Article9 min readby Ben Racicot

ChatGPT Alternatives on Mac That Don't Charge Per Token

Six ChatGPT alternatives compared by what you actually pay for: token-metered subscriptions, bring-your-own-key apps, and local inference on Apple Silicon.

ComparisonPricingAI ToolsPrivacymacOSChatGPTApple Silicon
ArticleUpdated Apr 14, 202610 min readby Ben Racicot

Run Multiple Ollama Models on Mac: See What Fits in Memory

Ollama won't tell you if two models fit in memory at once. ToolPiper shows per-model usage, GPU allocation, and warns before you exceed capacity.

OllamaChatText GenerationPrivacymacOSApple Silicon
Article6 min readby Ben Racicot

Ollama Vision GUI on Mac: Use LLaVA Without the Terminal

Ollama supports vision models like LLaVA and Gemma 3, but using them means base64 curl commands. ToolPiper lets you drag an image into chat and ask a question.

VisionOllamaImage UnderstandingText GenerationPrivacymacOS
Article5 min readby Ben Racicot

Give Claude 303 macOS Superpowers with One MCP Server

Install ToolPiper and connect 303 MCP tools to Claude Code in under a minute. Local inference, browser automation, desktop control, testing, and video - all on your Mac.

MCPModel Context ProtocolClaude CodeDeveloper ToolsPrivacymacOSApple SiliconAI Tools
Paper11 min readby Ben Racicot

Your AI Coding Agent Needs a Log Store, Not Terminal Output

The write-run-check-fix loop breaks when check means reading terminal scrollback. A structured log store turns AI debugging from guessing into querying.

DebuggingDeveloper ToolsAI CodingAgentic DevelopmentObservabilityFeedback LoopsMCPmacOS
Article10 min readby Ben Racicot

Tracing Multi-Step AI Pipelines with Correlation IDs

When a 3-step AI pipeline returns wrong output, which step failed? Correlation IDs group every event from one execution into a single trace.

DebuggingPipelinesDeveloper ToolsObservabilityCorrelation IDsmacOSLoggingAI Workflows
Article8 min readby Ben Racicot

Full HTTP Body Capture: See Exactly What Your Local AI Sent and Received

LogPiper captures full request and response bodies for every API call through ToolPiper. When your local LLM returns garbage, the exact prompt is in the logs.

DebuggingHTTPDeveloper ToolsAPImacOSLoggingLocal AIMCP
Paper12 min readby Ben Racicot

An MCP Tool That IS a Logging Service: How LogPiper Closes the Agent Feedback Loop

Every MCP observability tool monitors agents from outside. LogPiper is a logging service the agent uses directly - bidirectional, persistent, and queryable.

MCPLoggingArchitectureDeveloper ToolsObservabilitymacOSSwiftAI Coding
Article6 min readby Ben Racicot

Ollama Pipelines on Mac: Chain Models in a Visual Workflow

Ollama runs one model at a time. ToolPiper's visual pipeline builder chains multiple models into workflows using your Ollama models as backends.

PipelinesOllamaText GenerationPrivacymacOSApple Silicon
Article5 min readby Ben Racicot

Voice Chat With Ollama on Mac: Add STT and TTS to Any Local Model

Add voice conversation to Ollama on Mac by chaining three local models: speech-to-text, your LLM, and text-to-speech. All on-device, no cloud APIs.

VoiceOllamaSpeech to TextText GenerationText to SpeechPrivacymacOS
Article4 min readby Ben Racicot

The Samsung ChatGPT Leak: What Developers Need to Know

In April 2023, Samsung engineers pasted proprietary semiconductor source code into ChatGPT across three incidents in 20 days. Samsung banned AI tools company-wide within a month. Here's what happened and what it means for developers using cloud AI.

PrivacyDeveloperLocal AImacOSSecurity
Article9 min readby Ben Racicot

Why Voice AI Should Never Leave Your Device

Voice is the most personal data you generate. Cloud voice AI sends it to servers you can't audit, trains on it without your knowledge, and has a documented history of human reviewers listening in. Here's why local inference is the only appropriate architecture for voice.

VoicePrivacyLocal AImacOSApple SiliconSpeech to Text
Article8 min readby Ben Racicot

Cross-App Debugging on macOS Without Installing a Logging Framework

Python, shell, JavaScript, Swift - all logging to one HTTP endpoint. No SDK, no dependency, no config. LogPiper aggregates logs from any process on localhost.

DebuggingDeveloper ToolsmacOSLoggingPythonJavaScriptSwiftCross-Platform
Article8 min readby Ben Racicot

Cursor Debug Mode vs. a Persistent Log Bus for AI Debugging

Cursor Debug Mode gives your AI a temporary log server. LogPiper gives it a persistent, queryable log store. Both solve the same problem differently.

DebuggingCursorDeveloper ToolsComparisonMCPmacOSAI CodingLogging
Article10 min readby Ben Racicot

How to Debug with Claude Code Using a Local Log Bus

Tell Claude Code to POST logs to LogPiper, reproduce the bug, query the errors, hand them back. A persistent debugging feedback loop in four steps.

DebuggingClaude CodeMCPDeveloper ToolsmacOSLoggingAI CodingAgentic Development
Article8 min readby Ben Racicot

Vibe Coding Hits a Wall at 80%. Debugging the Last 20% Without Debugging Skills

AI gets 80% of the code right. The last 20% requires debugging. LogPiper gives you structured error data without needing to use a traditional debugger.

Vibe CodingDebuggingDeveloper ToolsAI CodingmacOSLoggingBeginnerMCP
Article8 min readby Ben Racicot

LogPiper: A Universal Logging Bus That Ships Free Inside ToolPiper

ToolPiper includes a real-time logging service any app, script, or MCP tool can write to and query. Fire-and-forget ingestion, structured queries, SSE streaming, correlation IDs, and full HTTP body capture - zero setup.

LoggingDeveloper ToolsDebuggingMCPmacOSOn-Device AIObservabilityREST APISSE
Article4 min readby Ben Racicot

Offline Voice Typing on Mac: Dictate Without Internet on Apple Silicon

ToolPiper runs STT on Apple's Neural Engine with zero internet dependency. Push-to-talk and 142 voice commands work offline. $10/month Pro.

Voice DictationOfflineSpeech to TextPrivacymacOSNeural EnginePush to TalkAir GapApple Silicon
Paper12 min readby Ben Racicot

Vibe Debugging: The Observability Gap in AI-Assisted Development

AI coding assistants write code but can't see runtime behavior. LogPiper gives them a queryable log store - the missing feedback loop for agentic development.

DebuggingDeveloper ToolsVibe CodingMCPObservabilityAI CodingAgentic DevelopmentmacOS
Article5 min readby Ben Racicot

Voice Coding on Mac Without the Cloud: Dictate into Cursor, VS Code, and Terminal

Push-to-talk dictation for developers on Apple's Neural Engine. 140ms, private, $10/month Pro. Plus 303 MCP tools for system control from Claude Code.

Voice CodingDeveloper ToolsSpeech to TextPrivacymacOSOn-Device AICursorVS CodeMCPVibe Coding
Article5 min readby Ben Racicot

Best Free Dictation App for Mac in 2026: Apple, Whisper, and the On-Device Option

Four genuinely free dictation options on Mac. Apple built-in, Whisper.cpp, MacWhisper, and ToolPiper compared for accuracy, latency, and privacy.

Voice DictationFreeSpeech to TextmacOSComparisonPrivacyPush to TalkApple DictationWhisper
Article6 min readby Ben Racicot

Private Voice Dictation on Mac: No Cloud, No Data Collection, No Compromise

ToolPiper runs Parakeet STT on Apple's Neural Engine. 140ms, 25 languages, fully offline, $10/month Pro. Your voice never leaves your Mac.

Voice DictationPrivacySpeech to TextOn-Device AImacOSNeural EnginePush to TalkHIPAACompliance
Article9 min readby Ben Racicot

Visual Testing on Mac: Record, Replay, Export to Playwright

Record browser tests visually, replay with self-healing selectors, export to Playwright or Cypress. No code required. AX-native, local, Chrome-based.

TestingBrowser AutomationAccessibilityPrivacymacOSPlaywrightDeveloper Tools
Article9 min readby Ben Racicot

Wispr Flow Alternative for Mac: Private, On-Device Dictation

ToolPiper Pro runs STT on Apple's Neural Engine. Private, 140ms latency, $10/month - less than Wispr (at the time of this writing), plus 142 voice commands Wispr doesn't have.

Voice DictationWispr FlowSpeech to TextPrivacymacOSOn-Device AIPush to TalkFreeAlternative
Article8 min readby Ben Racicot

Best Ollama Frontend for Mac: Every GUI Option Compared

Every Ollama frontend on Mac compared honestly: Ollama App, Open WebUI, LM Studio, Ollamac Pro, Askimo, and ToolPiper. Install complexity, features, and trade-offs.

OllamaChatText GenerationPrivacymacOSApple Silicon
Article10 min readby Ben Racicot

Browser Health Monitoring: Catch What Tests Miss on Mac

Passive health monitors run after every test step. Console errors, JS exceptions, HTTP failures - caught automatically with zero configuration.

TestingBrowser AutomationMonitoringPrivacymacOSDeveloper Tools
Article10 min readby Ben Racicot

Reduce Test Maintenance Cost on Mac: AX Selectors and Self-Healing

Hard numbers on E2E test maintenance. 30-50% of testing time goes to upkeep. AX selectors and self-healing change the equation. Free on Mac.

TestingSelf-HealingQABrowser AutomationPrivacymacOS
Article12 min readby Ben Racicot

Temporal Assertions: Always, Eventually, Next Testing on Mac

Three temporal assertion modes that replace brittle waits. Always holds for duration, eventually within deadline, next on immediate check. 100ms polling.

TestingBrowser AutomationAccessibilityPrivacymacOSDeveloper Tools
Article8 min readby Ben Racicot

Test Coverage on Mac: See Which Interactions Your Tests Actually Cover

Interaction coverage shows which buttons, forms, and links your tests actually touch - not just which code lines execute. PiperProbe scans the AX tree.

Test CoverageTestingBrowser AutomationDeveloper ToolsPrivacymacOS
Article9 min readby Ben Racicot

Testim and mabl Alternative: Enterprise Self-Healing Testing for Free

Enterprise testing tools proved that self-healing and visual authoring matter. They also gate those features behind $500+/month pricing. PiperTest delivers the same capabilities locally on your Mac, for free.

TestingSelf-HealingQABrowser AutomationPrivacymacOS
Article20 min readby Ben Racicot

Accessibility-Native Testing: Why the AX Tree Is the Right Abstraction for Selectors

The technical case for testing against Chrome's real accessibility tree instead of the DOM. How AX selectors work, why Playwright's getByRole is a DOM simulation, and what changes when you query the tree that screen readers actually use.

TestingAccessibilityBrowser AutomationCDPPrivacymacOSDeveloper Tools
Article12 min readby Ben Racicot

Accessibility Testing Automation on Mac: AX Selectors as Audits

AX-native test selectors double as accessibility audits. If your app isn't accessible, your tests won't resolve. Free on Mac.

TestingAccessibilityBrowser AutomationWCAGPrivacymacOS
Article12 min readby Ben Racicot

AI Test Generation on Mac: From AX Snapshot to Running Tests

Generate browser tests with any AI model. PiperTest feeds the AX tree as plain text, AI reasons about what to test, and tests run locally on Mac.

TestingAIBrowser AutomationMCPPrivacymacOSDeveloper Tools
Article10 min readby Ben Racicot

Export Tests to Playwright and Cypress From Mac in One Click

Export PiperTest recordings to clean Playwright or Cypress code. Deterministic selector mapping, mock rendering, temporal comments. Zero vendor lock-in.

TestingPlaywrightCypressBrowser AutomationPrivacymacOS
Article13 min readby Ben Racicot

MCP Testing Tools: How AI Agents Write and Run Browser Tests

20 MCP tools for AI-driven browser testing. AI agents take AX snapshots, generate test steps, run them with self-healing, and export to Playwright. Works with Claude Code, Cursor, or any MCP client.

TestingMCPBrowser AutomationAIClaude CodePrivacymacOSDeveloper Tools
Article11 min readby Ben Racicot

Cypress Alternative: Self-Healing AX Selectors That Don't Need the Cloud

An honest comparison of PiperTest vs Cypress for teams frustrated by Cloud costs, no multi-tab support, Safari still experimental after 6 years, and cy.prompt() sending your page data to remote servers. Real feature comparison, MCP tool analysis, and migration path.

TestingCypressBrowser AutomationAccessibilitySelf-HealingPrivacymacOS
Article11 min readby Ben Racicot

Fix Flaky Tests on Mac: Self-Healing AX Selectors That Work

Stop chasing brittle CSS selectors. PiperTest targets the accessibility tree and self-heals broken selectors in 5-15ms, locally on your Mac.

TestingSelf-HealingBrowser AutomationAccessibilityPrivacymacOSDeveloper Tools
Article10 min readby Ben Racicot

No-Code Browser Testing: The Tool QA Engineers Have Been Waiting For

You know every user flow in your app. You just can't write Playwright tests. PiperTest lets you record, replay, and export browser tests visually - no code, no IDE, no config files.

TestingQABrowser AutomationAccessibilityPrivacymacOSDeveloper Tools
Article12 min readby Ben Racicot

Playwright Alternative: AX-Native Visual Testing With Self-Healing Selectors

An honest comparison of PiperTest vs Playwright for teams frustrated by maintenance burden, selector fragility, and non-developer exclusion. Real AX tree, self-healing, visual format, 20 MCP tools.

TestingPlaywrightBrowser AutomationAccessibilitySelf-HealingMCPPrivacymacOS
Article10 min readby Ben Racicot

Selenium Alternative 2026: An Honest Look at What's Changed and What Hasn't

Selenium still dominates browser automation with 50M monthly downloads and 10K+ job postings. But 45% of teams report constant test breakage. Here's what a modern Selenium alternative actually looks like in 2026.

TestingSeleniumBrowser AutomationAccessibilitySelf-HealingPrivacymacOS
Article9 min readby Ben Racicot

Automate Demo Videos on Mac: AI Screenplay to Rendered MP4

Define a screenplay, let AI record and narrate your demo, and render a finished MP4. All local on Mac. No cloud, no editing timeline.

Video CreatorAutomationScreen RecordingText to SpeechPrivacymacOSDeveloper Tools
Article5 min readby Ben Racicot

Mac Screen Capture with Live AI Streaming: Record, GIF, and Stream to Vision Models

Capture any region of your Mac screen, record to MP4, export as GIF or WebP, and stream live at 30fps to AI vision models - all from a free menu bar app.

Screen CaptureScreen RecordingGIFmacOSVisionAI StreamingPrivacy
Article6 min readby Ben Racicot

Ollama Chat Without Docker on Mac: Native Alternatives to Open WebUI

Open WebUI needs Docker, eating 2-4GB of RAM before you load a model. ToolPiper is a native Mac app that discovers your Ollama models automatically.

OllamaChatText GenerationPrivacymacOSApple Silicon
Article12 min readby Ben Racicot

Playwright vs Cypress vs PiperTest in 2026: The Testing Framework Comparison Nobody Wrote

The definitive 3-way comparison of Playwright, Cypress, and PiperTest for 2026. Master comparison table, MCP tool analysis, real AX tree vs DOM injection, performance data, and honest recommendations.

TestingPlaywrightCypressBrowser AutomationAccessibilitySelf-HealingMCPPrivacymacOS
Article13 min readby Ben Racicot

Self-Healing Test Selectors: How PiperTest Fixes Broken Tests Automatically

How self-healing selectors actually work - three modes from 5ms AX fuzzy matching to AI-assisted repair. Technical deep-dive with comparison to Testim, mabl, Cypress, and Playwright.

TestingSelf-HealingBrowser AutomationAccessibilityPrivacymacOSDeveloper Tools
Article11 min readby Ben Racicot

Test Recorder for Browser on Mac: AX-Enriched, Not Brittle

Record browser tests with AX-enriched selectors that self-heal. No browser driver, no brittle CSS. Direct CDP WebSocket, structured steps, one-click export.

TestingBrowser AutomationAccessibilityPrivacymacOSDeveloper Tools
Article8 min readby Ben Racicot

Desktop Automation on Mac: AI Controls Your System Settings, Windows, and Apps

AI that types and talks but can't move a window or toggle dark mode. ToolPiper closes that gap with 26 action domains, voice commands, and MCP tools.

Desktop AutomationSystem ControlAI ActionsPrivacymacOSAccessibilityProductivity
Article5 min readby Ben Racicot

Hover Zoom Alternative for Chrome: Full-Size Image Preview Without the Privacy Risks

Preview full-size images and videos by hovering over thumbnails - with intelligent CDN pattern discovery, no tracking, and no data collection. For Chrome, Firefox, and Safari.

Browser ExtensionImage PreviewChrome ExtensionFirefox ExtensionPrivacyCDN
Article9 min readby Ben Racicot

Push-to-Talk AI on Mac: Dictate and Command from Any App

Hold a key, speak, release. Text appears at your cursor or your Mac executes a command. All local, all on Apple Neural Engine, 140ms latency.

Voice ControlSpeech to TextText GenerationPrivacymacOSDictationProductivity
Article5 min readby Ben Racicot

Capture Per-App Audio on Mac Without Virtual Drivers

Record and stream audio from any individual Mac app - Chrome, Spotify, Zoom - without installing virtual audio drivers. AudioPiper uses Core Audio Taps built into macOS.

Audio CapturemacOSCore Audio TapsRecordingStreamingPrivacy
Article8 min readby Ben Racicot

Live Transcription on Mac: Real-Time Speech to Text as You Speak

See words appear on screen as people speak, processed entirely on your Mac. Streaming STT with AudioPiper and FluidAudio on the Neural Engine.

Live TranscriptionSpeech to TextStreamingPrivacymacOSNeural EngineProductivity
Article5 min readby Ben Racicot

Mac Clipboard Manager with AI Snippets: Maccy + TextExpander + Dictation in One App

A free Mac clipboard manager with AI-powered text expansion, push-to-talk dictation, and 200+ item history - replacing three separate apps with one menu bar tool.

Clipboard ManagerText ExpanderDictationmacOSProductivityAI
Article8 min readby Ben Racicot

Browser Automation on Mac: Your Local LLM Sees and Controls Chrome

AI browser automation that keeps your page content local. ToolPiper's 14 CDP tools let a local LLM drive Chrome via the accessibility tree.

Browser AutomationCDPAccessibilityAI TestingPrivacymacOSDeveloper Tools
Article7 min readby Ben Racicot

Code Search with Local RAG on Mac: Ask Your Codebase Without Uploading It

Search your codebase with AI using local RAG on Mac. Semantic code search, hybrid vector + keyword retrieval, all on-device. No code leaves your machine.

RAGCode SearchEmbeddingsText GenerationPrivacymacOSDeveloper Tools
Article8 min readby Ben Racicot

Local AI Agents on Mac: Tool Calling Without the Cloud

Run AI agents locally on your Mac with 303 MCP tools. Tool calling, multi-step reasoning, desktop control - no cloud APIs, no data leaving your machine.

AI AgentsTool CallingMCPText GenerationPrivacymacOSAutomation
Article3 min readby Ben Racicot

Ollama CORS Fix on Mac: One Environment Variable, No Configuration

Fix Ollama CORS errors on Mac with one environment variable. Or skip CORS entirely with ToolPiper - same models, zero configuration.

OllamaChatDeveloperPrivacymacOS
Article8 min readby Ben Racicot

Web Scraping on Mac: 7 Extraction Formats with Framework Detection

Scrape JavaScript-heavy websites using a real browser, detect 16 frontend frameworks, and extract content in 7 formats. All local, no cloud APIs.

Web ScrapingBrowser AutomationRAGPrivacymacOSDeveloper Tools
Article7 min readby Ben Racicot

Apple Intelligence and Open Models on Mac: Use Both in One App

Apple Intelligence runs on the Neural Engine. Open models run on Metal GPU. ToolPiper runs both in one app with smart routing and a shared interface.

Apple IntelligenceText GenerationPrivacymacOSNeural EngineApple Silicon
Article8 min readby Ben Racicot

Local Embeddings on Mac: On-Device Vector Search Without Cloud APIs

Run embedding models and vector search entirely on your Mac. Three local paths, HNSW indexing, and an OpenAI-compatible API. Zero data exposure.

EmbeddingsVector SearchRAGPrivacymacOSNeural EngineDeveloper Tools
Article7 min readby Ben Racicot

Local OpenAI-Compatible API on Mac: Drop-In Replacement on Localhost

Run an OpenAI-compatible API on localhost with ToolPiper. Same SDK, same code, same prompts. Just change the base URL and your LLM runs locally on Mac.

APIOpenAI CompatibleDeveloper ToolsPrivacymacOSIntegration
ArticleUpdated Apr 12, 20267 min readby Ben Racicot

Local MCP Server on Mac: 303 AI Tools in One Install

ToolPiper is a 303-tool MCP server for macOS. Local inference, browser automation, voice, vision, desktop control, and testing from one native app.

MCPModel Context ProtocolDeveloper ToolsPrivacymacOSClaude CodeAI ToolsApple Silicon
Article9 min readby Ben Racicot

AI Model Memory on Mac: How Much RAM Do You Actually Need?

How much RAM do LLMs need on Mac? Understand Apple Silicon unified memory, model sizing, and how ToolPiper prevents out-of-memory crashes.

Resource IntelligenceText GenerationPrivacymacOSApple SiliconMemory Management
Article8 min readby Ben Racicot

Local AI vs ChatGPT Plus: What You Get Without the Subscription

ChatGPT Plus costs $240/year. Local AI on your Mac costs nothing per query. Here is what you gain, what you lose, and when each makes sense.

ComparisonText GenerationPrivacymacOSApple SiliconChatGPT
Article7 min readby Ben Racicot

Local LLM Benchmarks on Apple Silicon: Token Speed Across M1 to M4

Consistent benchmarks for local LLM speed on Mac. Token generation rates across M1 through M4 chips, with real models and real workloads.

BenchmarksPerformanceText GenerationmacOSApple SiliconMetal GPU
Article8 min readby Ben Racicot

Which Local LLM on Mac: A Model Selection Guide for Apple Silicon

Hundreds of models on HuggingFace, one Mac. Here's the decision tree for picking the right local LLM based on your RAM, use case, and quality needs.

Model SelectionText GenerationPrivacymacOSApple SiliconLLMQuantization
Article8 min readby Ben Racicot

Install MLX-Audio on Mac: Python TTS and the Zero-Code Alternative

How to install and use MLX-Audio for local text-to-speech on Mac - then discover ToolPiper, which bundles the same models in a native app with zero Python required.

MLX-AudioText to SpeechPrivacymacOSApple Silicon
Article7 min readby Ben Racicot

Install Ollama on Mac: Setup Guide and the One-App Alternative

Step-by-step guide to install and configure Ollama on Mac - then discover ToolPiper, the one-app alternative with built-in inference, voice, vision, and 303 MCP tools.

OllamaText GenerationLocal AIChatPrivacymacOSApple Silicon
Paper8 min readby Ben Racicot

AX-Native Browser Automation: Why We Built Our Own CDP Engine

How ToolPiper's accessibility-tree-first approach to browser automation enables AI-powered test generation that works with any model provider - not just MCP-aware clients.

Browser AutomationCDPAccessibilityTestingPrivacymacOS
Paper7 min readby Ben Racicot

PiperSR: First Open-Source Super-Resolution Model for Apple Neural Engine

Introducing PiperSR - a 453K-parameter super-resolution model purpose-built for Apple Neural Engine. 37.54 dB PSNR on Set5, 44 FPS real-time video upscale on Apple Silicon. CoreML, open source, runs on any Mac.

PiperSRSuper ResolutionApple Neural EngineCoreMLPrivacymacOSApple SiliconOpen Source
Paper8 min readby Ben Racicot

Building Over 140 MCP Tools in Swift: Architecture of a Unified Local AI Server

How ToolPiper became the first MCP server to unify LLM inference, TTS, STT, embeddings, OCR, vision, browser automation, and RAG behind a single install - all in native Swift.

MCPSwiftModel Context ProtocolPrivacymacOSArchitecture
Paper7 min readby Ben Racicot

How We Achieved 44 FPS Video Upscale on Apple Neural Engine

A deep dive into PiperSR's double-buffered ANE+Metal pipeline that upscales 360p video to 720p at 44.4 FPS - 1.5x realtime on Apple Silicon.

Video UpscaleApple Neural EngineCoreMLPrivacymacOSApple SiliconPerformance
Article6 min readby Ben Racicot

Pose Estimation and Mocap on Mac: Skeleton Tracking Without the Cloud

Detect human poses, track skeletons, and stream motion capture data in real time - all running on your Mac's Neural Engine. No cloud, no markers, no special hardware.

Pose EstimationMotion CaptureNeural EngineComputer VisionPrivacymacOSApple SiliconControlNet
Article5 min readby Ben Racicot

Local Video Upscale on Mac: 2x Resolution at Realtime Speed

Upscale video from 360p to 720p at 44 FPS on your Mac's Neural Engine - no cloud upload, no watermark, audio preserved automatically.

Video UpscaleSuper ResolutionNeural EngineCoreMLPrivacymacOSApple Silicon
Article6 min readby Ben Racicot

Local RAG Chat on Mac: Ask Your Documents, Keep Your Data

Index your files and ask questions about them using local AI - retrieval-augmented generation running entirely on your Mac, no documents uploaded anywhere.

RAGText GenerationEmbeddingsPrivacymacOSDocument Q&AApple Silicon
Article4 min readby Ben Racicot

Local Image Upscale on Mac: 4x Resolution Without the Cloud

Upscale photos and screenshots 2x or 4x on your Mac using CoreML super-resolution models - no upload, no API, no quality loss from compression.

Image UpscaleSuper ResolutionNeural EngineCoreMLPrivacymacOSApple Silicon
Article3 min readby Ben Racicot

Deep Reasoning on Mac: Complex Problem-Solving Without the Cloud

Reasoning models think step by step before answering. Now they run locally on your Mac - private, free per query, with full chain-of-thought transparency.

ReasoningText GenerationPrivacymacOSChain of ThoughtProblem Solving
Article2 min readby Ben Racicot

Image Narration on Mac: AI Describes What It Sees and Reads It Aloud

Drop an image into ModelPiper - a vision model describes what's in it, then text-to-speech reads the description aloud. All on-device, all private.

Image NarrationImage UnderstandingText to SpeechPrivacymacOSAccessibility
Article3 min readby Ben Racicot

Local Document OCR on Mac: Extract Text Without Uploading Anything

Extract text from scanned documents, photos, and screenshots using Apple Vision OCR - on-device, fast, and private. No cloud upload required.

OCRImage UnderstandingPrivacymacOSNeural EngineDocument Processing
Article3 min readby Ben Racicot

Screen Q&A on Mac: Ask AI About Anything on Your Screen

Select any region of your screen, ask a question, get an answer - all locally. VisionPiper captures your screen and feeds it to a vision model running on your Mac.

Screen Q&AImage UnderstandingPrivacymacOSVisionPiperApple Silicon
Article3 min readby Ben Racicot

Voice Cloning on Mac: Replicate Any Voice, Entirely Local

Clone any voice from a short audio sample, entirely on your Mac - no biometric data uploaded anywhere. Voice is too sensitive for the cloud.

Voice CloningText to SpeechPrivacymacOSBiometrics
Article3 min readby Ben Racicot

Live Translation on Mac: Speak One Language, Hear Another

Real-time speech translation running entirely on your Mac - speak English, hear Portuguese. No cloud, no Google Translate, no data leaving your machine.

TranslationSpeech to TextText GenerationText to SpeechPrivacymacOSMultilingual
Article3 min readby Ben Racicot

Transcribe and Summarize on Mac: Audio In, Key Points Out

Drop a meeting recording into ModelPiper and get a structured summary - decisions, action items, key points - without uploading confidential audio to any cloud service.

TranscriptionSpeech to TextText GenerationPrivacymacOSMeetingsProductivity
Article4 min readby Ben Racicot

Voice Chat on Mac: Talk to AI Locally, Hear It Respond

A full voice conversation with AI - speech-to-text, language model, text-to-speech - running entirely on your Mac. No cloud, no latency, no data leaving your machine.

Voice ChatSpeech to TextText GenerationText to SpeechPrivacymacOS
Article3 min readby Ben Racicot

Local Text to Speech on Mac: AI Voices Without the Cloud

Modern AI text-to-speech voices sound human - and they run locally on your Mac's GPU. No cloud service ever sees your text. Here's how to use it.

Voice SynthesisText to SpeechPrivacymacOSNeural Engine
Article3 min readby Ben Racicot

Local Voice Transcription on Mac: Meeting Notes Without the Cloud

Transcribe meetings, lectures, and voice memos on your Mac with Whisper-class accuracy - entirely offline, with no audio uploaded to any server.

TranscriptionSpeech to TextPrivacymacOSNeural EngineMeetings
Article4 min readby Ben Racicot

Private Local Chat on Mac: ChatGPT Without the Cloud

Run a private AI chatbot entirely on your Mac - no data leaves your machine, no API keys, no internet required. Here's how local LLM chat actually works on Apple Silicon.

ChatText GenerationPrivacymacOSApple Silicon
Article6 min readby Ben Racicot

Local-First AI on macOS: Why Your Data Should Never Leave Your Machine

Your Mac has dedicated AI hardware built in. Here's why local-first AI matters - privacy by architecture, zero API costs, no rate limits - and how ModelPiper makes it practical.

Local AIText GenerationSpeech to TextText to SpeechPrivacymacOSApple Silicon