---
title: "Can I Run This LLM? — Mac AI Model Checker | ModelPiper"
description: "Check which AI models your Mac can run locally. Apple Silicon optimized — M1, M2, M3, M4. See fit ratings, estimated speed, and recommended quantizations."
canonical: "https://modelpiper.com/fit"
---

# Can I Run This LLM? — Mac AI Model Checker | ModelPiper

> Check which AI models your Mac can run locally. Apple Silicon optimized — M1, M2, M3, M4. See fit ratings, estimated speed, and recommended quantizations.

# What LLMs Can Run on Your Mac?

ChipM5 Pro

Unified Memory24 GB

Bandwidth 307 GB/s

Available for Models ~21 GB

Browse by Mac model

## Model Compatibility

All Categories

Show All

All Sizes

Best Fit

Showing 866 of 866 models

[Qwen3.5 0.8B](/fit/qwen-qwen3-5-0-8b)

General · Alibaba · 2026-02-28

Q8\_0Excellent

1.5 GB6% of RAM~185 tok/s0.87B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 2B](/fit/qwen-qwen3-5-2b)

General · Alibaba · 2026-02-28

Q8\_0Excellent

3.0 GB13% of RAM~71 tok/s2.27B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 0.8B Base](/fit/qwen-qwen3-5-0-8b-base)

General · Alibaba · 2026-02-28

Q8\_0Excellent

1.5 GB6% of RAM~185 tok/s0.87B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 2B Base](/fit/qwen-qwen3-5-2b-base)

General · Alibaba · 2026-02-28

Q8\_0Excellent

3.0 GB13% of RAM~71 tok/s2.27B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2.5 VL 1.6B](/fit/liquidai-lfm2-5-vl-1-6b)

General · Liquid AI · 2026-01-05

Q8\_0Excellent

2.3 GB10% of RAM~101 tok/s1.6B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2.5 1.2B Base](/fit/liquidai-lfm2-5-1-2b-base)

General · Liquid AI · 2026-01-05

Q8\_0Excellent

1.8 GB8% of RAM~137 tok/s1.17B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2.5 1.2B Thinking](/fit/liquidai-lfm2-5-1-2b-thinking)

General · Liquid AI · 2026-01-20

Q8\_0Excellent

1.8 GB8% of RAM~137 tok/s1.17B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 2.6B Exp](/fit/liquidai-lfm2-2-6b-exp)

General · Liquid AI · 2025-12-25

Q8\_0Excellent

3.4 GB14% of RAM~63 tok/s2.57B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2.5 1.2B JP](/fit/liquidai-lfm2-5-1-2b-jp)

General · Liquid AI · 2026-01-04

Q8\_0Excellent

1.8 GB8% of RAM~137 tok/s1.17B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 2.6B Transcript](/fit/liquidai-lfm2-2-6b-transcript)

General · Liquid AI · 2026-01-05

Q8\_0Excellent

3.4 GB14% of RAM~63 tok/s2.57B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 4B](/fit/qwen-qwen3-5-4b)

General · Alibaba · 2026-02-27

Q8\_0Excellent

5.7 GB24% of RAM~35 tok/s4.66B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 4 E2B it](/fit/google-gemma-4-e2b-it)

General · Google · 2026-03-02

Q8\_0Excellent

6.2 GB26% of RAM~31 tok/s5.12B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2.5 1.2B Instruct](/fit/liquidai-lfm2-5-1-2b-instruct)

Chat · Liquid AI · 2026-01-06

Q8\_0Excellent

1.8 GB8% of RAM~137 tok/s1.17B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 4B Base](/fit/qwen-qwen3-5-4b-base)

General · Alibaba · 2026-02-27

Q8\_0Excellent

5.7 GB24% of RAM~35 tok/s4.66B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[granite 4.0 h tiny](/fit/ibm-granite-granite-4-0-h-tiny)

General · ibm-granite · 2025-09-16

Q8\_0Excellent

8.2 GB34% of RAM~167 tok/s6.94B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 8B A1B](/fit/liquidai-lfm2-8b-a1b)

General · Liquid AI · 2025-10-07

Q8\_0Excellent

9.8 GB41% of RAM~107 tok/s8.34B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 ColBERT 350M](/fit/liquidai-lfm2-colbert-350m)

General · Liquid AI · 2025-10-28

Q8\_0Excellent

0.9 GB4% of RAM~459 tok/s0.35B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 VL 3B](/fit/liquidai-lfm2-vl-3b)

General · Liquid AI · 2025-10-22

Q8\_0Excellent

3.8 GB16% of RAM~54 tok/s3B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 2.6B](/fit/liquidai-lfm2-2-6b)

General · Liquid AI · 2025-09-22

Q8\_0Excellent

3.4 GB14% of RAM~63 tok/s2.57B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 350M Extract](/fit/liquidai-lfm2-350m-extract)

General · Liquid AI · 2025-09-03

Q8\_0Excellent

0.9 GB4% of RAM~459 tok/s0.35B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 350M PII Extract JP](/fit/liquidai-lfm2-350m-pii-extract-jp)

General · Liquid AI · 2025-09-30

Q8\_0Excellent

0.9 GB4% of RAM~459 tok/s0.35B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 350M ENJP MT](/fit/liquidai-lfm2-350m-enjp-mt)

General · Liquid AI · 2025-09-03

Q8\_0Excellent

0.9 GB4% of RAM~459 tok/s0.35B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 1.2B RAG](/fit/liquidai-lfm2-1-2b-rag)

General · Liquid AI · 2025-09-03

Q8\_0Excellent

1.8 GB8% of RAM~137 tok/s1.17B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 1.2B Tool](/fit/liquidai-lfm2-1-2b-tool)

General · Liquid AI · 2025-09-03

Q8\_0Excellent

1.8 GB8% of RAM~137 tok/s1.17B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[granite 4.0 h micro](/fit/ibm-granite-granite-4-0-h-micro)

General · ibm-granite · 2025-09-16

Q8\_0Excellent

4.1 GB17% of RAM~50 tok/s3.19B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 1.2B](/fit/liquidai-lfm2-1-2b)

General · Liquid AI · 2025-07-10

Q8\_0Excellent

1.8 GB8% of RAM~137 tok/s1.17B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 350M](/fit/liquidai-lfm2-350m)

General · Liquid AI · 2025-07-10

Q8\_0Excellent

0.9 GB4% of RAM~459 tok/s0.35B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 VL 450M](/fit/liquidai-lfm2-vl-450m)

General · Liquid AI · 2025-08-12

Q8\_0Excellent

1.0 GB4% of RAM~357 tok/s0.45B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 700M](/fit/liquidai-lfm2-700m)

General · Liquid AI · 2025-07-10

Q8\_0Excellent

1.3 GB6% of RAM~217 tok/s0.74B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 VL 1.6B](/fit/liquidai-lfm2-vl-1-6b)

General · Liquid AI · 2025-08-12

Q8\_0Excellent

2.3 GB9% of RAM~102 tok/s1.58B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 1.2B Extract](/fit/liquidai-lfm2-1-2b-extract)

General · Liquid AI · 2025-08-22

Q8\_0Excellent

1.8 GB8% of RAM~137 tok/s1.17B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 350M Math](/fit/liquidai-lfm2-350m-math)

General · Liquid AI · 2025-08-25

Q8\_0Excellent

0.9 GB4% of RAM~459 tok/s0.35B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[SmolLM3 3B](/fit/huggingfacetb-smollm3-3b)

General · huggingfacetb · 2025-07-08

Q8\_0Excellent

3.9 GB16% of RAM~52 tok/s3.08B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[EXAONE 4.0 1.2B](/fit/lgai-exaone-exaone-4-0-1-2b)

General · lgai-exaone · 2025-07-11

Q8\_0Excellent

1.9 GB8% of RAM~126 tok/s1.28B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 0.6B](/fit/qwen-qwen3-0-6b)

General · Alibaba · 2025-04-27

Q8\_0Excellent

1.3 GB6% of RAM~214 tok/s0.75B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 1.7B](/fit/qwen-qwen3-1-7b)

General · Alibaba · 2025-04-27

Q8\_0Excellent

2.8 GB12% of RAM~79 tok/s2.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 1.5B](/fit/qwen-qwen2-5-1-5b)

General · Alibaba

Q8\_0Excellent

2.2 GB9% of RAM~104 tok/s1.54B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[DeepSeek R1 Distill Qwen 1.5B](/fit/deepseek-ai-deepseek-r1-distill-qwen-1-5b)

Reasoning · DeepSeek

Q8\_0Excellent

2.5 GB10% of RAM~90 tok/s1.78B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 Coder 30B A3B Instruct gptq 8bit](/fit/btbtyler09-qwen3-coder-30b-a3b-instruct-gptq-8bit)

Coding · btbtyler09

Q8\_0Excellent

10.9 GB45% of RAM~158 tok/s9.3B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[DeepSeek R1 0528 Qwen3 8B](/fit/lmstudio-community-deepseek-r1-0528-qwen3-8b-mlx-4bit)

Reasoning · lmstudio-community

mlx-8bitExcellent

1.9 GB8% of RAM~132 tok/s1.28B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[PaddleOCR VL 1.5](/fit/paddlepaddle-paddleocr-vl-1-5)

General · paddlepaddle

Q8\_0Excellent

1.6 GB7% of RAM~168 tok/s0.96B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 3 12b it quantized W4A16](/fit/abhishekchohan-gemma-3-12b-it-quantized-w4a16)

General · abhishekchohan

Q8\_0Excellent

3.7 GB15% of RAM~56 tok/s2.86B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[tiny random BambaForCausalLM](/fit/hmellor-tiny-random-bambaforcausallm)

General · hmellor

Q8\_0Excellent

0.5 GB2% of RAM~5,360 tok/s0.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2 1.5B](/fit/qwen-qwen2-1-5b)

General · Alibaba

Q8\_0Excellent

2.2 GB9% of RAM~104 tok/s1.54B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[VideoLLaMA3 2B Image HF](/fit/lkhl-videollama3-2b-image-hf)

General · lkhl

Q8\_0Excellent

2.7 GB11% of RAM~82 tok/s1.96B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Ilama 3.2 1B](/fit/hmellor-ilama-3-2-1b)

General · hmellor

Q8\_0Excellent

1.9 GB8% of RAM~130 tok/s1.24B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[tiny mixtral](/fit/titanml-tiny-mixtral)

General · titanml

Q8\_0Excellent

0.8 GB3% of RAM~2,265 tok/s0.25B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[plamo 2 1b](/fit/pfnet-plamo-2-1b)

General · pfnet

Q8\_0Excellent

1.9 GB8% of RAM~125 tok/s1.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 4B Thinking 2507](/fit/lmstudio-community-qwen3-4b-thinking-2507-mlx-4bit)

General · lmstudio-community

mlx-8bitExcellent

1.2 GB5% of RAM~268 tok/s0.63B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 Coder 30B A3B Instruct AWQ 4bit](/fit/cyankiwi-qwen3-coder-30b-a3b-instruct-awq-4bit)

Coding · cyankiwi

Q8\_0Excellent

6.4 GB27% of RAM~277 tok/s5.31B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[granite 4.0 h tiny AWQ 4bit](/fit/cyankiwi-granite-4-0-h-tiny-awq-4bit)

General · cyankiwi

Q8\_0Excellent

2.7 GB11% of RAM~579 tok/s2B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 2B Thinking](/fit/qwen-qwen3-vl-2b-thinking)

General · Alibaba

Q8\_0Excellent

2.9 GB12% of RAM~75 tok/s2.13B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Phi 4 mini reasoning](/fit/lmstudio-community-phi-4-mini-reasoning-mlx-4bit)

Reasoning · lmstudio-community

mlx-8bitExcellent

1.1 GB5% of RAM~281 tok/s0.6B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[olmOCR 2 7B 1025 INT4](/fit/winninghealth-olmocr-2-7b-1025-int4)

General · winninghealth

Q8\_0Excellent

2.5 GB10% of RAM~91 tok/s1.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama 3.2 1B Aegis SFT DPO](/fit/ahczhg-llama-3-2-1b-aegis-sft-dpo)

General · ahczhg

Q8\_0Excellent

1.9 GB8% of RAM~130 tok/s1.24B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[CyberXP\_Agent\_Llama\_3.2\_1B](/fit/abaryan-cyberxp_agent_llama_3-2_1b)

General · abaryan

Q8\_0Excellent

1.9 GB8% of RAM~130 tok/s1.24B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[tiny random llama4](/fit/optimum-intel-internal-testing-tiny-random-llama4)

General · optimum-intel-internal-testing

Q8\_0Excellent

0.5 GB2% of RAM~223,726 tok/s0.01B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llama 3.2 1b code instruct](/fit/shahriarferdoush-llama-3-2-1b-code-instruct)

Coding · shahriarferdoush

Q8\_0Excellent

1.9 GB8% of RAM~130 tok/s1.24B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[granite 4.0 h 1b](/fit/ibm-granite-granite-4-0-h-1b)

General · ibm-granite

Q8\_0Excellent

2.1 GB9% of RAM~110 tok/s1.46B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[typhoon ocr1.5 2b](/fit/typhoon-ai-typhoon-ocr1-5-2b)

Reasoning · typhoon-ai

Q8\_0Excellent

2.9 GB12% of RAM~75 tok/s2.13B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 2B AWQ](/fit/quanttrio-qwen3-5-2b-awq)

General · quanttrio

Q8\_0Excellent

3.0 GB13% of RAM~71 tok/s2.27B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Orpo Llama 3.2 1B 15k](/fit/adamlucek-orpo-llama-3-2-1b-15k)

General · adamlucek

Q8\_0Excellent

1.9 GB8% of RAM~130 tok/s1.24B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 1.2B](/fit/lmstudio-community-lfm2-1-2b-mlx-8bit)

General · lmstudio-community

mlx-8bitExcellent

0.9 GB4% of RAM~512 tok/s0.33B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 1.2B MLX bf16](/fit/lmstudio-community-lfm2-1-2b-mlx-bf16)

General · lmstudio-community

mlx-8bitExcellent

1.7 GB7% of RAM~144 tok/s1.17B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 4B PARO](/fit/z-lab-qwen3-5-4b-paro)

General · z-lab

Q8\_0Excellent

2.1 GB9% of RAM~110 tok/s1.46B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 3 tiny random](/fit/yujiepan-gemma-3-tiny-random)

General · yujiepan

Q8\_0Excellent

0.5 GB2% of RAM~16,081 tok/s0.01B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 35B A3B quantized.w4a16](/fit/apolo13x-qwen3-5-35b-a3b-quantized-w4a16)

General · apolo13x

Q8\_0Excellent

7.6 GB32% of RAM~316 tok/s6.38B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[tiny random Idefics3ForConditionalGeneration](/fit/optimum-intel-internal-testing-tiny-random-idefics3forconditionalgeneration)

General · optimum-intel-internal-testing

Q8\_0Excellent

0.5 GB2% of RAM~8,040 tok/s0.02B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 4B Thinking AWQ 4bit](/fit/cyankiwi-qwen3-vl-4b-thinking-awq-4bit)

General · cyankiwi

Q8\_0Excellent

2.5 GB10% of RAM~91 tok/s1.76B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Huihui Qwen3.5 2B abliterated](/fit/huihui-ai-huihui-qwen3-5-2b-abliterated)

General · huihui-ai

Q8\_0Excellent

3.0 GB13% of RAM~71 tok/s2.27B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[PaddleOCR VL](/fit/paddlepaddle-paddleocr-vl)

General · paddlepaddle

Q8\_0Excellent

1.6 GB7% of RAM~168 tok/s0.96B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 2B RRG SFT](/fit/dmusingu-qwen3-vl-2b-rrg-sft)

General · dmusingu

Q8\_0Excellent

2.9 GB12% of RAM~75 tok/s2.13B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 2B AWQ 4bit](/fit/cyankiwi-qwen3-5-2b-awq-4bit)

General · cyankiwi

Q8\_0Excellent

3.1 GB13% of RAM~69 tok/s2.32B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[dots.ocr](/fit/rednote-hilab-dots-ocr)

General · rednote-hilab

Q8\_0Excellent

3.9 GB16% of RAM~53 tok/s3.04B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[granite 4.0 micro base](/fit/ibm-granite-granite-4-0-micro-base)

General · ibm-granite

Q8\_0Excellent

4.3 GB18% of RAM~47 tok/s3.4B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[dots.ocr 1.5](/fit/kristaller486-dots-ocr-1-5)

General · kristaller486

Q8\_0Excellent

3.9 GB16% of RAM~53 tok/s3.04B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 4B](/fit/qwen-qwen3-4b-mlx-4bit)

General · Alibaba

mlx-8bitExcellent

1.1 GB5% of RAM~296 tok/s0.57B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[dots.mocr](/fit/rednote-hilab-dots-mocr)

General · rednote-hilab

Q8\_0Excellent

3.9 GB16% of RAM~53 tok/s3.04B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 9B PARO](/fit/z-lab-qwen3-5-9b-paro)

General · z-lab

Q8\_0Excellent

4.3 GB18% of RAM~47 tok/s3.44B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[HTML Pruner Phi 3.8B](/fit/zstanjj-html-pruner-phi-3-8b)

General · zstanjj

Q8\_0Excellent

4.8 GB20% of RAM~42 tok/s3.82B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Bonsai 8B](/fit/prism-ml-bonsai-8b-mlx-1bit)

General · prism-ml

mlx-8bitExcellent

0.9 GB4% of RAM~444 tok/s0.38B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Nanonets OCR s](/fit/nanonets-nanonets-ocr-s)

General · nanonets

Q8\_0Excellent

4.7 GB20% of RAM~43 tok/s3.75B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 4B Claude 4.6 Opus Reasoning Distilled](/fit/jackrong-qwen3-5-4b-claude-4-6-opus-reasoning-distilled)

Reasoning · jackrong

Q8\_0Excellent

5.7 GB24% of RAM~35 tok/s4.66B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[pixtral 12b quantized.w4a16](/fit/redhatai-pixtral-12b-quantized-w4a16)

General · redhatai

Q8\_0Excellent

4.1 GB17% of RAM~50 tok/s3.23B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[CheXOne](/fit/stanfordaimi-chexone)

General · stanfordaimi

Q8\_0Excellent

4.7 GB20% of RAM~43 tok/s3.75B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[NV Reason CXR 3B](/fit/nvidia-nv-reason-cxr-3b)

Reasoning · nvidia

Q8\_0Excellent

4.7 GB20% of RAM~43 tok/s3.75B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2.5 Audio 1.5B](/fit/liquidai-lfm2-5-audio-1-5b)

General · Liquid AI · 2025-12-18

Q8\_0Excellent

2.1 GB9% of RAM~109 tok/s1.47B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 0.5B](/fit/qwen-qwen2-5-0-5b)

General · Alibaba

Q8\_0Excellent

1.0 GB4% of RAM~328 tok/s0.49B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 0.6B FP8](/fit/qwen-qwen3-0-6b-fp8)

General · Alibaba

Q8\_0Excellent

1.3 GB6% of RAM~214 tok/s0.75B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 4B Thinking 2507](/fit/qwen-qwen3-4b-thinking-2507)

General · Alibaba

Q8\_0Excellent

5.0 GB21% of RAM~40 tok/s4.02B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 1.5B quantized.w8a8](/fit/redhatai-qwen2-5-1-5b-quantized-w8a8)

General · redhatai

Q8\_0Excellent

2.5 GB10% of RAM~90 tok/s1.78B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[OTel LLM 1B IT](/fit/farbodtavakkoli-otel-llm-1b-it)

General · farbodtavakkoli

Q8\_0Excellent

1.6 GB7% of RAM~161 tok/s1B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[DeepSeek R1 Distill Qwen 7B](/fit/deepseek-ai-deepseek-r1-distill-qwen-7b)

Reasoning · DeepSeek · 2025-01-20

Q8\_0Excellent

9.0 GB38% of RAM~21 tok/s7.62B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[OTel LLM 270M IT](/fit/farbodtavakkoli-otel-llm-270m-it)

General · farbodtavakkoli

Q8\_0Excellent

0.8 GB3% of RAM~596 tok/s0.27B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 4 E4B it](/fit/google-gemma-4-e4b-it)

General · Google · 2026-03-02

Q8\_0Excellent

9.4 GB39% of RAM~20 tok/s8B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Nanbeige4.1 3B](/fit/nanbeige-nanbeige4-1-3b)

General · nanbeige

Q8\_0Excellent

4.9 GB20% of RAM~41 tok/s3.93B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 4B FP8](/fit/lovedheart-qwen3-5-4b-fp8)

General · lovedheart

Q8\_0Excellent

5.7 GB24% of RAM~35 tok/s4.66B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 1.7B Base](/fit/qwen-qwen3-1-7b-base)

General · Alibaba

Q8\_0Excellent

2.4 GB10% of RAM~93 tok/s1.72B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[OTel LLM 4B IT](/fit/farbodtavakkoli-otel-llm-4b-it)

General · farbodtavakkoli

Q8\_0Excellent

5.3 GB22% of RAM~37 tok/s4.3B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[NVIDIA Nemotron 3 Nano 30B A3B AWQ](/fit/stelterlab-nvidia-nemotron-3-nano-30b-a3b-awq)

General · stelterlab

Q8\_0Excellent

6.1 GB26% of RAM~32 tok/s5.05B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 3 27b it GPTQ 4b 128g](/fit/ista-daslab-gemma-3-27b-it-gptq-4b-128g)

General · ista-daslab

Q8\_0Excellent

6.3 GB26% of RAM~31 tok/s5.23B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3Guard Gen 0.6B](/fit/qwen-qwen3guard-gen-0-6b)

General · Alibaba

Q8\_0Excellent

1.3 GB6% of RAM~214 tok/s0.75B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llava interleave qwen 0.5b hf](/fit/llava-hf-llava-interleave-qwen-0-5b-hf)

General · llava-hf

Q8\_0Excellent

1.5 GB6% of RAM~187 tok/s0.86B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 4B AWQ 4bit](/fit/cyankiwi-qwen3-5-4b-awq-4bit)

General · cyankiwi

Q8\_0Excellent

5.8 GB24% of RAM~34 tok/s4.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL3 1B hf](/fit/opengvlab-internvl3-1b-hf)

General · opengvlab

Q8\_0Excellent

1.5 GB6% of RAM~171 tok/s0.94B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[qwen base invoicev1.01 1.5B](/fit/laap-ai-qwen-base-invoicev1-01-1-5b)

General · laap-ai

Q8\_0Excellent

2.2 GB9% of RAM~104 tok/s1.54B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Jan nano AWQ](/fit/warshanks-jan-nano-awq)

General · warshanks

Q8\_0Excellent

1.9 GB8% of RAM~128 tok/s1.26B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[OTel LLM 3B IT](/fit/farbodtavakkoli-otel-llm-3b-it)

General · farbodtavakkoli

Q8\_0Excellent

5.2 GB22% of RAM~38 tok/s4.25B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 4B Thinking](/fit/qwen-qwen3-vl-4b-thinking)

General · Alibaba

Q8\_0Excellent

5.5 GB23% of RAM~36 tok/s4.44B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Ovis2.5 2B](/fit/aidc-ai-ovis2-5-2b)

General · aidc-ai

Q8\_0Excellent

3.4 GB14% of RAM~63 tok/s2.57B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Jan v1 4B](/fit/janhq-jan-v1-4b)

General · janhq

Q8\_0Excellent

5.0 GB21% of RAM~40 tok/s4.02B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[VLM2Vec Full](/fit/tiger-lab-vlm2vec-full)

General · tiger-lab

Q8\_0Excellent

5.1 GB21% of RAM~39 tok/s4.15B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 4B DFlash b16](/fit/z-lab-qwen3-4b-dflash-b16)

General · z-lab

Q8\_0Excellent

1.1 GB5% of RAM~298 tok/s0.54B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[pii extractor gemma 3 270m it](/fit/jakobhuss-pii-extractor-gemma-3-270m-it)

General · jakobhuss

Q8\_0Excellent

0.8 GB3% of RAM~596 tok/s0.27B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[SmolLM3 3B Base](/fit/huggingfacetb-smollm3-3b-base)

General · huggingfacetb

Q8\_0Excellent

3.9 GB16% of RAM~52 tok/s3.08B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Isaac 0.2 2B Preview](/fit/perceptronai-isaac-0-2-2b-preview)

General · perceptronai

Q8\_0Excellent

3.4 GB14% of RAM~63 tok/s2.57B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[chandra ocr 2](/fit/datalab-to-chandra-ocr-2)

General · datalab-to

Q8\_0Excellent

6.4 GB27% of RAM~30 tok/s5.3B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Isaac 0.1](/fit/perceptronai-isaac-0-1)

General · perceptronai

Q8\_0Excellent

3.4 GB14% of RAM~63 tok/s2.57B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Hulu Med 4B](/fit/zju-ai4h-hulu-med-4b)

General · zju-ai4h

Q8\_0Excellent

5.9 GB25% of RAM~33 tok/s4.83B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen1.5 0.5B](/fit/qwen-qwen1-5-0-5b)

General · Alibaba

Q8\_0Excellent

1.2 GB5% of RAM~259 tok/s0.62B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 3 12b it quantized.w4a16](/fit/redhatai-gemma-3-12b-it-quantized-w4a16)

General · redhatai

Q8\_0Excellent

4.8 GB20% of RAM~42 tok/s3.86B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[granite 4.0 350m](/fit/ibm-granite-granite-4-0-350m)

General · ibm-granite

Q8\_0Excellent

0.9 GB4% of RAM~459 tok/s0.35B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 14B](/fit/lmstudio-community-qwen3-14b-mlx-4bit)

General · lmstudio-community

mlx-8bitExcellent

3.0 GB12% of RAM~73 tok/s2.31B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[ue3 qw35 4b v1](/fit/tristepin-ue3-qw35-4b-v1)

General · tristepin

Q8\_0Excellent

5.7 GB24% of RAM~35 tok/s4.66B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[QwQ 32B](/fit/lmstudio-community-qwq-32b-mlx-4bit)

General · lmstudio-community

mlx-8bitExcellent

6.0 GB25% of RAM~33 tok/s5.12B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[xLAM 2 1b fc r](/fit/salesforce-xlam-2-1b-fc-r)

General · salesforce

Q8\_0Excellent

2.2 GB9% of RAM~104 tok/s1.54B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 1.7B FP8](/fit/qwen-qwen3-1-7b-fp8)

General · Alibaba

Q8\_0Excellent

2.8 GB12% of RAM~79 tok/s2.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 8B](/fit/lmstudio-community-qwen3-8b-mlx-4bit)

General · lmstudio-community

mlx-8bitExcellent

1.9 GB8% of RAM~132 tok/s1.28B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 1.7B](/fit/lmstudio-community-qwen3-1-7b-mlx-8bit)

General · lmstudio-community

mlx-8bitExcellent

1.0 GB4% of RAM~352 tok/s0.48B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Rex Omni](/fit/idea-research-rex-omni)

General · idea-research

Q8\_0Excellent

5.0 GB21% of RAM~40 tok/s4.07B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 4B AWQ BF16 INT4](/fit/cyankiwi-qwen3-5-4b-awq-bf16-int4)

General · cyankiwi

Q8\_0Excellent

5.8 GB24% of RAM~34 tok/s4.74B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 1.7B MLX bf16](/fit/lmstudio-community-qwen3-1-7b-mlx-bf16)

General · lmstudio-community

mlx-8bitExcellent

2.3 GB10% of RAM~98 tok/s1.72B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen1.5 1.8B](/fit/qwen-qwen1-5-1-8b)

General · Alibaba

Q8\_0Excellent

2.6 GB11% of RAM~87 tok/s1.84B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama 3.1 Nemotron Nano 4B v1.1](/fit/nvidia-llama-3-1-nemotron-nano-4b-v1-1)

General · nvidia

Q8\_0Excellent

5.5 GB23% of RAM~36 tok/s4.51B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 0.6B](/fit/primeintellect-qwen3-0-6b)

General · primeintellect

Q8\_0Excellent

1.3 GB6% of RAM~214 tok/s0.75B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 4B AWQ](/fit/quanttrio-qwen3-5-4b-awq)

General · quanttrio

Q8\_0Excellent

5.7 GB24% of RAM~35 tok/s4.66B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Heron NVILA Lite 1B hf](/fit/turing-motors-heron-nvila-lite-1b-hf)

General · turing-motors

Q8\_0Excellent

1.5 GB6% of RAM~177 tok/s0.91B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[tiny random minicpm v 4\_5](/fit/optimum-intel-internal-testing-tiny-random-minicpm-v-4_5)

General · optimum-intel-internal-testing

Q8\_0Excellent

0.5 GB2% of RAM~8,040 tok/s0.02B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Huihui Qwen3.5 4B abliterated](/fit/huihui-ai-huihui-qwen3-5-4b-abliterated)

General · huihui-ai

Q8\_0Excellent

5.6 GB23% of RAM~35 tok/s4.54B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL3\_5 1B HF](/fit/opengvlab-internvl3_5-1b-hf)

General · opengvlab

Q8\_0Excellent

1.7 GB7% of RAM~152 tok/s1.06B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[vllm translategemma 4b it](/fit/infomaniak-ai-vllm-translategemma-4b-it)

General · infomaniak-ai

Q8\_0Excellent

6.0 GB25% of RAM~32 tok/s4.97B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL3\_5 2B HF](/fit/opengvlab-internvl3_5-2b-hf)

General · opengvlab

Q8\_0Excellent

3.1 GB13% of RAM~68 tok/s2.35B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL3 2B hf](/fit/opengvlab-internvl3-2b-hf)

General · opengvlab

Q8\_0Excellent

2.8 GB12% of RAM~77 tok/s2.09B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[GLM 4.6V Flash AWQ 8bit](/fit/cyankiwi-glm-4-6v-flash-awq-8bit)

General · cyankiwi

Q8\_0Excellent

5.4 GB23% of RAM~36 tok/s4.43B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Huihui Qwen3.5 4B Claude 4.6 Opus abliterated](/fit/huihui-ai-huihui-qwen3-5-4b-claude-4-6-opus-abliterated)

General · huihui-ai

Q8\_0Excellent

5.7 GB24% of RAM~35 tok/s4.66B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Holo2 4B](/fit/hcompany-holo2-4b)

General · hcompany

Q8\_0Excellent

5.5 GB23% of RAM~36 tok/s4.44B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL2\_5 2B MPO hf](/fit/opengvlab-internvl2_5-2b-mpo-hf)

General · opengvlab

Q8\_0Excellent

3.0 GB12% of RAM~73 tok/s2.21B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[NuExtract 2.0 2B](/fit/numind-nuextract-2-0-2b)

General · numind

Q8\_0Excellent

3.0 GB12% of RAM~73 tok/s2.21B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[granite 4.0 3b vision](/fit/ibm-granite-granite-4-0-3b-vision)

General · ibm-granite

Q8\_0Excellent

5.0 GB21% of RAM~40 tok/s4B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Ovis2 1B hf](/fit/thisisiron-ovis2-1b-hf)

General · thisisiron

Q8\_0Excellent

1.8 GB7% of RAM~142 tok/s1.13B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Gemma 4 26B A4B JANG\_4M CRACK](/fit/dealignai-gemma-4-26b-a4b-jang_4m-crack)

General · dealignai

Q8\_0Excellent

5.8 GB24% of RAM~34 tok/s4.72B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 Audio 1.5B](/fit/liquidai-lfm2-audio-1-5b)

General · Liquid AI · 2025-08-28

Q8\_0Excellent

2.1 GB9% of RAM~109 tok/s1.47B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 VL 3B Instruct](/fit/qwen-qwen2-5-vl-3b-instruct)

Chat · Alibaba · 2025-01-26

Q8\_0Excellent

4.7 GB20% of RAM~43 tok/s3.75B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LightOnOCR 2 1B](/fit/lightonai-lightonocr-2-1b)

General · lightonai

Q8\_0Excellent

1.6 GB7% of RAM~159 tok/s1.01B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 Coder 1.5B Instruct](/fit/qwen-qwen2-5-coder-1-5b-instruct)

Coding · Alibaba · 2024-09-18

Q8\_0Excellent

2.2 GB9% of RAM~104 tok/s1.54B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 3B](/fit/qwen-qwen2-5-3b)

General · Alibaba

Q8\_0Excellent

3.9 GB16% of RAM~52 tok/s3.09B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 Coder 0.5B Instruct](/fit/qwen-qwen2-5-coder-0-5b-instruct)

Coding · Alibaba

Q8\_0Excellent

1.0 GB4% of RAM~328 tok/s0.49B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 Coder 3B Instruct](/fit/qwen-qwen2-5-coder-3b-instruct)

Coding · Alibaba

Q8\_0Excellent

3.9 GB16% of RAM~52 tok/s3.09B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[QVikhr 3 1.7B Instruction noreasoning](/fit/vikhrmodels-qvikhr-3-1-7b-instruction-noreasoning)

Reasoning · vikhrmodels

Q8\_0Excellent

2.4 GB10% of RAM~93 tok/s1.72B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 Coder 14B Instruct](/fit/lmstudio-community-qwen2-5-coder-14b-instruct-mlx-4bit)

Coding · lmstudio-community

mlx-8bitExcellent

3.0 GB12% of RAM~73 tok/s2.31B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[MinerU2.5 2509 1.2B](/fit/opendatalab-mineru2-5-2509-1-2b)

General · opendatalab

Q8\_0Excellent

1.8 GB7% of RAM~139 tok/s1.16B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 Coder 3B](/fit/qwen-qwen2-5-coder-3b)

Coding · Alibaba

Q8\_0Excellent

3.9 GB16% of RAM~52 tok/s3.09B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Falcon H1 0.5B Base](/fit/tiiuae-falcon-h1-0-5b-base)

General · TII

Q8\_0Excellent

1.1 GB5% of RAM~309 tok/s0.52B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 Coder 0.5B](/fit/qwen-qwen2-5-coder-0-5b)

Coding · Alibaba

Q8\_0Excellent

1.0 GB4% of RAM~328 tok/s0.49B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Phi 4 reasoning plus](/fit/lmstudio-community-phi-4-reasoning-plus-mlx-4bit)

Reasoning · lmstudio-community

mlx-8bitExcellent

2.9 GB12% of RAM~74 tok/s2.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[xLAM 2 3b fc r](/fit/salesforce-xlam-2-3b-fc-r)

General · salesforce

Q8\_0Excellent

3.9 GB16% of RAM~52 tok/s3.09B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LightOnOCR 2 1B base](/fit/lightonai-lightonocr-2-1b-base)

General · lightonai

Q8\_0Excellent

1.6 GB7% of RAM~159 tok/s1.01B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 2B Instruct](/fit/qwen-qwen3-vl-2b-instruct)

Chat · Alibaba

Q8\_0Excellent

2.9 GB12% of RAM~75 tok/s2.13B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama 3.2 1B Instruct FP8 dynamic](/fit/redhatai-llama-3-2-1b-instruct-fp8-dynamic)

Chat · redhatai

Q8\_0Excellent

2.2 GB9% of RAM~107 tok/s1.5B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[SmolLM2 135M](/fit/huggingfacetb-smollm2-135m)

General · huggingfacetb

Q8\_0Excellent

0.6 GB3% of RAM~1,237 tok/s0.13B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama 3.2 1B Instruct FP8](/fit/redhatai-llama-3-2-1b-instruct-fp8)

Chat · redhatai

Q8\_0Excellent

2.2 GB9% of RAM~107 tok/s1.5B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 4B Base](/fit/qwen-qwen3-4b-base)

General · Alibaba

Q8\_0Excellent

5.0 GB21% of RAM~40 tok/s4.02B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 4B Instruct AWQ 4bit](/fit/cyankiwi-qwen3-vl-4b-instruct-awq-4bit)

Chat · cyankiwi

Q8\_0Excellent

2.5 GB10% of RAM~91 tok/s1.76B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Phi 4 multimodal instruct](/fit/microsoft-phi-4-multimodal-instruct)

Chat · Microsoft · 2025-02-24

Q8\_0Excellent

6.7 GB28% of RAM~29 tok/s5.57B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[tiny random Gemma2ForCausalLM](/fit/hmellor-tiny-random-gemma2forcausallm)

General · hmellor

Q8\_0Excellent

0.5 GB2% of RAM~16,081 tok/s0.01B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 4B Instruct](/fit/lmstudio-community-qwen3-vl-4b-instruct-mlx-4bit)

Chat · lmstudio-community

mlx-8bitExcellent

1.6 GB7% of RAM~162 tok/s1.04B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2.5 1.2B Instruct](/fit/lmstudio-community-lfm2-5-1-2b-instruct-mlx-8bit)

Chat · lmstudio-community

mlx-8bitExcellent

0.9 GB4% of RAM~512 tok/s0.33B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[DeepSeek R1 0528 Qwen3 8B](/fit/deepseek-ai-deepseek-r1-0528-qwen3-8b)

Reasoning · DeepSeek

Q8\_0Excellent

9.6 GB40% of RAM~20 tok/s8.19B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 30B A3B Instruct 2507 AWQ](/fit/stelterlab-qwen3-30b-a3b-instruct-2507-awq)

Chat · stelterlab

Q8\_0Excellent

5.6 GB24% of RAM~319 tok/s4.61B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[SmolLM2 1.7B](/fit/huggingfacetb-smollm2-1-7b)

General · huggingfacetb

Q8\_0Excellent

2.4 GB10% of RAM~94 tok/s1.71B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 8B Instruct AWQ 4bit](/fit/cyankiwi-qwen3-vl-8b-instruct-awq-4bit)

Chat · cyankiwi

Q8\_0Excellent

3.7 GB16% of RAM~55 tok/s2.91B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[SmolLM2 360M](/fit/huggingfacetb-smollm2-360m)

General · huggingfacetb

Q8\_0Excellent

0.9 GB4% of RAM~447 tok/s0.36B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[starvector 1b im2svg](/fit/starvector-starvector-1b-im2svg)

General · starvector

Q8\_0Excellent

2.1 GB9% of RAM~112 tok/s1.43B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 2 2b it](/fit/efficient-large-model-gemma-2-2b-it)

General · efficient-large-model

Q8\_0Excellent

3.4 GB14% of RAM~62 tok/s2.61B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 4B SafeRL](/fit/qwen-qwen3-4b-saferl)

General · Alibaba

Q8\_0Excellent

5.4 GB23% of RAM~36 tok/s4.41B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 4B Instruct 2507](/fit/lmstudio-community-qwen3-4b-instruct-2507-mlx-4bit)

Chat · lmstudio-community

mlx-8bitExcellent

1.2 GB5% of RAM~268 tok/s0.63B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[R 4B](/fit/yannqi-r-4b)

General · yannqi

Q8\_0Excellent

5.9 GB24% of RAM~33 tok/s4.82B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 Coder 32B Instruct](/fit/lmstudio-community-qwen2-5-coder-32b-instruct-mlx-4bit)

Coding · lmstudio-community

mlx-8bitExcellent

6.0 GB25% of RAM~33 tok/s5.12B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 4B FP8](/fit/qwen-qwen3-4b-fp8)

General · Alibaba

Q8\_0Excellent

5.4 GB23% of RAM~36 tok/s4.41B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 2B Instruct FP8](/fit/qwen-qwen3-vl-2b-instruct-fp8)

Chat · Alibaba

Q8\_0Excellent

3.2 GB13% of RAM~66 tok/s2.44B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 8B NVFP4](/fit/nvidia-qwen3-8b-nvfp4)

General · nvidia

Q8\_0Excellent

5.8 GB24% of RAM~34 tok/s4.72B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 30B A3B Instruct 2507 AWQ 4bit](/fit/cyankiwi-qwen3-30b-a3b-instruct-2507-awq-4bit)

Chat · cyankiwi

Q8\_0Excellent

6.4 GB27% of RAM~277 tok/s5.31B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[academic ds 9B](/fit/bytedance-seed-academic-ds-9b)

General · bytedance-seed

Q8\_0Excellent

11.0 GB46% of RAM~215 tok/s9.37B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Vikhr Llama 3.2 1B Instruct](/fit/vikhrmodels-vikhr-llama-3-2-1b-instruct)

Chat · vikhrmodels

Q8\_0Excellent

1.9 GB8% of RAM~130 tok/s1.24B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Fine R1 7B](/fit/stevenhh2000-fine-r1-7b)

Reasoning · stevenhh2000

Q8\_0Excellent

9.7 GB41% of RAM~19 tok/s8.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Cosmos Reason2 2B W4A16 Edge2](/fit/embedl-cosmos-reason2-2b-w4a16-edge2)

Embedding · embedl

Q8\_0Excellent

2.9 GB12% of RAM~75 tok/s2.14B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 30B A3B Instruct AWQ 4bit](/fit/cyankiwi-qwen3-vl-30b-a3b-instruct-awq-4bit)

Chat · cyankiwi

Q8\_0Excellent

7.0 GB29% of RAM~345 tok/s5.85B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL3\_5 4B HF](/fit/opengvlab-internvl3_5-4b-hf)

General · opengvlab

Q8\_0Excellent

5.8 GB24% of RAM~34 tok/s4.73B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llava gemma 2b](/fit/intel-llava-gemma-2b)

General · intel

Q8\_0Excellent

3.6 GB15% of RAM~57 tok/s2.82B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 9B](/fit/qwen-qwen3-5-9b)

General · Alibaba · 2026-02-27

Q8\_0Excellent

11.3 GB47% of RAM~17 tok/s9.65B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[DeepSeek OCR](/fit/deepseek-ai-deepseek-ocr)

General · DeepSeek

Q8\_0Excellent

4.2 GB18% of RAM~48 tok/s3.34B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[DeepSeek OCR 2](/fit/deepseek-ai-deepseek-ocr-2)

General · DeepSeek

Q8\_0Excellent

4.3 GB18% of RAM~47 tok/s3.39B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 Coder Next AWQ 4bit](/fit/cyankiwi-qwen3-coder-next-awq-4bit)

Coding · cyankiwi

Q8\_0Excellent

16.6 GB69% of RAM~162 tok/s14.44B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[GigaChat3 10B A1.8B](/fit/ai-sage-gigachat3-10b-a1-8b)

Chat · ai-sage

Q8\_0Excellent

13.3 GB55% of RAM~216 tok/s11.48B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 Coder Next AWQ 4bit](/fit/bullpoint-qwen3-coder-next-awq-4bit)

Coding · bullpoint

Q8\_0Excellent

16.6 GB69% of RAM~162 tok/s14.44B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 9B Claude 4.6 Opus Reasoning Distilled](/fit/jackrong-qwen3-5-9b-claude-4-6-opus-reasoning-distilled)

Reasoning · jackrong

Q8\_0Excellent

11.3 GB47% of RAM~17 tok/s9.65B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 9B Base](/fit/qwen-qwen3-5-9b-base)

General · Alibaba · 2026-02-26

Q8\_0Excellent

11.3 GB47% of RAM~17 tok/s9.65B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[starcoder2 3b](/fit/bigcode-starcoder2-3b)

Coding · BigCode

Q8\_0Excellent

3.9 GB16% of RAM~53 tok/s3.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama 3.2 3B Instruct FP8](/fit/redhatai-llama-3-2-3b-instruct-fp8)

Chat · redhatai

Q8\_0Excellent

4.5 GB19% of RAM~45 tok/s3.61B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 9B Claude 4.6 Opus Reasoning Distilled v2](/fit/jackrong-qwen3-5-9b-claude-4-6-opus-reasoning-distilled-v2)

Reasoning · jackrong

Q8\_0Excellent

11.3 GB47% of RAM~17 tok/s9.65B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 24B A2B](/fit/liquidai-lfm2-24b-a2b)

General · Liquid AI · 2026-02-24

Q5\_K\_MTight

18.3 GB76% of RAM~108 tok/s23.84B params

Try Q4\_K\_M (15.9 GB, ~127 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 VL 3B Instruct abliterated](/fit/huihui-ai-qwen2-5-vl-3b-instruct-abliterated)

Chat · huihui-ai

Q8\_0Excellent

4.7 GB20% of RAM~43 tok/s3.75B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 9B gemini 3.1 opus 4.6 reasoning](/fit/momix-44-qwen3-5-9b-gemini-3-1-opus-4-6-reasoning)

Reasoning · momix-44

Q8\_0Excellent

11.0 GB46% of RAM~17 tok/s9.41B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 1.5B Instruct](/fit/qwen-qwen2-5-1-5b-instruct)

Chat · Alibaba

Q8\_0Excellent

2.2 GB9% of RAM~104 tok/s1.54B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 4B Instruct 2507](/fit/qwen-qwen3-4b-instruct-2507)

Chat · Alibaba

Q8\_0Excellent

5.0 GB21% of RAM~40 tok/s4.02B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 0.5B Instruct](/fit/qwen-qwen2-5-0-5b-instruct)

Chat · Alibaba

Q8\_0Excellent

1.0 GB4% of RAM~328 tok/s0.49B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2 1.5B Instruct](/fit/qwen-qwen2-1-5b-instruct)

Chat · Alibaba

Q8\_0Excellent

2.2 GB9% of RAM~104 tok/s1.54B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 4B Instruct](/fit/qwen-qwen3-vl-4b-instruct)

Chat · Alibaba

Q8\_0Excellent

5.5 GB23% of RAM~36 tok/s4.44B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 4B Instruct 2507 FP8](/fit/qwen-qwen3-4b-instruct-2507-fp8)

Chat · Alibaba

Q8\_0Excellent

5.4 GB23% of RAM~36 tok/s4.41B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Phi 3.5 mini instruct](/fit/microsoft-phi-3-5-mini-instruct)

Chat · Microsoft · 2024-08-16

Q8\_0Excellent

4.8 GB20% of RAM~42 tok/s3.82B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2 0.5B Instruct](/fit/qwen-qwen2-0-5b-instruct)

Chat · Alibaba

Q8\_0Excellent

1.0 GB4% of RAM~328 tok/s0.49B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 4B Instruct 2507 GPTQ Int4](/fit/junhowie-qwen3-4b-instruct-2507-gptq-int4)

Chat · junhowie

Q8\_0Excellent

5.0 GB21% of RAM~40 tok/s4.02B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 Next 80B A3B Thinking AWQ 4bit](/fit/cyankiwi-qwen3-next-80b-a3b-thinking-awq-4bit)

General · cyankiwi

Q8\_0Excellent

16.9 GB71% of RAM~159 tok/s14.74B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 0.5B Instruct](/fit/gensyn-qwen2-5-0-5b-instruct)

Chat · gensyn

Q8\_0Excellent

1.0 GB4% of RAM~328 tok/s0.49B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen1.5 0.5B Chat](/fit/qwen-qwen1-5-0-5b-chat)

Chat · Alibaba

Q8\_0Excellent

1.2 GB5% of RAM~259 tok/s0.62B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Nemotron H 4B Instruct 128K](/fit/nvidia-nemotron-h-4b-instruct-128k)

Chat · nvidia

Q8\_0Excellent

5.5 GB23% of RAM~36 tok/s4.49B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen1.5 1.8B Chat](/fit/qwen-qwen1-5-1-8b-chat)

Chat · Alibaba

Q8\_0Excellent

2.6 GB11% of RAM~87 tok/s1.84B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[EXAONE 3.5 2.4B Instruct](/fit/lgai-exaone-exaone-3-5-2-4b-instruct)

Chat · lgai-exaone

Q8\_0Excellent

3.2 GB13% of RAM~67 tok/s2.41B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[SDAR 1.7B Chat](/fit/jetlm-sdar-1-7b-chat)

Chat · jetlm

Q8\_0Excellent

2.8 GB12% of RAM~79 tok/s2.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Gemma 4 31B JANG\_4M CRACK](/fit/dealignai-gemma-4-31b-jang_4m-crack)

General · dealignai

Q8\_0Excellent

7.7 GB32% of RAM~25 tok/s6.43B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 4B Instruct FP8](/fit/qwen-qwen3-vl-4b-instruct-fp8)

Chat · Alibaba

Q8\_0Excellent

5.9 GB25% of RAM~33 tok/s4.83B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[tiny\_starcoder\_py](/fit/bigcode-tiny_starcoder_py)

Coding · BigCode

Q8\_0Excellent

0.7 GB3% of RAM~1,005 tok/s0.16B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 VL 3B Instruct quantized.w8a8](/fit/redhatai-qwen2-5-vl-3b-instruct-quantized-w8a8)

Chat · redhatai

Q8\_0Excellent

5.0 GB21% of RAM~40 tok/s4.07B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2 VL 2B Instruct AWQ](/fit/qwen-qwen2-vl-2b-instruct-awq)

Chat · Alibaba

Q8\_0Excellent

3.2 GB13% of RAM~66 tok/s2.44B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Mistral Small 3.1 24B Instruct 2503 GPTQ 4b 128g](/fit/ista-daslab-mistral-small-3-1-24b-instruct-2503-gptq-4b-128g)

Chat · ista-daslab

Q8\_0Excellent

5.8 GB24% of RAM~34 tok/s4.73B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qari OCR v0.3 VL 2B Instruct](/fit/namaa-space-qari-ocr-v0-3-vl-2b-instruct)

Chat · namaa-space

Q8\_0Excellent

3.0 GB12% of RAM~73 tok/s2.21B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 VL 3B Instruct FP8 dynamic](/fit/redhatai-qwen2-5-vl-3b-instruct-fp8-dynamic)

Chat · redhatai

Q8\_0Excellent

5.0 GB21% of RAM~40 tok/s4.07B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Huihui Qwen3 VL 4B Instruct abliterated](/fit/huihui-ai-huihui-qwen3-vl-4b-instruct-abliterated)

Chat · huihui-ai

Q8\_0Excellent

5.5 GB23% of RAM~36 tok/s4.44B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 4B Instruct Unredacted MAX](/fit/prithivmlmods-qwen3-vl-4b-instruct-unredacted-max)

Chat · prithivmlmods

Q8\_0Excellent

5.5 GB23% of RAM~36 tok/s4.44B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2 VL OCR2 2B Instruct](/fit/prithivmlmods-qwen2-vl-ocr2-2b-instruct)

Chat · prithivmlmods

Q8\_0Excellent

3.0 GB12% of RAM~73 tok/s2.21B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 3B Instruct](/fit/qwen-qwen2-5-3b-instruct)

Chat · Alibaba

Q8\_0Excellent

3.9 GB16% of RAM~52 tok/s3.09B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[moondream2](/fit/vikhyatk-moondream2)

General · vikhyatk

Q8\_0Excellent

2.7 GB11% of RAM~83 tok/s1.93B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama 3.2 1B](/fit/meta-llama-llama-3-2-1b)

General · Meta · 2024-09-18

Q8\_0Excellent

1.9 GB8% of RAM~130 tok/s1.24B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL2 2B](/fit/opengvlab-internvl2-2b)

General · opengvlab

Q8\_0Excellent

3.0 GB12% of RAM~73 tok/s2.21B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Florence 2 large](/fit/microsoft-florence-2-large)

General · Microsoft

Q8\_0Excellent

1.4 GB6% of RAM~206 tok/s0.78B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[h2ovl mississippi 800m](/fit/h2oai-h2ovl-mississippi-800m)

General · h2oai

Q8\_0Excellent

1.4 GB6% of RAM~194 tok/s0.83B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[h2ovl mississippi 2b](/fit/h2oai-h2ovl-mississippi-2b)

General · h2oai

Q8\_0Excellent

2.9 GB12% of RAM~75 tok/s2.15B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 Math 1.5B](/fit/qwen-qwen2-5-math-1-5b)

General · Alibaba

Q8\_0Excellent

2.2 GB9% of RAM~104 tok/s1.54B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[PowerMoE 3b](/fit/ibm-research-powermoe-3b)

General · ibm-research

Q8\_0Excellent

4.3 GB18% of RAM~199 tok/s3.37B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[t5gemma s s prefixlm](/fit/google-t5gemma-s-s-prefixlm)

General · Google

Q8\_0Excellent

0.8 GB4% of RAM~519 tok/s0.31B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL2 1B](/fit/opengvlab-internvl2-1b)

General · opengvlab

Q8\_0Excellent

1.5 GB6% of RAM~171 tok/s0.94B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llava onevision qwen2 0.5b ov hf](/fit/llava-hf-llava-onevision-qwen2-0-5b-ov-hf)

General · llava-hf

Q8\_0Excellent

1.5 GB6% of RAM~181 tok/s0.89B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[NVIDIA Nemotron Nano 9B v2](/fit/nvidia-nvidia-nemotron-nano-9b-v2)

General · nvidia · 2025-08-12

Q8\_0Excellent

10.4 GB43% of RAM~18 tok/s8.89B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[OLMo 2 0425 1B](/fit/allenai-olmo-2-0425-1b)

General · allenai

Q8\_0Excellent

2.2 GB9% of RAM~109 tok/s1.48B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 2 2b it](/fit/google-gemma-2-2b-it)

General · Google · 2024-07-16

Q8\_0Excellent

3.4 GB14% of RAM~62 tok/s2.61B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[kanana 1.5 v 3b instruct](/fit/kakaocorp-kanana-1-5-v-3b-instruct)

Chat · kakaocorp

Q8\_0Excellent

4.6 GB19% of RAM~44 tok/s3.67B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[mamba 130m hf](/fit/state-spaces-mamba-130m-hf)

General · state-spaces

Q8\_0Excellent

0.6 GB3% of RAM~1,237 tok/s0.13B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[bloom 560m](/fit/bigscience-bloom-560m)

General · bigscience

Q8\_0Excellent

1.1 GB5% of RAM~287 tok/s0.56B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[paligemma 3b mix 224](/fit/google-paligemma-3b-mix-224)

General · Google

Q8\_0Excellent

3.8 GB16% of RAM~55 tok/s2.92B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 2b](/fit/google-gemma-2b)

General · Google

Q8\_0Excellent

3.3 GB14% of RAM~64 tok/s2.51B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[texify](/fit/vikp-texify)

General · vikp

Q8\_0Excellent

0.8 GB4% of RAM~519 tok/s0.31B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL3 1B](/fit/opengvlab-internvl3-1b)

General · opengvlab

Q8\_0Excellent

1.5 GB6% of RAM~171 tok/s0.94B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[udop large](/fit/microsoft-udop-large)

General · Microsoft

Q8\_0Excellent

1.3 GB6% of RAM~217 tok/s0.74B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[GA\_Guard\_Lite](/fit/generalanalysis-ga_guard_lite)

General · generalanalysis

Q8\_0Excellent

1.2 GB5% of RAM~268 tok/s0.6B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[tiny random internvl2](/fit/optimum-intel-internal-testing-tiny-random-internvl2)

General · optimum-intel-internal-testing

Q8\_0Excellent

0.5 GB2% of RAM~8,040 tok/s0.02B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 1.1 2b it](/fit/google-gemma-1-1-2b-it)

General · Google

Q8\_0Excellent

3.3 GB14% of RAM~64 tok/s2.51B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 8B speculator.eagle3](/fit/redhatai-qwen3-8b-speculator-eagle3)

General · redhatai

Q8\_0Excellent

1.6 GB7% of RAM~158 tok/s1.02B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[t5gemma 2 1b 1b](/fit/google-t5gemma-2-1b-1b)

General · Google

Q8\_0Excellent

2.9 GB12% of RAM~76 tok/s2.12B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 3 270m](/fit/google-gemma-3-270m)

General · Google

Q8\_0Excellent

0.8 GB3% of RAM~596 tok/s0.27B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Minnow Math 1.5B](/fit/kitefishai-minnow-math-1-5b)

General · kitefishai

Q8\_0Excellent

2.3 GB10% of RAM~99 tok/s1.63B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL2\_5 2B](/fit/opengvlab-internvl2_5-2b)

General · opengvlab

Q8\_0Excellent

3.0 GB12% of RAM~73 tok/s2.21B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama Guard 3 1B](/fit/meta-llama-llama-guard-3-1b)

General · Meta

Q8\_0Excellent

2.2 GB9% of RAM~107 tok/s1.5B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[bloom 1b7](/fit/bigscience-bloom-1b7)

General · bigscience

Q8\_0Excellent

2.4 GB10% of RAM~93 tok/s1.72B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL3\_5 GPT OSS 20B A4B Preview](/fit/opengvlab-internvl3_5-gpt-oss-20b-a4b-preview)

General · opengvlab

Q8\_0Excellent

0.9 GB4% of RAM~412 tok/s0.39B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 9B NVFP4](/fit/axionml-qwen3-5-9b-nvfp4)

General · axionml

Q8\_0Excellent

7.9 GB33% of RAM~24 tok/s6.63B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Ovis2 1B](/fit/aidc-ai-ovis2-1b)

General · aidc-ai

Q8\_0Excellent

1.9 GB8% of RAM~127 tok/s1.27B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Florence 2 large](/fit/florence-community-florence-2-large)

General · florence-community

Q8\_0Excellent

1.4 GB6% of RAM~206 tok/s0.78B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[stablelm 3b 4e1t](/fit/stabilityai-stablelm-3b-4e1t)

General · Stability AI

Q8\_0Excellent

3.6 GB15% of RAM~57 tok/s2.8B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[SmolVLM Instruct](/fit/huggingfacetb-smolvlm-instruct)

Chat · huggingfacetb

Q8\_0Excellent

3.0 GB13% of RAM~71 tok/s2.25B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 122B A10B heretic int4 AutoRound](/fit/happypatrick-qwen3-5-122b-a10b-heretic-int4-autoround)

General · happypatrick

Q6\_KExcellent

16.5 GB69% of RAM~143 tok/s18.54B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 2 2b jpn it](/fit/google-gemma-2-2b-jpn-it)

General · Google

Q8\_0Excellent

3.4 GB14% of RAM~62 tok/s2.61B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[GOT OCR 2.0 hf](/fit/stepfun-ai-got-ocr-2-0-hf)

General · stepfun-ai

Q8\_0Excellent

1.1 GB5% of RAM~287 tok/s0.56B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[falcon mamba tiny dev](/fit/tiiuae-falcon-mamba-tiny-dev)

General · TII

Q8\_0Excellent

0.5 GB2% of RAM~16,081 tok/s0.01B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL2\_5 1B](/fit/opengvlab-internvl2_5-1b)

General · opengvlab

Q8\_0Excellent

1.5 GB6% of RAM~171 tok/s0.94B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[t5gemma 2 270m 270m](/fit/google-t5gemma-2-270m-270m)

General · Google

Q8\_0Excellent

1.4 GB6% of RAM~204 tok/s0.79B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 3 27b it quantized.w4a16](/fit/redhatai-gemma-3-27b-it-quantized-w4a16)

General · redhatai

Q8\_0Excellent

7.9 GB33% of RAM~24 tok/s6.64B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[paligemma 3b mix 224](/fit/fal-paligemma-3b-mix-224)

General · fal

Q8\_0Excellent

3.8 GB16% of RAM~55 tok/s2.92B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[moondream 2b 2025 04 14 4bit](/fit/moondream-moondream-2b-2025-04-14-4bit)

General · moondream

Q8\_0Excellent

2.0 GB8% of RAM~123 tok/s1.31B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[chartgemma](/fit/ahmed-masry-chartgemma)

General · ahmed-masry

Q8\_0Excellent

3.8 GB16% of RAM~55 tok/s2.92B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Perception LM 1B](/fit/facebook-perception-lm-1b)

General · facebook

Q8\_0Excellent

2.2 GB9% of RAM~105 tok/s1.53B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[glm edge v 2b](/fit/zai-org-glm-edge-v-2b)

General · zai-org

Q8\_0Excellent

2.8 GB12% of RAM~78 tok/s2.07B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Vintern 1B v3\_5](/fit/5cd-ai-vintern-1b-v3_5)

General · 5cd-ai

Q8\_0Excellent

1.5 GB6% of RAM~171 tok/s0.94B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 Coder 7B Instruct](/fit/qwen-qwen2-5-coder-7b-instruct)

Coding · Alibaba · 2024-09-17

Q8\_0Excellent

9.0 GB38% of RAM~21 tok/s7.62B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama 3.2 3B](/fit/meta-llama-llama-3-2-3b)

General · Meta · 2024-09-18

Q8\_0Excellent

4.1 GB17% of RAM~50 tok/s3.21B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[bloomz 560m](/fit/bigscience-bloomz-560m)

General · bigscience

Q8\_0Excellent

1.1 GB5% of RAM~287 tok/s0.56B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[pythia 70m deduped](/fit/eleutherai-pythia-70m-deduped)

General · eleutherai

Q8\_0Excellent

0.6 GB3% of RAM~1,608 tok/s0.1B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[SmolLM2 135M Instruct](/fit/huggingfacetb-smollm2-135m-instruct)

Chat · huggingfacetb

Q8\_0Excellent

0.6 GB3% of RAM~1,237 tok/s0.13B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gpt neo 125m](/fit/eleutherai-gpt-neo-125m)

General · eleutherai

Q8\_0Excellent

0.7 GB3% of RAM~1,072 tok/s0.15B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[blip2 opt 2.7b](/fit/salesforce-blip2-opt-2-7b)

General · salesforce

Q8\_0Excellent

4.7 GB19% of RAM~43 tok/s3.74B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[SmolVLM 256M Instruct](/fit/huggingfacetb-smolvlm-256m-instruct)

Chat · huggingfacetb

Q8\_0Excellent

0.8 GB3% of RAM~618 tok/s0.26B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[DeepSeek V2 Lite](/fit/deepseek-ai-deepseek-v2-lite)

General · DeepSeek

Q8\_0Excellent

18.0 GB75% of RAM~74 tok/s15.71B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[japanese gpt neox small](/fit/rinna-japanese-gpt-neox-small)

General · rinna

Q8\_0Excellent

0.7 GB3% of RAM~804 tok/s0.2B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen1.5 MoE A2.7B](/fit/qwen-qwen1-5-moe-a2-7b)

General · Alibaba

Q8\_0Excellent

16.5 GB69% of RAM~99 tok/s14.32B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[pythia 410m](/fit/eleutherai-pythia-410m)

General · eleutherai

Q8\_0Excellent

1.1 GB4% of RAM~315 tok/s0.51B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[SmolLM 135M](/fit/huggingfacetb-smollm-135m)

General · huggingfacetb

Q8\_0Excellent

0.6 GB3% of RAM~1,237 tok/s0.13B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[opt 125m](/fit/peft-internal-testing-opt-125m)

General · peft-internal-testing

Q8\_0Excellent

0.6 GB3% of RAM~1,237 tok/s0.13B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[SmolVLM2 256M Video Instruct](/fit/huggingfacetb-smolvlm2-256m-video-instruct)

Chat · huggingfacetb

Q8\_0Excellent

0.8 GB3% of RAM~618 tok/s0.26B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[pythia 1.4b](/fit/eleutherai-pythia-1-4b)

General · eleutherai

Q8\_0Excellent

2.2 GB9% of RAM~106 tok/s1.52B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gpt sw3 126m](/fit/ai-sweden-models-gpt-sw3-126m)

General · ai-sweden-models

Q8\_0Excellent

0.7 GB3% of RAM~846 tok/s0.19B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[pythia 14m](/fit/eleutherai-pythia-14m)

General · eleutherai

Q8\_0Excellent

0.5 GB2% of RAM~16,081 tok/s0.01B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[pythia 160m deduped](/fit/eleutherai-pythia-160m-deduped)

General · eleutherai

Q8\_0Excellent

0.7 GB3% of RAM~766 tok/s0.21B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[pythia 1b](/fit/eleutherai-pythia-1b)

General · eleutherai

Q8\_0Excellent

1.7 GB7% of RAM~149 tok/s1.08B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 Coder 7B](/fit/qwen-qwen2-5-coder-7b)

Coding · Alibaba

Q8\_0Excellent

9.0 GB38% of RAM~21 tok/s7.62B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[phi 1\_5](/fit/microsoft-phi-1_5)

General · Microsoft

Q8\_0Excellent

2.1 GB9% of RAM~113 tok/s1.42B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[DeepSeek Coder V2 Lite Instruct FP8](/fit/redhatai-deepseek-coder-v2-lite-instruct-fp8)

Coding · redhatai

Q8\_0Excellent

18.0 GB75% of RAM~74 tok/s15.71B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[SmolLM 1.7B](/fit/huggingfacetb-smollm-1-7b)

General · huggingfacetb

Q8\_0Excellent

2.4 GB10% of RAM~94 tok/s1.71B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llama 160m](/fit/jackfram-llama-160m)

General · jackfram

Q8\_0Excellent

0.7 GB3% of RAM~1,005 tok/s0.16B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[PowerLM 3b](/fit/ibm-research-powerlm-3b)

General · ibm-research

Q8\_0Excellent

4.4 GB18% of RAM~46 tok/s3.51B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[paligemma2 3b pt 448](/fit/google-paligemma2-3b-pt-448)

General · Google

Q8\_0Excellent

3.9 GB16% of RAM~53 tok/s3.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[SmolVLM 500M Instruct](/fit/huggingfacetb-smolvlm-500m-instruct)

Chat · huggingfacetb

Q8\_0Excellent

1.1 GB4% of RAM~315 tok/s0.51B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[tiny aya global](/fit/coherelabs-tiny-aya-global)

General · coherelabs

Q8\_0Excellent

4.2 GB18% of RAM~48 tok/s3.35B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[DanTagGen delta rev2](/fit/kblueleaf-dantaggen-delta-rev2)

General · kblueleaf

Q8\_0Excellent

0.9 GB4% of RAM~412 tok/s0.39B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gpt neo 1.3B](/fit/eleutherai-gpt-neo-1-3b)

General · eleutherai

Q8\_0Excellent

2.0 GB8% of RAM~117 tok/s1.37B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[pythia 31m](/fit/eleutherai-pythia-31m)

General · eleutherai

Q8\_0Excellent

0.5 GB2% of RAM~5,360 tok/s0.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Ristretto 3B](/fit/liautoad-ristretto-3b)

General · liautoad

Q8\_0Excellent

4.8 GB20% of RAM~42 tok/s3.84B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[h2o danube3 500m chat](/fit/h2oai-h2o-danube3-500m-chat)

Chat · h2oai

Q8\_0Excellent

1.1 GB4% of RAM~315 tok/s0.51B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[paligemma2 3b ft docci 448](/fit/google-paligemma2-3b-ft-docci-448)

General · Google

Q8\_0Excellent

3.9 GB16% of RAM~53 tok/s3.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[paligemma2 3b mix 224](/fit/google-paligemma2-3b-mix-224)

General · Google

Q8\_0Excellent

3.9 GB16% of RAM~53 tok/s3.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[tinyllama oneshot w8w8 test static shape change](/fit/nm-testing-tinyllama-oneshot-w8w8-test-static-shape-change)

General · nm-testing

Q8\_0Excellent

1.7 GB7% of RAM~146 tok/s1.1B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[deepseek vl2 tiny](/fit/isotr0py-deepseek-vl2-tiny)

General · isotr0py

Q8\_0Excellent

4.3 GB18% of RAM~48 tok/s3.37B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[TIPO 500M ft](/fit/kblueleaf-tipo-500m-ft)

General · kblueleaf

Q8\_0Excellent

1.1 GB4% of RAM~315 tok/s0.51B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[OLMo 1B hf](/fit/allenai-olmo-1b-hf)

General · allenai

Q8\_0Excellent

1.8 GB8% of RAM~136 tok/s1.18B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[pythia 410m deduped](/fit/eleutherai-pythia-410m-deduped)

General · eleutherai

Q8\_0Excellent

1.1 GB4% of RAM~315 tok/s0.51B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL2\_5 4B MPO](/fit/opengvlab-internvl2_5-4b-mpo)

General · opengvlab

Q8\_0Excellent

4.6 GB19% of RAM~43 tok/s3.71B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL2\_5 4B](/fit/opengvlab-internvl2_5-4b)

General · opengvlab

Q8\_0Excellent

4.6 GB19% of RAM~43 tok/s3.71B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Mono InternVL 2B](/fit/opengvlab-mono-internvl-2b)

General · opengvlab

Q8\_0Excellent

4.0 GB17% of RAM~52 tok/s3.11B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Vintern 3B R beta](/fit/5cd-ai-vintern-3b-r-beta)

General · 5cd-ai

Q8\_0Excellent

4.6 GB19% of RAM~43 tok/s3.71B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Crow 9B Opus 4.6 Distill Heretic\_Qwen3.5 NVFP4](/fit/rhoninseiei-crow-9b-opus-4-6-distill-heretic_qwen3-5-nvfp4)

General · rhoninseiei

Q8\_0Excellent

8.6 GB36% of RAM~22 tok/s7.3B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 8B](/fit/qwen-qwen3-8b)

General · Alibaba · 2025-04-27

Q8\_0Excellent

9.6 GB40% of RAM~20 tok/s8.19B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[phi 2](/fit/microsoft-phi-2)

General · Microsoft

Q8\_0Excellent

3.6 GB15% of RAM~58 tok/s2.78B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 7B](/fit/qwen-qwen2-5-7b)

General · Alibaba

Q8\_0Excellent

9.0 GB38% of RAM~21 tok/s7.62B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Florence 2 base](/fit/microsoft-florence-2-base)

General · Microsoft

Q8\_0Excellent

0.8 GB3% of RAM~699 tok/s0.23B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[DeepSeek Coder V2 Lite Instruct](/fit/deepseek-ai-deepseek-coder-v2-lite-instruct)

Coding · DeepSeek · 2024-06-14

Q8\_0Excellent

18.0 GB75% of RAM~67 tok/s15.71B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gpt neo 2.7B](/fit/eleutherai-gpt-neo-2-7b)

General · eleutherai

Q8\_0Excellent

3.5 GB15% of RAM~59 tok/s2.72B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[CodeLlama 7b Instruct hf](/fit/codellama-codellama-7b-instruct-hf)

Coding · codellama

Q8\_0Excellent

8.0 GB33% of RAM~24 tok/s6.74B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 3 4b pt](/fit/google-gemma-3-4b-pt)

General · Google

Q8\_0Excellent

5.3 GB22% of RAM~37 tok/s4.3B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[pythia 2.8b](/fit/eleutherai-pythia-2-8b)

General · eleutherai

Q8\_0Excellent

3.7 GB16% of RAM~55 tok/s2.91B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[sqlcoder 7b 2](/fit/defog-sqlcoder-7b-2)

Coding · defog

Q8\_0Excellent

8.0 GB33% of RAM~24 tok/s6.74B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Ovis2 4B](/fit/aidc-ai-ovis2-4b)

General · aidc-ai

Q8\_0Excellent

5.7 GB24% of RAM~35 tok/s4.62B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2 7B](/fit/qwen-qwen2-7b)

General · Alibaba

Q8\_0Excellent

9.0 GB38% of RAM~21 tok/s7.62B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[deepseek coder 6.7b instruct](/fit/deepseek-ai-deepseek-coder-6-7b-instruct)

Coding · DeepSeek

Q8\_0Excellent

8.0 GB33% of RAM~24 tok/s6.74B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[CodeLlama 7b hf](/fit/codellama-codellama-7b-hf)

Coding · codellama

Q8\_0Excellent

8.0 GB33% of RAM~24 tok/s6.74B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[DialoGPT small](/fit/microsoft-dialogpt-small)

General · Microsoft

Q8\_0Excellent

0.7 GB3% of RAM~893 tok/s0.18B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[deepseek coder 6.7b base](/fit/deepseek-ai-deepseek-coder-6-7b-base)

Coding · DeepSeek

Q8\_0Excellent

8.0 GB33% of RAM~24 tok/s6.74B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Florence 2 base ft](/fit/microsoft-florence-2-base-ft)

General · Microsoft

Q8\_0Excellent

0.8 GB3% of RAM~699 tok/s0.23B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Florence 2 large ft](/fit/microsoft-florence-2-large-ft)

General · Microsoft

Q8\_0Excellent

1.4 GB6% of RAM~209 tok/s0.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qianfan OCR](/fit/baidu-qianfan-ocr)

General · baidu

Q8\_0Excellent

5.8 GB24% of RAM~34 tok/s4.74B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Ovis1.6 Llama3.2 3B](/fit/aidc-ai-ovis1-6-llama3-2-3b)

General · aidc-ai

Q8\_0Excellent

5.1 GB21% of RAM~39 tok/s4.14B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Tiny LLM](/fit/arnir0-tiny-llm)

General · arnir0

Q8\_0Excellent

0.5 GB2% of RAM~16,081 tok/s0.01B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Florence 2 SD3 Captioner](/fit/gokaygokay-florence-2-sd3-captioner)

General · gokaygokay

Q8\_0Excellent

0.8 GB3% of RAM~596 tok/s0.27B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Florence 2 Flux](/fit/gokaygokay-florence-2-flux)

General · gokaygokay

Q8\_0Excellent

0.8 GB3% of RAM~596 tok/s0.27B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Florence 2 base](/fit/florence-community-florence-2-base)

General · florence-community

Q8\_0Excellent

0.8 GB3% of RAM~699 tok/s0.23B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[TF ID large](/fit/yifeihu-tf-id-large)

General · yifeihu

Q8\_0Excellent

1.4 GB6% of RAM~196 tok/s0.82B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL3\_5 4B MPO](/fit/opengvlab-internvl3_5-4b-mpo)

General · opengvlab

Q8\_0Excellent

5.8 GB24% of RAM~34 tok/s4.73B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL2 4B](/fit/opengvlab-internvl2-4b)

General · opengvlab

Q8\_0Excellent

5.1 GB21% of RAM~39 tok/s4.15B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 9B NVFP4](/fit/apolo13x-qwen3-5-9b-nvfp4)

General · apolo13x

Q8\_0Excellent

8.9 GB37% of RAM~21 tok/s7.54B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Mistral 7B v0.1](/fit/mistralai-mistral-7b-v0-1)

General · Mistral AI

Q8\_0Excellent

8.6 GB36% of RAM~22 tok/s7.24B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[stories15M\_MOE](/fit/ggml-org-stories15m_moe)

General · ggml-org

Q8\_0Excellent

0.5 GB2% of RAM~8,425 tok/s0.04B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[olmOCR 2 7B 1025](/fit/allenai-olmocr-2-7b-1025)

General · allenai

Q8\_0Excellent

9.7 GB41% of RAM~19 tok/s8.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Meta Llama 3.1 8B FP8](/fit/redhatai-meta-llama-3-1-8b-fp8)

General · redhatai

Q8\_0Excellent

9.5 GB39% of RAM~20 tok/s8.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[RolmOCR](/fit/reducto-rolmocr)

General · reducto

Q8\_0Excellent

9.7 GB41% of RAM~19 tok/s8.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 30B A3B NVFP4](/fit/nvidia-qwen3-30b-a3b-nvfp4)

General · nvidia

Q8\_0Excellent

17.9 GB75% of RAM~94 tok/s15.58B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[olmOCR 2 7B 1025 FP8](/fit/allenai-olmocr-2-7b-1025-fp8)

General · allenai

Q8\_0Excellent

9.7 GB41% of RAM~19 tok/s8.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama 3.1 Nemotron Nano 8B v1](/fit/nvidia-llama-3-1-nemotron-nano-8b-v1)

General · nvidia

Q8\_0Excellent

9.5 GB39% of RAM~20 tok/s8.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Olmo 3 1025 7B](/fit/allenai-olmo-3-1025-7b)

General · allenai

Q8\_0Excellent

8.6 GB36% of RAM~22 tok/s7.3B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[UI TARS 1.5 7B](/fit/bytedance-seed-ui-tars-1-5-7b)

General · bytedance-seed

Q8\_0Excellent

9.7 GB41% of RAM~19 tok/s8.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Apriel 5B Instruct](/fit/servicenow-ai-apriel-5b-instruct)

Chat · servicenow-ai

Q8\_0Excellent

5.9 GB25% of RAM~33 tok/s4.83B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[T5\_Paraphrase\_Paws](/fit/vamsi-t5_paraphrase_paws)

General · vamsi

Q8\_0Excellent

0.7 GB3% of RAM~731 tok/s0.22B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Hermes 3 Llama 3.1 8B](/fit/nousresearch-hermes-3-llama-3-1-8b)

General · NousResearch

Q8\_0Excellent

9.5 GB39% of RAM~20 tok/s8.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gpt2 mini](/fit/erwanf-gpt2-mini)

General · erwanf

Q8\_0Excellent

0.5 GB2% of RAM~4,020 tok/s0.04B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[NuExtract 2.0 8B](/fit/numind-nuextract-2-0-8b)

General · numind

Q8\_0Excellent

9.7 GB41% of RAM~19 tok/s8.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[OpenCUA 7B](/fit/xlangai-opencua-7b)

General · xlangai

Q8\_0Excellent

9.7 GB41% of RAM~19 tok/s8.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[MiMo VL 7B RL 2508](/fit/xiaomimimo-mimo-vl-7b-rl-2508)

General · xiaomimimo

Q8\_0Excellent

9.8 GB41% of RAM~19 tok/s8.31B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[xLAM 7b r](/fit/salesforce-xlam-7b-r)

General · salesforce

Q8\_0Excellent

8.6 GB36% of RAM~22 tok/s7.24B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[prometheus 7b v2.0](/fit/prometheus-eval-prometheus-7b-v2-0)

General · prometheus-eval

Q8\_0Excellent

8.6 GB36% of RAM~22 tok/s7.24B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[TreeVGR 7B CI](/fit/haochenwang-treevgr-7b-ci)

General · haochenwang

Q8\_0Excellent

9.7 GB41% of RAM~19 tok/s8.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Flux Prompt Enhance](/fit/gokaygokay-flux-prompt-enhance)

General · gokaygokay

Q8\_0Excellent

0.7 GB3% of RAM~731 tok/s0.22B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[NeuralMonarch 7B](/fit/mlabonne-neuralmonarch-7b)

General · mlabonne

Q8\_0Excellent

8.6 GB36% of RAM~22 tok/s7.24B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[GritLM 7B vllm](/fit/parasail-ai-gritlm-7b-vllm)

General · parasail-ai

Q8\_0Excellent

8.6 GB36% of RAM~22 tok/s7.24B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[shisa gamma 7b v1](/fit/augmxnt-shisa-gamma-7b-v1)

General · augmxnt

Q8\_0Excellent

8.6 GB36% of RAM~22 tok/s7.24B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llama2.c stories15M](/fit/xenova-llama2-c-stories15m)

General · xenova

Q8\_0Excellent

0.5 GB2% of RAM~8,040 tok/s0.02B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[OctoMed 7B](/fit/octomed-octomed-7b)

General · octomed

Q8\_0Excellent

9.7 GB41% of RAM~19 tok/s8.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[typhoon ocr 7b](/fit/typhoon-ai-typhoon-ocr-7b)

General · typhoon-ai

Q8\_0Excellent

9.7 GB41% of RAM~19 tok/s8.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[DiegoCropper](/fit/singh8898-diegocropper)

General · singh8898

Q8\_0Excellent

9.7 GB41% of RAM~19 tok/s8.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[olmOCR 7B 0825 FP8](/fit/allenai-olmocr-7b-0825-fp8)

General · allenai

Q8\_0Excellent

9.7 GB41% of RAM~19 tok/s8.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama 3.2 1B Instruct](/fit/meta-llama-llama-3-2-1b-instruct)

Chat · Meta

Q8\_0Excellent

1.9 GB8% of RAM~130 tok/s1.24B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Phi tiny MoE instruct](/fit/microsoft-phi-tiny-moe-instruct)

Chat · Microsoft

Q8\_0Excellent

4.7 GB20% of RAM~254 tok/s3.76B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llava v1.6 mistral 7b hf](/fit/llava-hf-llava-v1-6-mistral-7b-hf)

General · llava-hf

Q8\_0Excellent

8.9 GB37% of RAM~21 tok/s7.57B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[NVIDIA Nemotron Nano 9B v2 Japanese](/fit/nvidia-nvidia-nemotron-nano-9b-v2-japanese)

General · nvidia

Q8\_0Excellent

10.4 GB43% of RAM~18 tok/s8.89B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LFM2 24B A2B](/fit/lmstudio-community-lfm2-24b-a2b-mlx-4bit)

General · lmstudio-community

mlx-4bitGreat

15.2 GB63% of RAM~118 tok/s23.84B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 3n E4B it MLX bf16](/fit/lmstudio-community-gemma-3n-e4b-it-mlx-bf16)

General · lmstudio-community

mlx-8bitExcellent

8.9 GB37% of RAM~22 tok/s7.85B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL3\_5 1B Instruct](/fit/opengvlab-internvl3_5-1b-instruct)

Chat · opengvlab

Q8\_0Excellent

1.7 GB7% of RAM~152 tok/s1.06B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[SWE agent LM 7B](/fit/swe-bench-swe-agent-lm-7b)

General · swe-bench

Q8\_0Excellent

9.0 GB38% of RAM~21 tok/s7.62B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[chandra](/fit/datalab-to-chandra)

General · datalab-to

Q8\_0Excellent

10.3 GB43% of RAM~18 tok/s8.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[zephyr 7b beta](/fit/huggingfaceh4-zephyr-7b-beta)

General · HuggingFace · 2023-10-26

Q8\_0Excellent

8.6 GB36% of RAM~22 tok/s7.24B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Phi mini MoE instruct](/fit/microsoft-phi-mini-moe-instruct)

Chat · Microsoft

Q8\_0Excellent

9.0 GB38% of RAM~125 tok/s7.65B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 Math 1.5B Instruct](/fit/qwen-qwen2-5-math-1-5b-instruct)

Chat · Alibaba

Q8\_0Excellent

2.2 GB9% of RAM~104 tok/s1.54B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen1.5 7B](/fit/qwen-qwen1-5-7b)

General · Alibaba

Q8\_0Excellent

9.1 GB38% of RAM~21 tok/s7.72B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[NVIDIA Nemotron Nano 9B v2 Base](/fit/nvidia-nvidia-nemotron-nano-9b-v2-base)

General · nvidia

Q8\_0Excellent

10.4 GB43% of RAM~18 tok/s8.89B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[NVIDIA Nemotron Nano 9B v2 FP8](/fit/nvidia-nvidia-nemotron-nano-9b-v2-fp8)

General · nvidia

Q8\_0Excellent

10.4 GB43% of RAM~18 tok/s8.89B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Zamba2 1.2B instruct](/fit/zyphra-zamba2-1-2b-instruct)

Chat · zyphra

Q8\_0Excellent

1.9 GB8% of RAM~132 tok/s1.22B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen 7B](/fit/qwen-qwen-7b)

General · Alibaba

Q8\_0Excellent

9.1 GB38% of RAM~21 tok/s7.72B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 8B Thinking](/fit/qwen-qwen3-vl-8b-thinking)

General · Alibaba

Q8\_0Excellent

10.3 GB43% of RAM~18 tok/s8.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[blip2 flan t5 xl](/fit/salesforce-blip2-flan-t5-xl)

General · salesforce

Q8\_0Excellent

4.9 GB20% of RAM~41 tok/s3.94B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llama joycaption beta one hf llava](/fit/fancyfeast-llama-joycaption-beta-one-hf-llava)

General · fancyfeast

Q8\_0Excellent

10.0 GB41% of RAM~19 tok/s8.48B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[OLMoE 1B 7B 0125 Instruct](/fit/allenai-olmoe-1b-7b-0125-instruct)

Chat · allenai

Q8\_0Excellent

8.2 GB34% of RAM~138 tok/s6.92B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Molmo2 O 7B](/fit/allenai-molmo2-o-7b)

General · allenai

Q8\_0Excellent

9.2 GB38% of RAM~21 tok/s7.76B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[VulnLLM R 7B](/fit/virtue-ai-hub-vulnllm-r-7b)

General · virtue-ai-hub

Q8\_0Excellent

9.0 GB38% of RAM~21 tok/s7.62B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[OLMo 2 0425 1B Instruct](/fit/allenai-olmo-2-0425-1b-instruct)

Chat · allenai

Q8\_0Excellent

2.2 GB9% of RAM~109 tok/s1.48B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Abliterated Llama 3.2 1B Instruct](/fit/cazzz307-abliterated-llama-3-2-1b-instruct)

Chat · cazzz307

Q8\_0Excellent

1.9 GB8% of RAM~130 tok/s1.24B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[starcoder2 7b](/fit/bigcode-starcoder2-7b)

Coding · BigCode · 2024-02-20

Q8\_0Excellent

8.5 GB35% of RAM~22 tok/s7.17B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llava v1.6 mistral 7b](/fit/liuhaotian-llava-v1-6-mistral-7b)

General · liuhaotian

Q8\_0Excellent

8.9 GB37% of RAM~21 tok/s7.57B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llava med v1.5 mistral 7b](/fit/microsoft-llava-med-v1-5-mistral-7b)

General · Microsoft

Q8\_0Excellent

8.9 GB37% of RAM~21 tok/s7.57B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[ZwZ 8B](/fit/inclusionai-zwz-8b)

General · inclusionai

Q8\_0Excellent

10.3 GB43% of RAM~18 tok/s8.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[bakLlava v1 hf](/fit/llava-hf-bakllava-v1-hf)

General · llava-hf

Q8\_0Excellent

8.9 GB37% of RAM~21 tok/s7.57B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 8B Thinking FP8](/fit/qwen-qwen3-vl-8b-thinking-fp8)

General · Alibaba

Q8\_0Excellent

10.3 GB43% of RAM~18 tok/s8.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[deepseek vl 1.3b chat](/fit/deepseek-ai-deepseek-vl-1-3b-chat)

Chat · DeepSeek

Q8\_0Excellent

2.7 GB11% of RAM~81 tok/s1.98B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[MedMO 8B Next](/fit/mbzuai-medmo-8b-next)

General · mbzuai

Q8\_0Excellent

10.3 GB43% of RAM~18 tok/s8.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[next ocr](/fit/thelamapi-next-ocr)

General · thelamapi

Q8\_0Excellent

10.3 GB43% of RAM~18 tok/s8.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Ling lite](/fit/inclusionai-ling-lite)

General · inclusionai · 2025-02-28

Q8\_0Excellent

19.2 GB80% of RAM~69 tok/s16.8B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama 3.2 3B Instruct](/fit/meta-llama-llama-3-2-3b-instruct)

Chat · Meta

Q8\_0Excellent

4.1 GB17% of RAM~50 tok/s3.21B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 VL 7B Instruct](/fit/qwen-qwen2-5-vl-7b-instruct)

Chat · Alibaba · 2025-01-26

Q8\_0Excellent

9.7 GB41% of RAM~19 tok/s8.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 8B Base](/fit/qwen-qwen3-8b-base)

General · Alibaba

Q8\_0Excellent

9.6 GB40% of RAM~20 tok/s8.19B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Phi 3 mini 4k instruct gptq 4bit](/fit/kaitchup-phi-3-mini-4k-instruct-gptq-4bit)

Chat · kaitchup

Q8\_0Excellent

4.8 GB20% of RAM~42 tok/s3.82B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[DeepSeek V2 Lite Chat](/fit/deepseek-ai-deepseek-v2-lite-chat)

Chat · DeepSeek

Q8\_0Excellent

18.0 GB75% of RAM~74 tok/s15.71B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Phi 3 mini 4k instruct](/fit/microsoft-phi-3-mini-4k-instruct)

Chat · Microsoft

Q8\_0Excellent

4.8 GB20% of RAM~42 tok/s3.82B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[DeepSeek R1 Distill Qwen 14B](/fit/deepseek-ai-deepseek-r1-distill-qwen-14b)

Reasoning · DeepSeek

Q8\_0Excellent

17.0 GB71% of RAM~11 tok/s14.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 14B NVFP4](/fit/nvidia-qwen3-14b-nvfp4)

General · nvidia

Q8\_0Excellent

9.6 GB40% of RAM~20 tok/s8.16B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[EXAONE Deep 7.8B](/fit/lgai-exaone-exaone-deep-7-8b)

General · lgai-exaone

Q8\_0Excellent

9.2 GB38% of RAM~21 tok/s7.82B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[AIN](/fit/mbzuai-ain)

General · mbzuai

Q8\_0Excellent

9.7 GB41% of RAM~19 tok/s8.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 8B FP8](/fit/qwen-qwen3-8b-fp8)

General · Alibaba

Q8\_0Excellent

9.6 GB40% of RAM~20 tok/s8.19B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3Guard Gen 8B](/fit/qwen-qwen3guard-gen-8b)

General · Alibaba

Q8\_0Excellent

9.6 GB40% of RAM~20 tok/s8.19B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llava onevision qwen2 7b ov](/fit/lmms-lab-llava-onevision-qwen2-7b-ov)

General · lmms-lab

Q8\_0Excellent

9.5 GB39% of RAM~20 tok/s8.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[xflux\_text\_encoders](/fit/xlabs-ai-xflux_text_encoders)

Coding · xlabs-ai

Q8\_0Excellent

5.8 GB24% of RAM~34 tok/s4.76B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[MiMo 7B Base](/fit/xiaomimimo-mimo-7b-base)

General · xiaomimimo

Q8\_0Excellent

9.2 GB38% of RAM~21 tok/s7.83B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 30B A3B Instruct 2507 FP4](/fit/nvfp4-qwen3-30b-a3b-instruct-2507-fp4)

Chat · nvfp4

Q8\_0Excellent

17.9 GB75% of RAM~94 tok/s15.58B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LLaDA2.0 mini](/fit/inclusionai-llada2-0-mini)

General · inclusionai

Q8\_0Excellent

18.6 GB78% of RAM~124 tok/s16.26B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL3 8B hf](/fit/opengvlab-internvl3-8b-hf)

General · opengvlab

Q8\_0Excellent

9.4 GB39% of RAM~20 tok/s7.94B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Kunoichi DPO v2 7B](/fit/sanjiwatsuki-kunoichi-dpo-v2-7b)

General · sanjiwatsuki

Q8\_0Excellent

8.6 GB36% of RAM~22 tok/s7.24B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[olmOCR 7B 0225 preview](/fit/allenai-olmocr-7b-0225-preview)

General · allenai

Q8\_0Excellent

9.7 GB41% of RAM~19 tok/s8.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[SmolLM 135M Instruct](/fit/huggingfacetb-smollm-135m-instruct)

Chat · huggingfacetb

Q8\_0Excellent

0.6 GB3% of RAM~1,237 tok/s0.13B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[MiniCPM V 2\_6 int4](/fit/openbmb-minicpm-v-2_6-int4)

General · openbmb

Q8\_0Excellent

9.8 GB41% of RAM~19 tok/s8.32B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 8B.w8a8](/fit/nytopop-qwen3-8b-w8a8)

General · nytopop

Q8\_0Excellent

9.6 GB40% of RAM~20 tok/s8.19B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 8B FP8](/fit/nvidia-qwen3-8b-fp8)

General · nvidia

Q8\_0Excellent

9.6 GB40% of RAM~20 tok/s8.19B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2 VL 7B](/fit/qwen-qwen2-vl-7b)

General · Alibaba

Q8\_0Excellent

9.7 GB41% of RAM~19 tok/s8.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LLaDA2.1 mini](/fit/inclusionai-llada2-1-mini)

General · inclusionai

Q8\_0Excellent

18.6 GB78% of RAM~124 tok/s16.26B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 32B Instruct AWQ 4bit](/fit/cyankiwi-qwen3-vl-32b-instruct-awq-4bit)

Chat · cyankiwi

Q8\_0Excellent

8.3 GB35% of RAM~23 tok/s7.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Gliese Qwen3.5 9B Abliterated Caption](/fit/prithivmlmods-gliese-qwen3-5-9b-abliterated-caption)

General · prithivmlmods

Q8\_0Excellent

11.0 GB46% of RAM~17 tok/s9.41B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 9B Claude 4.6 HighIQ THINKING HERETIC UNCENSORED](/fit/davidau-qwen3-5-9b-claude-4-6-highiq-thinking-heretic-uncensored)

General · davidau

Q8\_0Excellent

11.0 GB46% of RAM~17 tok/s9.41B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[UI TARS 7B SFT](/fit/bytedance-seed-ui-tars-7b-sft)

General · bytedance-seed

Q8\_0Excellent

9.7 GB41% of RAM~19 tok/s8.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Hulu Med 7B](/fit/zju-ai4h-hulu-med-7b)

General · zju-ai4h

Q8\_0Excellent

9.5 GB39% of RAM~20 tok/s8.04B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[stablelm 2 1\_6b chat](/fit/stabilityai-stablelm-2-1_6b-chat)

Chat · Stability AI · 2024-04-08

Q8\_0Excellent

2.3 GB10% of RAM~98 tok/s1.64B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gpt oss 20b](/fit/openai-gpt-oss-20b)

General · openai

Q6\_KExcellent

19.1 GB80% of RAM~58 tok/s21.51B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Molmo2 8B](/fit/allenai-molmo2-8b)

General · allenai

Q8\_0Excellent

10.2 GB42% of RAM~19 tok/s8.66B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 9B AWQ](/fit/quanttrio-qwen3-5-9b-awq)

General · quanttrio

Q8\_0Excellent

11.3 GB47% of RAM~17 tok/s9.65B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 9B AWQ 4bit](/fit/cyankiwi-qwen3-5-9b-awq-4bit)

General · cyankiwi

Q8\_0Excellent

11.5 GB48% of RAM~16 tok/s9.88B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 9B FP8](/fit/lovedheart-qwen3-5-9b-fp8)

General · lovedheart

Q8\_0Excellent

11.3 GB47% of RAM~17 tok/s9.65B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[idefics2 8b](/fit/huggingfacem4-idefics2-8b)

General · huggingfacem4

Q8\_0Excellent

9.9 GB41% of RAM~19 tok/s8.4B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Dream v0 Instruct 7B](/fit/dream-org-dream-v0-instruct-7b)

Chat · dream-org

Q8\_0Excellent

9.0 GB38% of RAM~21 tok/s7.62B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[MiniCPM V 4\_5](/fit/openbmb-minicpm-v-4_5)

General · openbmb

Q8\_0Excellent

10.2 GB43% of RAM~18 tok/s8.7B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 7B Instruct 1M](/fit/qwen-qwen2-5-7b-instruct-1m)

Chat · Alibaba

Q8\_0Excellent

9.0 GB38% of RAM~21 tok/s7.62B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Huihui Qwen3.5 9B abliterated](/fit/huihui-ai-huihui-qwen3-5-9b-abliterated)

General · huihui-ai

Q8\_0Excellent

11.3 GB47% of RAM~17 tok/s9.65B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Bee 8B RL](/fit/open-bee-bee-8b-rl)

General · open-bee

Q8\_0Excellent

10.2 GB42% of RAM~19 tok/s8.68B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gpt\_bigcode santacoder](/fit/bigcode-gpt_bigcode-santacoder)

Coding · BigCode

Q8\_0Excellent

1.7 GB7% of RAM~144 tok/s1.12B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Moonlight 16B A3B](/fit/moonshotai-moonlight-16b-a3b)

General · moonshotai

Q8\_0Excellent

18.3 GB76% of RAM~139 tok/s15.96B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL3\_5 4B Instruct](/fit/opengvlab-internvl3_5-4b-instruct)

Chat · opengvlab

Q8\_0Excellent

5.8 GB24% of RAM~34 tok/s4.73B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[GLM 4.6V Flash](/fit/zai-org-glm-4-6v-flash)

General · zai-org

Q8\_0Excellent

12.0 GB50% of RAM~16 tok/s10.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[granite 3b code base 2k](/fit/ibm-granite-granite-3b-code-base-2k)

Coding · ibm-granite

Q8\_0Excellent

4.4 GB18% of RAM~46 tok/s3.48B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL3\_5 8B HF](/fit/opengvlab-internvl3_5-8b-hf)

General · opengvlab

Q8\_0Excellent

10.0 GB42% of RAM~19 tok/s8.53B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama 4 Scout 17B 16E Instruct quantized.w4a16](/fit/redhatai-llama-4-scout-17b-16e-instruct-quantized-w4a16)

Chat · redhatai

Q6\_KExcellent

17.5 GB73% of RAM~98 tok/s19.6B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwopus3.5 9B v3](/fit/jackrong-qwopus3-5-9b-v3)

General · jackrong

Q8\_0Excellent

11.3 GB47% of RAM~17 tok/s9.65B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Falcon3 7B Instruct](/fit/tiiuae-falcon3-7b-instruct)

Chat · TII · 2024-11-29

Q8\_0Excellent

8.8 GB37% of RAM~22 tok/s7.46B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 14B NVFP4](/fit/redhatai-qwen3-14b-nvfp4)

General · redhatai

Q8\_0Excellent

10.5 GB44% of RAM~18 tok/s8.99B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Huihui Qwen3.5 9B Claude 4.6 Opus abliterated](/fit/huihui-ai-huihui-qwen3-5-9b-claude-4-6-opus-abliterated)

General · huihui-ai

Q8\_0Excellent

11.3 GB47% of RAM~17 tok/s9.65B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 9B AWQ BF16 INT8](/fit/cyankiwi-qwen3-5-9b-awq-bf16-int8)

General · cyankiwi

Q8\_0Excellent

11.5 GB48% of RAM~16 tok/s9.83B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[MolmoPoint 8B](/fit/allenai-molmopoint-8b)

General · allenai

Q8\_0Excellent

10.2 GB42% of RAM~19 tok/s8.68B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[nomic embed text v1.5](/fit/nomic-ai-nomic-embed-text-v1-5)

Embedding · Nomic · 2024-02-10

Q8\_0Excellent

0.7 GB3% of RAM~1,149 tok/s0.14B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[TinyLlama 1.1B Chat v1.0](/fit/tinyllama-tinyllama-1-1b-chat-v1-0)

Chat · Community · 2023-12-30

Q8\_0Excellent

1.7 GB7% of RAM~146 tok/s1.1B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Mistral 7B Instruct v0.2](/fit/mistralai-mistral-7b-instruct-v0-2)

Chat · Mistral AI

Q8\_0Excellent

8.6 GB36% of RAM~22 tok/s7.24B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama 2 7b hf](/fit/meta-llama-llama-2-7b-hf)

General · Meta

Q8\_0Excellent

8.0 GB33% of RAM~24 tok/s6.74B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[phi 3 mini 4k instruct](/fit/microsoft-phi-3-mini-4k-instruct)

Chat · Microsoft · 2024-04-22

Q8\_0Excellent

4.8 GB20% of RAM~42 tok/s3.82B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[granite 3.3 8b instruct](/fit/ibm-granite-granite-3-3-8b-instruct)

Chat · ibm-granite

Q8\_0Excellent

9.6 GB40% of RAM~20 tok/s8.17B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[saiga\_llama3\_8b](/fit/ilyagusev-saiga_llama3_8b)

General · ilyagusev

Q8\_0Excellent

9.5 GB39% of RAM~20 tok/s8.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama 3.1 8B Instruct FP8](/fit/nvidia-llama-3-1-8b-instruct-fp8)

Chat · nvidia

Q8\_0Excellent

9.5 GB39% of RAM~20 tok/s8.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Meta Llama 3.1 8B Instruct FP8](/fit/redhatai-meta-llama-3-1-8b-instruct-fp8)

Chat · redhatai

Q8\_0Excellent

9.5 GB39% of RAM~20 tok/s8.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Step3 VL 10B](/fit/stepfun-ai-step3-vl-10b)

General · stepfun-ai

Q8\_0Excellent

11.8 GB49% of RAM~16 tok/s10.17B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Hermes 2 Pro Llama 3 8B](/fit/nousresearch-hermes-2-pro-llama-3-8b)

General · NousResearch

Q8\_0Excellent

9.5 GB39% of RAM~20 tok/s8.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Meta Llama 3.1 8B Instruct](/fit/nousresearch-meta-llama-3-1-8b-instruct)

Chat · NousResearch

Q8\_0Excellent

9.5 GB39% of RAM~20 tok/s8.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 122B A10B AWQ 4bit](/fit/cyankiwi-qwen3-5-122b-a10b-awq-4bit)

General · cyankiwi

Q5\_K\_MTight

18.6 GB77% of RAM~128 tok/s24.27B params

Try Q4\_K\_M (16.2 GB, ~151 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama 2 7b hf](/fit/nousresearch-llama-2-7b-hf)

General · NousResearch

Q8\_0Excellent

8.0 GB33% of RAM~24 tok/s6.74B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Mistral 7B Instruct v0.3 GPTQ](/fit/thesven-mistral-7b-instruct-v0-3-gptq)

Chat · thesven

Q8\_0Excellent

8.6 GB36% of RAM~22 tok/s7.25B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Olmo 3 7B Instruct SFT](/fit/allenai-olmo-3-7b-instruct-sft)

Chat · allenai

Q8\_0Excellent

8.6 GB36% of RAM~22 tok/s7.3B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama 3 Patronus Lynx 8B Instruct v1.1](/fit/patronusai-llama-3-patronus-lynx-8b-instruct-v1-1)

Chat · patronusai

Q8\_0Excellent

9.5 GB39% of RAM~20 tok/s8.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Nemotron H 8B Base 8K](/fit/nvidia-nemotron-h-8b-base-8k)

General · nvidia

Q8\_0Excellent

9.5 GB40% of RAM~20 tok/s8.1B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Meta Llama 3.1 8B Instruct quantized.w4a16](/fit/redhatai-meta-llama-3-1-8b-instruct-quantized-w4a16)

Chat · redhatai

Q8\_0Excellent

9.5 GB39% of RAM~20 tok/s8.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llammas base p1 GPT 4o human error mix paragraph GEC](/fit/tartunlp-llammas-base-p1-gpt-4o-human-error-mix-paragraph-gec)

General · tartunlp

Q8\_0Excellent

8.0 GB33% of RAM~24 tok/s6.74B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama 3 8B Instruct Gradient 1048k](/fit/gradientai-llama-3-8b-instruct-gradient-1048k)

Chat · gradientai

Q8\_0Excellent

9.5 GB39% of RAM~20 tok/s8.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 3 27b it int4 awq](/fit/gaunernst-gemma-3-27b-it-int4-awq)

General · gaunernst

Q8\_0Excellent

7.1 GB30% of RAM~27 tok/s5.93B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Meta Llama 3.1 8B Instruct FP8 dynamic](/fit/redhatai-meta-llama-3-1-8b-instruct-fp8-dynamic)

Chat · redhatai

Q8\_0Excellent

9.5 GB39% of RAM~20 tok/s8.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 7B Instruct](/fit/qwen-qwen2-5-7b-instruct)

Chat · Alibaba · 2024-09-16

Q8\_0Excellent

9.0 GB38% of RAM~21 tok/s7.62B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[bge large en v1.5](/fit/baai-bge-large-en-v1-5)

Embedding · BAAI · 2023-09-12

Q8\_0Excellent

0.9 GB4% of RAM~473 tok/s0.34B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 8B Instruct](/fit/qwen-qwen3-vl-8b-instruct)

Chat · Alibaba

Q8\_0Excellent

10.3 GB43% of RAM~18 tok/s8.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llava 1.5 7b hf](/fit/llava-hf-llava-1-5-7b-hf)

General · llava-hf

Q8\_0Excellent

8.4 GB35% of RAM~23 tok/s7.06B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[GLM 4.1V 9B Thinking](/fit/zai-org-glm-4-1v-9b-thinking)

General · zai-org

Q8\_0Excellent

12.0 GB50% of RAM~16 tok/s10.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 8B Instruct FP8](/fit/qwen-qwen3-vl-8b-instruct-fp8)

Chat · Alibaba

Q8\_0Excellent

10.3 GB43% of RAM~18 tok/s8.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2 7B Instruct](/fit/qwen-qwen2-7b-instruct)

Chat · Alibaba

Q8\_0Excellent

9.0 GB38% of RAM~21 tok/s7.62B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[falcon 7b](/fit/tiiuae-falcon-7b)

General · TII

Q8\_0Excellent

8.6 GB36% of RAM~22 tok/s7.22B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[XCurOS 1.2 8B VLBF16 Instruct](/fit/xcuros-xcuros-1-2-8b-vlbf16-instruct)

Chat · xcuros

Q8\_0Excellent

10.3 GB43% of RAM~18 tok/s8.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[XCurOS 0.1 8B Instruct](/fit/xcuros-xcuros-0-1-8b-instruct)

Chat · xcuros

Q8\_0Excellent

9.0 GB38% of RAM~21 tok/s7.62B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen 7B Chat](/fit/qwen-qwen-7b-chat)

Chat · Alibaba

Q8\_0Excellent

9.1 GB38% of RAM~21 tok/s7.72B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[GLM 4.1V 9B Thinking AWQ](/fit/dengcao-glm-4-1v-9b-thinking-awq)

General · dengcao

Q8\_0Excellent

12.1 GB50% of RAM~16 tok/s10.36B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[chameleon 7b](/fit/facebook-chameleon-7b)

General · facebook

Q8\_0Excellent

8.4 GB35% of RAM~23 tok/s7.04B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Tarsier 7b](/fit/omni-research-tarsier-7b)

General · omni-research

Q8\_0Excellent

8.4 GB35% of RAM~23 tok/s7.06B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[wildguard](/fit/allenai-wildguard)

General · allenai

Q8\_0Excellent

8.6 GB36% of RAM~22 tok/s7.25B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[fuyu 8b](/fit/adept-fuyu-8b)

General · adept

Q8\_0Excellent

11.0 GB46% of RAM~17 tok/s9.41B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[MiniCPM Llama3 V 2\_5](/fit/openbmb-minicpm-llama3-v-2_5)

General · openbmb

Q8\_0Excellent

10.0 GB42% of RAM~19 tok/s8.54B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Cosmos Reason2 8B](/fit/nvidia-cosmos-reason2-8b)

Reasoning · nvidia

Q8\_0Excellent

10.3 GB43% of RAM~18 tok/s8.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llava v1.6 vicuna 7b](/fit/liuhaotian-llava-v1-6-vicuna-7b)

General · liuhaotian

Q8\_0Excellent

8.4 GB35% of RAM~23 tok/s7.06B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Emu3 Chat hf](/fit/baai-emu3-chat-hf)

Chat · BAAI

Q8\_0Excellent

10.3 GB43% of RAM~18 tok/s8.76B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Mantis 8B siglip llama3](/fit/tiger-lab-mantis-8b-siglip-llama3)

General · tiger-lab

Q8\_0Excellent

10.0 GB41% of RAM~19 tok/s8.48B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[internlm2\_5 7b chat](/fit/internlm-internlm2_5-7b-chat)

Chat · internlm

Q8\_0Excellent

9.1 GB38% of RAM~21 tok/s7.74B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llava llama 3 8b v1\_1 transformers](/fit/xtuner-llava-llama-3-8b-v1_1-transformers)

General · xtuner

Q8\_0Excellent

9.8 GB41% of RAM~19 tok/s8.36B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 30B A3B NVFP4](/fit/redhatai-qwen3-30b-a3b-nvfp4)

General · redhatai

Q8\_0Excellent

20.0 GB83% of RAM~84 tok/s17.45B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llava v1.6 vicuna 7b hf](/fit/llava-hf-llava-v1-6-vicuna-7b-hf)

General · llava-hf

Q8\_0Excellent

8.4 GB35% of RAM~23 tok/s7.06B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[e5 v](/fit/royokong-e5-v)

General · royokong

Q8\_0Excellent

9.8 GB41% of RAM~19 tok/s8.36B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llama3 llava next 8b hf](/fit/llava-hf-llama3-llava-next-8b-hf)

General · llava-hf

Q8\_0Excellent

9.8 GB41% of RAM~19 tok/s8.36B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B Claude Opus 4.6 High Reasoning NVFP4](/fit/harleywang-qwen3-5-27b-claude-opus-4-6-high-reasoning-nvfp4)

Reasoning · harleywang

Q6\_KExcellent

17.1 GB71% of RAM~11 tok/s19.14B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Huihui Qwen3 VL 8B Instruct abliterated](/fit/huihui-ai-huihui-qwen3-vl-8b-instruct-abliterated)

Chat · huihui-ai

Q8\_0Excellent

10.3 GB43% of RAM~18 tok/s8.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 VL 7B Instruct FP4](/fit/asi992h-qwen2-5-vl-7b-instruct-fp4)

Chat · asi992h

Q8\_0Excellent

9.9 GB41% of RAM~19 tok/s8.4B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Mistral 7B Instruct v0.3](/fit/mistralai-mistral-7b-instruct-v0-3)

Chat · Mistral AI · 2024-05-22

Q8\_0Excellent

8.6 GB36% of RAM~22 tok/s7.25B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2 VL 7B Instruct](/fit/qwen-qwen2-vl-7b-instruct)

Chat · Alibaba

Q8\_0Excellent

9.7 GB41% of RAM~19 tok/s8.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llama 7b](/fit/huggyllama-llama-7b)

General · huggyllama

Q8\_0Excellent

8.0 GB33% of RAM~24 tok/s6.74B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[rnj 1 instruct](/fit/essentialai-rnj-1-instruct)

Chat · essentialai

Q8\_0Excellent

9.8 GB41% of RAM~19 tok/s8.31B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[SeeClick](/fit/cckevinn-seeclick)

General · cckevinn

Q8\_0Excellent

11.3 GB47% of RAM~17 tok/s9.66B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 Math 7B](/fit/qwen-qwen2-5-math-7b)

General · Alibaba

Q8\_0Excellent

9.0 GB38% of RAM~21 tok/s7.62B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[pythia 6.9b](/fit/eleutherai-pythia-6-9b)

General · eleutherai

Q8\_0Excellent

8.3 GB35% of RAM~23 tok/s6.99B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Amber](/fit/llm360-amber)

General · llm360

Q8\_0Excellent

8.0 GB33% of RAM~24 tok/s6.74B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Turkish Gemma 9b T1](/fit/ytu-ce-cosmos-turkish-gemma-9b-t1)

General · ytu-ce-cosmos

Q8\_0Excellent

10.8 GB45% of RAM~17 tok/s9.24B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[SDAR 8B Chat b32](/fit/jetlm-sdar-8b-chat-b32)

Chat · jetlm

Q8\_0Excellent

9.6 GB40% of RAM~20 tok/s8.19B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[glm 4 9b chat hf](/fit/zai-org-glm-4-9b-chat-hf)

Chat · zai-org

Q8\_0Excellent

11.0 GB46% of RAM~17 tok/s9.4B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[blip2 opt 6.7b](/fit/salesforce-blip2-opt-6-7b)

General · salesforce

Q8\_0Excellent

9.1 GB38% of RAM~21 tok/s7.75B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Video LLaVA 7B hf](/fit/languagebind-video-llava-7b-hf)

General · languagebind

Q8\_0Excellent

8.7 GB36% of RAM~22 tok/s7.37B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[openvla 7b finetuned libero spatial](/fit/openvla-openvla-7b-finetuned-libero-spatial)

General · openvla

Q8\_0Excellent

8.9 GB37% of RAM~21 tok/s7.54B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 9B Claude 4.6 HighIQ INSTRUCT HERETIC UNCENSORED MLX mxfp8](/fit/thecluster-qwen3-5-9b-claude-4-6-highiq-instruct-heretic-uncensored-mlx-mxfp8)

Chat · thecluster

mlx-8bitExcellent

10.5 GB44% of RAM~18 tok/s9.41B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[openvla 7b finetuned libero 10](/fit/openvla-openvla-7b-finetuned-libero-10)

General · openvla

Q8\_0Excellent

8.9 GB37% of RAM~21 tok/s7.54B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[CodeLlama 7b Instruct hf](/fit/meta-llama-codellama-7b-instruct-hf)

Coding · Meta · 2024-03-13

Q8\_0Excellent

8.0 GB33% of RAM~24 tok/s6.74B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Meta Llama 3 8B](/fit/meta-llama-meta-llama-3-8b)

General · Meta

Q8\_0Excellent

9.5 GB39% of RAM~20 tok/s8.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama 3.1 8B](/fit/meta-llama-llama-3-1-8b)

General · Meta · 2024-07-14

Q8\_0Excellent

9.5 GB39% of RAM~20 tok/s8.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL2 8B](/fit/opengvlab-internvl2-8b)

General · opengvlab

Q8\_0Excellent

9.5 GB40% of RAM~20 tok/s8.08B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[hf moshiko](/fit/kmhf-hf-moshiko)

General · kmhf

Q8\_0Excellent

9.2 GB38% of RAM~21 tok/s7.78B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Moonlight 16B A3B Instruct](/fit/moonshotai-moonlight-16b-a3b-instruct)

Chat · moonshotai

Q8\_0Excellent

18.3 GB76% of RAM~139 tok/s15.96B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama Guard 3 8B](/fit/meta-llama-llama-guard-3-8b)

General · Meta

Q8\_0Excellent

9.5 GB39% of RAM~20 tok/s8.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[salamandra 7b instruct](/fit/bsc-lt-salamandra-7b-instruct)

Chat · bsc-lt

Q8\_0Excellent

9.2 GB38% of RAM~21 tok/s7.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llava onevision qwen2 7b ov hf](/fit/llava-hf-llava-onevision-qwen2-7b-ov-hf)

General · llava-hf

Q8\_0Excellent

9.5 GB39% of RAM~20 tok/s8.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL2\_5 8B](/fit/opengvlab-internvl2_5-8b)

General · opengvlab

Q8\_0Excellent

9.5 GB40% of RAM~20 tok/s8.08B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Eagle2.5 8B](/fit/nvidia-eagle2-5-8b)

General · nvidia

Q8\_0Excellent

9.5 GB40% of RAM~20 tok/s8.07B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 2 9b it AWQ](/fit/solidrust-gemma-2-9b-it-awq)

General · solidrust

Q8\_0Excellent

11.8 GB49% of RAM~16 tok/s10.16B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Phi 4 reasoning vision 15B](/fit/microsoft-phi-4-reasoning-vision-15b)

Reasoning · Microsoft

Q8\_0Excellent

17.4 GB72% of RAM~11 tok/s15.12B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 3 12b it qat q4\_0 unquantized](/fit/lightricks-gemma-3-12b-it-qat-q4_0-unquantized)

General · lightricks

Q8\_0Excellent

14.1 GB59% of RAM~13 tok/s12.19B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Molmo 7B D 0924](/fit/allenai-molmo-7b-d-0924)

General · allenai

Q8\_0Excellent

9.4 GB39% of RAM~20 tok/s8.02B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[LLaVA OneVision 1.5 8B Instruct](/fit/lmms-lab-llava-onevision-1-5-8b-instruct)

Chat · lmms-lab

Q8\_0Excellent

10.0 GB42% of RAM~19 tok/s8.53B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Skywork VL Reward 7B](/fit/skywork-skywork-vl-reward-7b)

General · skywork

Q8\_0Excellent

9.7 GB41% of RAM~19 tok/s8.29B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 3 12b it FP8 dynamic](/fit/redhatai-gemma-3-12b-it-fp8-dynamic)

General · redhatai

Q8\_0Excellent

14.1 GB59% of RAM~13 tok/s12.19B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 Coder 14B Instruct](/fit/qwen-qwen2-5-coder-14b-instruct)

Coding · Alibaba · 2024-11-06

Q8\_0Excellent

17.0 GB71% of RAM~11 tok/s14.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[HyperCLOVAX SEED Omni 8B](/fit/naver-hyperclovax-hyperclovax-seed-omni-8b)

General · naver-hyperclovax

Q8\_0Excellent

12.5 GB52% of RAM~15 tok/s10.74B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[pixtral 12b](/fit/mistral-experimental-pixtral-12b)

General · mistral-experimental

Q8\_0Excellent

14.6 GB61% of RAM~13 tok/s12.68B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[glm 4 9b chat](/fit/thudm-glm-4-9b-chat)

Chat · thudm · 2024-06-04

Q8\_0Excellent

11.0 GB46% of RAM~17 tok/s9.4B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[aya vision 8b](/fit/coherelabs-aya-vision-8b)

General · coherelabs

Q8\_0Excellent

10.1 GB42% of RAM~19 tok/s8.63B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama 2 7b chat hf](/fit/nousresearch-llama-2-7b-chat-hf)

Chat · NousResearch

Q8\_0Excellent

8.0 GB33% of RAM~24 tok/s6.74B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Yi 6B Chat](/fit/01-ai-yi-6b-chat)

Chat · 01.ai · 2023-11-22

Q8\_0Excellent

7.3 GB30% of RAM~27 tok/s6.06B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[t5gemma 2 4b 4b](/fit/google-t5gemma-2-4b-4b)

General · Google

Q8\_0Excellent

10.4 GB43% of RAM~18 tok/s8.85B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama 3.2 11B Vision Instruct abliterated](/fit/huihui-ai-llama-3-2-11b-vision-instruct-abliterated)

Chat · huihui-ai

Q8\_0Excellent

12.4 GB52% of RAM~15 tok/s10.67B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[NVIDIA Nemotron Nano 12B v2 VL FP8](/fit/nvidia-nvidia-nemotron-nano-12b-v2-vl-fp8)

General · nvidia

Q8\_0Excellent

15.2 GB63% of RAM~12 tok/s13.18B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[NVIDIA Nemotron Nano 12B v2 VL BF16](/fit/nvidia-nvidia-nemotron-nano-12b-v2-vl-bf16)

General · nvidia

Q8\_0Excellent

15.2 GB63% of RAM~12 tok/s13.18B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Mistral NeMo Minitron 8B Instruct](/fit/nvidia-mistral-nemo-minitron-8b-instruct)

Chat · nvidia

Q8\_0Excellent

9.9 GB41% of RAM~19 tok/s8.41B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[falcon mamba 7b instruct](/fit/tiiuae-falcon-mamba-7b-instruct)

Chat · TII

Q8\_0Excellent

8.6 GB36% of RAM~22 tok/s7.27B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 Coder 14B](/fit/qwen-qwen2-5-coder-14b)

Coding · Alibaba

Q8\_0Excellent

17.0 GB71% of RAM~11 tok/s14.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL3 9B](/fit/opengvlab-internvl3-9b)

General · opengvlab

Q8\_0Excellent

10.7 GB45% of RAM~18 tok/s9.14B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[moondream3 preview](/fit/moondream-moondream3-preview)

General · moondream

Q8\_0Excellent

10.8 GB45% of RAM~17 tok/s9.27B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[deepseek vl 7b chat](/fit/deepseek-ai-deepseek-vl-7b-chat)

Chat · DeepSeek

Q8\_0Excellent

8.7 GB36% of RAM~22 tok/s7.34B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 2 9b it](/fit/google-gemma-2-9b-it)

General · Google · 2024-06-24

Q8\_0Excellent

10.8 GB45% of RAM~17 tok/s9.24B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 Math 7B Instruct](/fit/qwen-qwen2-5-math-7b-instruct)

Chat · Alibaba

Q8\_0Excellent

9.0 GB38% of RAM~21 tok/s7.62B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Meta Llama 3 8B Instruct](/fit/meta-llama-meta-llama-3-8b-instruct)

Chat · Meta

Q8\_0Excellent

9.5 GB39% of RAM~20 tok/s8.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 Coder 30B A3B Instruct](/fit/lmstudio-community-qwen3-coder-30b-a3b-instruct-mlx-4bit)

Coding · lmstudio-community

mlx-4bitTight

19.3 GB80% of RAM~92 tok/s30.53B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[mistral nemo instruct 2407 awq](/fit/casperhansen-mistral-nemo-instruct-2407-awq)

Chat · casperhansen

Q8\_0Excellent

14.2 GB59% of RAM~13 tok/s12.25B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[falcon 7b instruct](/fit/tiiuae-falcon-7b-instruct)

Chat · TII · 2023-04-25

Q8\_0Excellent

8.6 GB36% of RAM~22 tok/s7.22B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[SauerkrautLM Nemo 12b Instruct](/fit/vagosolutions-sauerkrautlm-nemo-12b-instruct)

Chat · vagosolutions

Q8\_0Excellent

14.2 GB59% of RAM~13 tok/s12.25B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama 3.2 11B Vision](/fit/meta-llama-llama-3-2-11b-vision)

General · Meta

Q8\_0Excellent

12.4 GB52% of RAM~15 tok/s10.64B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL3 8B Instruct](/fit/opengvlab-internvl3-8b-instruct)

Chat · opengvlab

Q8\_0Excellent

9.4 GB39% of RAM~20 tok/s7.94B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama 3.1 8B Instruct](/fit/meta-llama-llama-3-1-8b-instruct)

Chat · Meta · 2024-07-18

Q8\_0Excellent

9.5 GB39% of RAM~20 tok/s8.03B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 35B A3B](/fit/qwen-qwen3-5-35b-a3b)

General · Alibaba · 2026-02-24

Q3\_K\_MTight

20.1 GB84% of RAM~117 tok/s35.95B params

Try Q2\_K (16.2 GB, ~152 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[HyperCLOVAX SEED Think 14B GPTQ](/fit/k-compression-hyperclovax-seed-think-14b-gptq)

General · k-compression

Q8\_0Excellent

17.0 GB71% of RAM~11 tok/s14.75B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[HyperCLOVAX SEED Think 14B](/fit/naver-hyperclovax-hyperclovax-seed-think-14b)

General · naver-hyperclovax

Q8\_0Excellent

17.0 GB71% of RAM~11 tok/s14.75B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B Claude 4.6 Opus Reasoning Distilled NVFP4](/fit/mconcat-qwen3-5-27b-claude-4-6-opus-reasoning-distilled-nvfp4)

Reasoning · mconcat

Q6\_KExcellent

19.7 GB82% of RAM~10 tok/s22.15B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[instructblip vicuna 7b](/fit/salesforce-instructblip-vicuna-7b)

Chat · salesforce

Q8\_0Excellent

9.3 GB39% of RAM~20 tok/s7.91B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[lavida llada v1.0 instruct hf transformers](/fit/konstantinoskk-lavida-llada-v1-0-instruct-hf-transformers)

Chat · konstantinoskk

Q8\_0Excellent

9.9 GB41% of RAM~19 tok/s8.43B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[NVIDIA Nemotron 3 Nano 30B A3B NVFP4](/fit/nvidia-nvidia-nemotron-3-nano-30b-a3b-nvfp4)

General · nvidia

Q6\_KExcellent

16.3 GB68% of RAM~12 tok/s18.24B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 14B](/fit/qwen-qwen2-5-14b)

General · Alibaba

Q8\_0Excellent

17.0 GB71% of RAM~11 tok/s14.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Ovis2.6 30B A3B](/fit/aidc-ai-ovis2-6-30b-a3b)

General · aidc-ai

Q3\_K\_MTight

17.6 GB73% of RAM~102 tok/s31.38B params

Try Q2\_K (14.2 GB, ~133 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 30B A3B Instruct 2507](/fit/lmstudio-community-qwen3-30b-a3b-instruct-2507-mlx-4bit)

Chat · lmstudio-community

mlx-4bitTight

19.3 GB80% of RAM~92 tok/s30.53B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen1.5 14B](/fit/qwen-qwen1-5-14b)

General · Alibaba

Q8\_0Excellent

16.3 GB68% of RAM~11 tok/s14.17B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 14B AWQ](/fit/qwen-qwen3-14b-awq)

General · Alibaba

Q8\_0Excellent

17.0 GB71% of RAM~11 tok/s14.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 30B A3B Thinking 2507](/fit/qwen-qwen3-30b-a3b-thinking-2507)

General · Alibaba

Q4\_K\_MTight

20.2 GB84% of RAM~87 tok/s30.53B params

Try Q3\_K\_M (17.2 GB, ~105 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Llama 3.2 11B Vision Instruct](/fit/meta-llama-llama-3-2-11b-vision-instruct)

Chat · Meta · 2024-09-18

Q8\_0Excellent

12.4 GB52% of RAM~15 tok/s10.67B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[sarvam 30b uncensored](/fit/aoxo-sarvam-30b-uncensored)

General · aoxo

Q3\_K\_MTight

18.0 GB75% of RAM~116 tok/s32.15B params

Try Q2\_K (14.5 GB, ~150 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[sarvam 30b](/fit/sarvamai-sarvam-30b)

General · sarvamai

Q3\_K\_MTight

18.0 GB75% of RAM~116 tok/s32.15B params

Try Q2\_K (14.5 GB, ~150 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[typhoon2.5 qwen3 30b a3b](/fit/typhoon-ai-typhoon2-5-qwen3-30b-a3b)

General · typhoon-ai

Q4\_K\_MTight

20.2 GB84% of RAM~87 tok/s30.53B params

Try Q3\_K\_M (17.2 GB, ~105 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[ERNIE 4.5 21B A3B](/fit/lmstudio-community-ernie-4-5-21b-a3b-mlx-4bit)

General · lmstudio-community

mlx-4bitExcellent

13.9 GB58% of RAM~14 tok/s21.83B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 3 12b pt](/fit/google-gemma-3-12b-pt)

General · Google

Q8\_0Excellent

14.1 GB59% of RAM~13 tok/s12.19B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL3\_5 38B AWQ 4bit](/fit/cyankiwi-internvl3_5-38b-awq-4bit)

General · cyankiwi

Q8\_0Excellent

14.0 GB58% of RAM~13 tok/s12.06B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[CodeLlama 13b Instruct hf](/fit/meta-llama-codellama-13b-instruct-hf)

Coding · Meta · 2024-03-13

Q8\_0Excellent

15.0 GB63% of RAM~12 tok/s13.02B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 Coder 30B A3B Instruct](/fit/qwen-qwen3-coder-30b-a3b-instruct)

Coding · Alibaba

Q4\_K\_MTight

20.2 GB84% of RAM~87 tok/s30.53B params

Try Q3\_K\_M (17.2 GB, ~105 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 Coder 30B A3B Instruct FP8](/fit/qwen-qwen3-coder-30b-a3b-instruct-fp8)

Coding · Alibaba

Q4\_K\_MTight

20.2 GB84% of RAM~87 tok/s30.53B params

Try Q3\_K\_M (17.2 GB, ~105 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 30B A3B Thinking](/fit/qwen-qwen3-vl-30b-a3b-thinking)

General · Alibaba

Q4\_K\_MTight

20.6 GB86% of RAM~118 tok/s31.07B params

Try Q3\_K\_M (17.4 GB, ~142 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 14B Base](/fit/qwen-qwen3-14b-base)

General · Alibaba

Q8\_0Excellent

17.0 GB71% of RAM~11 tok/s14.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[pythia 12b](/fit/eleutherai-pythia-12b)

General · eleutherai

Q8\_0Excellent

13.9 GB58% of RAM~13 tok/s12B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 Coder 30B A3B Instruct AWQ](/fit/quanttrio-qwen3-coder-30b-a3b-instruct-awq)

Coding · quanttrio

Q4\_K\_MTight

20.2 GB84% of RAM~87 tok/s30.53B params

Try Q3\_K\_M (17.2 GB, ~105 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[bu 30b a3b preview](/fit/browser-use-bu-30b-a3b-preview)

General · browser-use

Q4\_K\_MTight

20.6 GB86% of RAM~118 tok/s31.07B params

Try Q3\_K\_M (17.4 GB, ~142 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Bielik 11B v3.0 Instruct](/fit/speakleash-bielik-11b-v3-0-instruct)

Chat · speakleash

Q8\_0Excellent

13.0 GB54% of RAM~14 tok/s11.17B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 30B A3B GPTQ Int4](/fit/qwen-qwen3-30b-a3b-gptq-int4)

General · Alibaba

Q4\_K\_MTight

20.2 GB84% of RAM~87 tok/s30.53B params

Try Q3\_K\_M (17.2 GB, ~105 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llava v1.6 vicuna 13b](/fit/liuhaotian-llava-v1-6-vicuna-13b)

General · liuhaotian

Q8\_0Excellent

15.4 GB64% of RAM~12 tok/s13.35B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 30B A3B Base](/fit/qwen-qwen3-30b-a3b-base)

General · Alibaba

Q4\_K\_MTight

20.2 GB84% of RAM~87 tok/s30.53B params

Try Q3\_K\_M (17.2 GB, ~105 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[SOLAR 10.7B Instruct v1.0](/fit/upstage-solar-10-7b-instruct-v1-0)

Chat · Upstage · 2023-12-12

Q8\_0Excellent

12.5 GB52% of RAM~15 tok/s10.73B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 30B A3B AWQ](/fit/quixiai-qwen3-30b-a3b-awq)

General · quixiai

Q4\_K\_MTight

20.2 GB84% of RAM~87 tok/s30.53B params

Try Q3\_K\_M (17.2 GB, ~105 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 32B NVFP4](/fit/redhatai-qwen3-32b-nvfp4)

General · redhatai

Q6\_KExcellent

17.0 GB71% of RAM~11 tok/s19.11B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 30B A3B.w8a8](/fit/nytopop-qwen3-30b-a3b-w8a8)

General · nytopop

Q4\_K\_MTight

20.2 GB84% of RAM~87 tok/s30.55B params

Try Q3\_K\_M (17.2 GB, ~105 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[GLM 4.6V AWQ 4bit](/fit/cyankiwi-glm-4-6v-awq-4bit)

General · cyankiwi

Q6\_KExcellent

17.4 GB72% of RAM~11 tok/s19.49B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 3 12b it int4 awq](/fit/gaunernst-gemma-3-12b-it-int4-awq)

General · gaunernst

Q8\_0Excellent

14.1 GB59% of RAM~13 tok/s12.19B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llava 1.5 13b hf](/fit/llava-hf-llava-1-5-13b-hf)

General · llava-hf

Q8\_0Excellent

15.4 GB64% of RAM~12 tok/s13.35B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL3 14B hf](/fit/opengvlab-internvl3-14b-hf)

General · opengvlab

Q8\_0Excellent

17.4 GB72% of RAM~11 tok/s15.12B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 30B A3B Instruct 2507 FP8](/fit/qwen-qwen3-30b-a3b-instruct-2507-fp8)

Chat · Alibaba

Q4\_K\_MTight

20.2 GB84% of RAM~87 tok/s30.53B params

Try Q3\_K\_M (17.2 GB, ~105 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[HarmBench Llama 2 13b cls](/fit/cais-harmbench-llama-2-13b-cls)

General · cais

Q8\_0Excellent

15.0 GB63% of RAM~12 tok/s13.02B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 30B A3B Instruct 2507 GPTQ Int4](/fit/junhowie-qwen3-30b-a3b-instruct-2507-gptq-int4)

Chat · junhowie

Q4\_K\_MTight

20.2 GB84% of RAM~87 tok/s30.53B params

Try Q3\_K\_M (17.2 GB, ~105 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llm jp 3.1 13b](/fit/llm-jp-llm-jp-3-1-13b)

General · llm-jp

Q8\_0Excellent

15.8 GB66% of RAM~12 tok/s13.71B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 4 31B it NVFP4](/fit/redhatai-gemma-4-31b-it-nvfp4)

General · redhatai

Q6\_KExcellent

17.7 GB74% of RAM~11 tok/s19.87B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 30B A3B Instruct AWQ](/fit/quanttrio-qwen3-vl-30b-a3b-instruct-awq)

Chat · quanttrio

Q4\_K\_MTight

20.6 GB86% of RAM~118 tok/s31.07B params

Try Q3\_K\_M (17.4 GB, ~142 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 4 26B A4B it](/fit/google-gemma-4-26b-a4b-it)

General · Google · 2026-03-11

Q5\_K\_MTight

20.3 GB85% of RAM~9 tok/s26.54B params

Try Q4\_K\_M (17.6 GB, ~11 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 14B Instruct](/fit/openpipe-qwen3-14b-instruct)

Chat · openpipe

Q8\_0Excellent

17.0 GB71% of RAM~11 tok/s14.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Kimi VL A3B Thinking](/fit/moonshotai-kimi-vl-a3b-thinking)

General · moonshotai

Q8\_0Excellent

18.8 GB78% of RAM~10 tok/s16.41B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Kimi VL A3B Thinking 2506](/fit/moonshotai-kimi-vl-a3b-thinking-2506)

General · moonshotai

Q8\_0Excellent

18.8 GB78% of RAM~10 tok/s16.41B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B Claude 4.6 Opus Reasoning Distilled FP8 Dynamic](/fit/mconcat-qwen3-5-27b-claude-4-6-opus-reasoning-distilled-fp8-dynamic)

Reasoning · mconcat

Q4\_K\_MTight

18.2 GB76% of RAM~11 tok/s27.36B params

Try Q3\_K\_M (15.4 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B Claude Opus 4.6 High Reasoning](/fit/harleywang-qwen3-5-27b-claude-opus-4-6-high-reasoning)

Reasoning · harleywang

Q4\_K\_MTight

18.2 GB76% of RAM~11 tok/s27.36B params

Try Q3\_K\_M (15.4 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B](/fit/qwen-qwen3-5-27b)

General · Alibaba · 2026-02-24

Q4\_K\_MTight

18.4 GB77% of RAM~10 tok/s27.78B params

Try Q3\_K\_M (15.7 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 14B Instruct AWQ](/fit/qwen-qwen2-5-14b-instruct-awq)

Chat · Alibaba

Q8\_0Excellent

17.0 GB71% of RAM~11 tok/s14.77B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 35B A3B AWQ 4bit](/fit/cyankiwi-qwen3-5-35b-a3b-awq-4bit)

General · cyankiwi

Q2\_KGreat

16.6 GB69% of RAM~155 tok/s36.98B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B Claude 4.6 Opus Reasoning Distilled](/fit/jackrong-qwen3-5-27b-claude-4-6-opus-reasoning-distilled)

Reasoning · jackrong

Q4\_K\_MTight

18.4 GB77% of RAM~10 tok/s27.78B params

Try Q3\_K\_M (15.7 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Gemma 4 31B IT NVFP4](/fit/nvidia-gemma-4-31b-it-nvfp4)

General · nvidia

Q6\_KExcellent

18.6 GB77% of RAM~10 tok/s20.87B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B NVFP4](/fit/apolo13x-qwen3-5-27b-nvfp4)

General · apolo13x

Q8\_0Excellent

19.1 GB80% of RAM~10 tok/s16.71B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B Claude 4.6 Opus Reasoning Distilled v2](/fit/jackrong-qwen3-5-27b-claude-4-6-opus-reasoning-distilled-v2)

Reasoning · jackrong

Q4\_K\_MTight

18.4 GB77% of RAM~10 tok/s27.78B params

Try Q3\_K\_M (15.7 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 35B A3B AWQ 8bit](/fit/cyankiwi-qwen3-5-35b-a3b-awq-8bit)

General · cyankiwi

Q2\_KGreat

16.6 GB69% of RAM~155 tok/s36.98B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B Claude 4.6 Opus Reasoning Distilled GPTQ int4](/fit/codgician-qwen3-5-27b-claude-4-6-opus-reasoning-distilled-gptq-int4)

Reasoning · codgician

Q4\_K\_MTight

18.4 GB77% of RAM~10 tok/s27.78B params

Try Q3\_K\_M (15.7 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B Claude 4.6 Opus Reasoning Distilled v2 AWQ](/fit/quanttrio-qwen3-5-27b-claude-4-6-opus-reasoning-distilled-v2-awq)

Reasoning · quanttrio

Q4\_K\_MTight

18.4 GB77% of RAM~10 tok/s27.78B params

Try Q3\_K\_M (15.7 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B Claude 4.6 Opus Reasoning Distilled GPTQ int4](/fit/oxzoid-qwen3-5-27b-claude-4-6-opus-reasoning-distilled-gptq-int4)

Reasoning · oxzoid

Q4\_K\_MTight

18.4 GB77% of RAM~10 tok/s27.78B params

Try Q3\_K\_M (15.7 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B heretic v3 NVFP4](/fit/deaquay-qwen3-5-27b-heretic-v3-nvfp4)

General · deaquay

Q8\_0Excellent

19.1 GB80% of RAM~10 tok/s16.71B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 35B A3B FP8](/fit/qwen-qwen3-5-35b-a3b-fp8)

General · Alibaba

Q3\_K\_MTight

20.1 GB84% of RAM~123 tok/s35.95B params

Try Q2\_K (16.2 GB, ~159 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL3\_5 GPT OSS 20B A4B Preview HF](/fit/opengvlab-internvl3_5-gpt-oss-20b-a4b-preview-hf)

General · opengvlab

Q6\_KExcellent

18.9 GB79% of RAM~10 tok/s21.23B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 35B A3B AWQ](/fit/quanttrio-qwen3-5-35b-a3b-awq)

General · quanttrio

Q3\_K\_MTight

20.1 GB84% of RAM~123 tok/s35.95B params

Try Q2\_K (16.2 GB, ~159 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Huihui Qwen3.5 35B A3B Claude 4.6 Opus abliterated](/fit/huihui-ai-huihui-qwen3-5-35b-a3b-claude-4-6-opus-abliterated)

General · huihui-ai

Q3\_K\_MTight

20.1 GB84% of RAM~123 tok/s35.95B params

Try Q2\_K (16.2 GB, ~159 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Huihui Qwen3.5 35B A3B abliterated](/fit/huihui-ai-huihui-qwen3-5-35b-a3b-abliterated)

General · huihui-ai

Q3\_K\_MTight

20.1 GB84% of RAM~123 tok/s35.95B params

Try Q2\_K (16.2 GB, ~159 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 35B A3B Base](/fit/qwen-qwen3-5-35b-a3b-base)

General · Alibaba

Q3\_K\_MTight

20.1 GB84% of RAM~123 tok/s35.95B params

Try Q2\_K (16.2 GB, ~159 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B NVFP4](/fit/axionml-qwen3-5-27b-nvfp4)

General · axionml

Q8\_0Excellent

19.6 GB82% of RAM~9 tok/s17.13B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B heretic](/fit/coder3101-qwen3-5-27b-heretic)

Coding · coder3101

Q4\_K\_MTight

18.2 GB76% of RAM~11 tok/s27.36B params

Try Q3\_K\_M (15.4 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[MiniMax M2.1 AWQ 4bit](/fit/cyankiwi-minimax-m2-1-awq-4bit)

General · cyankiwi

Q3\_K\_MTight

20.6 GB86% of RAM~120 tok/s36.81B params

Try Q2\_K (16.5 GB, ~156 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llm jp 3.1 13b instruct4](/fit/llm-jp-llm-jp-3-1-13b-instruct4)

Chat · llm-jp

Q8\_0Excellent

15.8 GB66% of RAM~12 tok/s13.71B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Kimi VL A3B Instruct](/fit/moonshotai-kimi-vl-a3b-instruct)

Chat · moonshotai

Q8\_0Excellent

18.8 GB78% of RAM~10 tok/s16.41B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 32B NVFP4](/fit/nvidia-qwen3-32b-nvfp4)

General · nvidia

Q8\_0Excellent

19.6 GB82% of RAM~9 tok/s17.16B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Dolphin Mistral 24B Venice Edition](/fit/dphn-dolphin-mistral-24b-venice-edition)

General · dphn

Q5\_K\_MTight

18.1 GB75% of RAM~11 tok/s23.57B params

Try Q4\_K\_M (15.7 GB, ~12 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[NVIDIA Nemotron 3 Nano 30B A3B BF16](/fit/nvidia-nvidia-nemotron-3-nano-30b-a3b-bf16)

General · nvidia · 2025-12-04

Q3\_K\_MTight

17.7 GB74% of RAM~11 tok/s31.58B params

Try Q2\_K (14.3 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[DeepSeek R1 Distill Qwen 32B](/fit/deepseek-ai-deepseek-r1-distill-qwen-32b)

Reasoning · DeepSeek · 2025-01-20

Q3\_K\_MTight

18.4 GB77% of RAM~11 tok/s32.76B params

Try Q2\_K (14.8 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 4 31B it](/fit/google-gemma-4-31b-it)

General · Google · 2026-03-11

Q3\_K\_MTight

18.3 GB76% of RAM~11 tok/s32.68B params

Try Q2\_K (14.7 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 35B A3B Claude 4.6 Opus Reasoning Distilled GPTQ int4](/fit/codgician-qwen3-5-35b-a3b-claude-4-6-opus-reasoning-distilled-gptq-int4)

Reasoning · codgician

Q3\_K\_MTight

20.1 GB84% of RAM~123 tok/s35.95B params

Try Q2\_K (16.2 GB, ~159 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 35B A3B Claude 4.6 Opus Reasoning Distilled](/fit/jackrong-qwen3-5-35b-a3b-claude-4-6-opus-reasoning-distilled)

Reasoning · jackrong

Q3\_K\_MTight

20.1 GB84% of RAM~123 tok/s35.95B params

Try Q2\_K (16.2 GB, ~159 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[deepseek vl2 small](/fit/deepseek-ai-deepseek-vl2-small)

General · DeepSeek

Q8\_0Excellent

18.5 GB77% of RAM~10 tok/s16.15B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Mistral Small 24B Instruct 2501 AWQ](/fit/stelterlab-mistral-small-24b-instruct-2501-awq)

Chat · stelterlab

Q5\_K\_MTight

18.1 GB75% of RAM~11 tok/s23.57B params

Try Q4\_K\_M (15.7 GB, ~12 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[GLM 4.7 Flash REAP 23B A3B](/fit/cerebras-glm-4-7-flash-reap-23b-a3b)

General · cerebras

Q6\_KTight

20.4 GB85% of RAM~9 tok/s23B params

Try Q5\_K\_M (17.6 GB, ~11 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Aria\_hf\_2](/fit/m-ric-aria_hf_2)

General · m-ric

Q5\_K\_MTight

19.4 GB81% of RAM~10 tok/s25.31B params

Try Q4\_K\_M (16.8 GB, ~12 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Aria](/fit/rhymes-ai-aria)

General · rhymes-ai

Q5\_K\_MTight

19.4 GB81% of RAM~10 tok/s25.31B params

Try Q4\_K\_M (16.8 GB, ~12 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Mistral Small 3.2 24B Instruct hf AWQ](/fit/gghfez-mistral-small-3-2-24b-instruct-hf-awq)

Chat · gghfez

Q5\_K\_MTight

18.1 GB75% of RAM~11 tok/s23.57B params

Try Q4\_K\_M (15.7 GB, ~12 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[t5gemma 9b 9b ul2](/fit/google-t5gemma-9b-9b-ul2)

General · Google

Q6\_KExcellent

18.1 GB75% of RAM~10 tok/s20.33B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 4 31B it heretic](/fit/coder3101-gemma-4-31b-it-heretic)

Coding · coder3101

Q3\_K\_MTight

17.6 GB73% of RAM~11 tok/s31.27B params

Try Q2\_K (14.1 GB, ~15 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[GLM 4.7 Flash](/fit/lmstudio-community-glm-4-7-flash-mlx-8bit)

General · lmstudio-community

mlx-4bitTight

18.9 GB79% of RAM~10 tok/s29.94B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[OpenReasoning Nemotron 32B](/fit/nvidia-openreasoning-nemotron-32b)

Reasoning · nvidia

Q3\_K\_MTight

18.4 GB77% of RAM~11 tok/s32.76B params

Try Q2\_K (14.8 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 4 26B A4B it AWQ 4bit](/fit/cyankiwi-gemma-4-26b-a4b-it-awq-4bit)

General · cyankiwi

Q5\_K\_MTight

20.3 GB85% of RAM~9 tok/s26.55B params

Try Q4\_K\_M (17.6 GB, ~11 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 3 27b it FP8 dynamic](/fit/redhatai-gemma-3-27b-it-fp8-dynamic)

General · redhatai

Q4\_K\_MTight

18.2 GB76% of RAM~11 tok/s27.44B params

Try Q3\_K\_M (15.5 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 4 26B A4B](/fit/google-gemma-4-26b-a4b)

General · Google

Q5\_K\_MTight

20.3 GB85% of RAM~9 tok/s26.54B params

Try Q4\_K\_M (17.6 GB, ~11 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B Claude 4.6 OS Auto Variable Thinking](/fit/davidau-qwen3-5-27b-claude-4-6-os-auto-variable-thinking)

General · davidau

Q4\_K\_MTight

18.2 GB76% of RAM~11 tok/s27.36B params

Try Q3\_K\_M (15.4 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwopus3.5 27B v3](/fit/jackrong-qwopus3-5-27b-v3)

General · jackrong

Q4\_K\_MTight

18.2 GB76% of RAM~11 tok/s27.36B params

Try Q3\_K\_M (15.4 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B Deckard PKD Heretic Uncensored Thinking](/fit/davidau-qwen3-5-27b-deckard-pkd-heretic-uncensored-thinking)

General · davidau

Q4\_K\_MTight

18.2 GB76% of RAM~11 tok/s27.36B params

Try Q3\_K\_M (15.4 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B earica](/fit/voidful-qwen3-5-27b-earica)

General · voidful

Q4\_K\_MTight

18.2 GB76% of RAM~11 tok/s27.36B params

Try Q3\_K\_M (15.4 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B earica hardness](/fit/voidful-qwen3-5-27b-earica-hardness)

General · voidful

Q4\_K\_MTight

18.2 GB76% of RAM~11 tok/s27.36B params

Try Q3\_K\_M (15.4 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 4 26B A4B it AWQ 8bit](/fit/cyankiwi-gemma-4-26b-a4b-it-awq-8bit)

General · cyankiwi

Q5\_K\_MTight

20.3 GB85% of RAM~9 tok/s26.55B params

Try Q4\_K\_M (17.6 GB, ~11 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B ultra uncensored heretic v1](/fit/llmfan46-qwen3-5-27b-ultra-uncensored-heretic-v1)

General · llmfan46

Q4\_K\_MTight

18.2 GB76% of RAM~11 tok/s27.36B params

Try Q3\_K\_M (15.4 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B FP8](/fit/qwen-qwen3-5-27b-fp8)

General · Alibaba

Q4\_K\_MTight

18.4 GB77% of RAM~10 tok/s27.78B params

Try Q3\_K\_M (15.7 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B AWQ](/fit/quanttrio-qwen3-5-27b-awq)

General · quanttrio

Q4\_K\_MTight

18.4 GB77% of RAM~10 tok/s27.78B params

Try Q3\_K\_M (15.7 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Huihui Qwen3.5 27B abliterated](/fit/huihui-ai-huihui-qwen3-5-27b-abliterated)

General · huihui-ai

Q4\_K\_MTight

18.4 GB77% of RAM~10 tok/s27.78B params

Try Q3\_K\_M (15.7 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[EXAONE 4.0 32B](/fit/lgai-exaone-exaone-4-0-32b)

General · lgai-exaone · 2025-07-11

Q3\_K\_MTight

18.0 GB75% of RAM~11 tok/s32B params

Try Q2\_K (14.4 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Huihui Qwen3.5 27B Claude 4.6 Opus abliterated](/fit/huihui-ai-huihui-qwen3-5-27b-claude-4-6-opus-abliterated)

General · huihui-ai

Q4\_K\_MTight

18.4 GB77% of RAM~10 tok/s27.78B params

Try Q3\_K\_M (15.7 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B AWQ 4bit](/fit/cyankiwi-qwen3-5-27b-awq-4bit)

General · cyankiwi

Q4\_K\_MTight

18.9 GB79% of RAM~10 tok/s28.55B params

Try Q3\_K\_M (16.1 GB, ~12 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B AWQ BF16 INT8](/fit/cyankiwi-qwen3-5-27b-awq-bf16-int8)

General · cyankiwi

Q4\_K\_MTight

18.8 GB78% of RAM~10 tok/s28.38B params

Try Q3\_K\_M (16.0 GB, ~12 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 27B AWQ BF16 INT4](/fit/cyankiwi-qwen3-5-27b-awq-bf16-int4)

General · cyankiwi

Q4\_K\_MTight

18.8 GB78% of RAM~10 tok/s28.38B params

Try Q3\_K\_M (16.0 GB, ~12 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[NVIDIA Nemotron 3 Nano 30B A3B](/fit/lmstudio-community-nvidia-nemotron-3-nano-30b-a3b-mlx-4bit)

General · lmstudio-community

mlx-4bitTight

19.9 GB83% of RAM~10 tok/s31.58B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[vllm translategemma 27b it](/fit/infomaniak-ai-vllm-translategemma-27b-it)

General · infomaniak-ai

Q4\_K\_MTight

19.1 GB80% of RAM~10 tok/s28.84B params

Try Q3\_K\_M (16.2 GB, ~12 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[NVIDIA Nemotron 3 Nano 30B A3B FP8](/fit/nvidia-nvidia-nemotron-3-nano-30b-a3b-fp8)

General · nvidia

Q3\_K\_MTight

17.7 GB74% of RAM~11 tok/s31.58B params

Try Q2\_K (14.3 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 Coder 32B Instruct](/fit/qwen-qwen2-5-coder-32b-instruct)

Coding · Alibaba · 2024-11-06

Q3\_K\_MTight

18.4 GB77% of RAM~11 tok/s32.76B params

Try Q2\_K (14.8 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[GLM 4.7 Flash](/fit/zai-org-glm-4-7-flash)

General · zai-org

Q3\_K\_MTight

17.5 GB73% of RAM~11 tok/s31.22B params

Try Q2\_K (14.1 GB, ~15 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Nemotron Cascade 2 30B A3B](/fit/nvidia-nemotron-cascade-2-30b-a3b)

General · nvidia

Q3\_K\_MTight

17.7 GB74% of RAM~11 tok/s31.58B params

Try Q2\_K (14.3 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[NVIDIA Nemotron 3 Nano 30B A3B Base BF16](/fit/nvidia-nvidia-nemotron-3-nano-30b-a3b-base-bf16)

General · nvidia

Q3\_K\_MTight

17.7 GB74% of RAM~11 tok/s31.58B params

Try Q2\_K (14.3 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[GLM 4.7 Flash AWQ](/fit/quanttrio-glm-4-7-flash-awq)

General · quanttrio

Q3\_K\_MTight

17.5 GB73% of RAM~11 tok/s31.22B params

Try Q2\_K (14.1 GB, ~15 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[ERNIE 4.5 VL 28B A3B PT](/fit/baidu-ernie-4-5-vl-28b-a3b-pt)

General · baidu

Q4\_K\_MTight

19.5 GB81% of RAM~10 tok/s29.4B params

Try Q3\_K\_M (16.5 GB, ~12 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 Coder 32B](/fit/qwen-qwen2-5-coder-32b)

Coding · Alibaba

Q3\_K\_MTight

18.4 GB77% of RAM~11 tok/s32.76B params

Try Q2\_K (14.8 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[ERNIE 4.5 VL 28B A3B Thinking](/fit/baidu-ernie-4-5-vl-28b-a3b-thinking)

General · baidu

Q4\_K\_MTight

19.6 GB82% of RAM~10 tok/s29.66B params

Try Q3\_K\_M (16.7 GB, ~12 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 4 31B it heretic v2](/fit/momix-44-gemma-4-31b-it-heretic-v2)

General · momix-44

Q3\_K\_MTight

17.6 GB73% of RAM~11 tok/s31.27B params

Try Q2\_K (14.1 GB, ~15 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 4 31B it AWQ](/fit/quanttrio-gemma-4-31b-it-awq)

General · quanttrio

Q3\_K\_MTight

17.6 GB73% of RAM~11 tok/s31.27B params

Try Q2\_K (14.1 GB, ~15 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 4 31B it FP8 block](/fit/redhatai-gemma-4-31b-it-fp8-block)

General · redhatai

Q3\_K\_MTight

17.6 GB73% of RAM~11 tok/s31.27B params

Try Q2\_K (14.1 GB, ~15 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 3 27b it](/fit/google-gemma-3-27b-it)

General · Google · 2025-03-01

Q4\_K\_MTight

18.2 GB76% of RAM~11 tok/s27.43B params

Try Q3\_K\_M (15.5 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[GLM 4.7 Flash AWQ 4bit](/fit/cyankiwi-glm-4-7-flash-awq-4bit)

General · cyankiwi

Q3\_K\_MTight

18.0 GB75% of RAM~11 tok/s32.14B params

Try Q2\_K (14.5 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 4 31B it AWQ 4bit](/fit/cyankiwi-gemma-4-31b-it-awq-4bit)

General · cyankiwi

Q3\_K\_MTight

18.1 GB75% of RAM~11 tok/s32.19B params

Try Q2\_K (14.5 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[EXAONE 4.0 32B FP8](/fit/lgai-exaone-exaone-4-0-32b-fp8)

General · lgai-exaone

Q3\_K\_MTight

18.0 GB75% of RAM~11 tok/s32.01B params

Try Q2\_K (14.4 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[EXAONE 4.0.1 32B](/fit/lgai-exaone-exaone-4-0-1-32b)

General · lgai-exaone

Q3\_K\_MTight

18.0 GB75% of RAM~11 tok/s32B params

Try Q2\_K (14.4 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 4 31B it AWQ 8bit](/fit/cyankiwi-gemma-4-31b-it-awq-8bit)

General · cyankiwi

Q3\_K\_MTight

18.1 GB75% of RAM~11 tok/s32.19B params

Try Q2\_K (14.5 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 32B](/fit/qwen-qwen2-5-32b)

General · Alibaba

Q3\_K\_MTight

18.4 GB77% of RAM~11 tok/s32.76B params

Try Q2\_K (14.8 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Baichuan M2 32B](/fit/baichuan-inc-baichuan-m2-32b)

General · baichuan-inc

Q3\_K\_MTight

18.4 GB77% of RAM~11 tok/s32.76B params

Try Q2\_K (14.8 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 4 31B](/fit/google-gemma-4-31b)

General · Google

Q3\_K\_MTight

18.3 GB76% of RAM~11 tok/s32.68B params

Try Q2\_K (14.7 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[gemma 2 27b it](/fit/google-gemma-2-27b-it)

General · Google · 2024-06-24

Q4\_K\_MTight

18.1 GB75% of RAM~11 tok/s27.23B params

Try Q3\_K\_M (15.4 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 32B Thinking](/fit/qwen-qwen3-vl-32b-thinking)

General · Alibaba

Q3\_K\_MTight

18.7 GB78% of RAM~11 tok/s33.36B params

Try Q2\_K (15.0 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[HyperCLOVAX SEED Think 32B](/fit/naver-hyperclovax-hyperclovax-seed-think-32b)

General · naver-hyperclovax

Q3\_K\_MTight

18.7 GB78% of RAM~11 tok/s33.31B params

Try Q2\_K (15.0 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Olmo 3 1125 32B](/fit/allenai-olmo-3-1125-32b)

General · allenai

Q3\_K\_MTight

18.1 GB75% of RAM~11 tok/s32.23B params

Try Q2\_K (14.5 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[karakuri vl 32b thinking 2507 exp](/fit/karakuri-ai-karakuri-vl-32b-thinking-2507-exp)

General · karakuri-ai

Q3\_K\_MTight

18.7 GB78% of RAM~11 tok/s33.45B params

Try Q2\_K (15.1 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 32B AWQ](/fit/qwen-qwen3-32b-awq)

General · Alibaba

Q3\_K\_MTight

18.4 GB77% of RAM~11 tok/s32.76B params

Try Q2\_K (14.8 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[QwQ 32B](/fit/qwen-qwq-32b)

General · Alibaba

Q3\_K\_MTight

18.4 GB77% of RAM~11 tok/s32.76B params

Try Q2\_K (14.8 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 32B FP8 dynamic](/fit/redhatai-qwen3-32b-fp8-dynamic)

General · redhatai

Q3\_K\_MTight

18.4 GB77% of RAM~11 tok/s32.77B params

Try Q2\_K (14.8 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[CodeLlama 34b Instruct hf](/fit/codellama-codellama-34b-instruct-hf)

Coding · codellama

Q3\_K\_MTight

18.9 GB79% of RAM~10 tok/s33.74B params

Try Q2\_K (15.2 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[xLAM 2 32b fc r](/fit/salesforce-xlam-2-32b-fc-r)

General · salesforce

Q3\_K\_MTight

18.4 GB77% of RAM~11 tok/s32.76B params

Try Q2\_K (14.8 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 32B Instruct](/fit/qwen-qwen3-vl-32b-instruct)

Chat · Alibaba

Q3\_K\_MTight

18.7 GB78% of RAM~11 tok/s33.36B params

Try Q2\_K (15.0 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3 VL 32B Instruct AWQ](/fit/quanttrio-qwen3-vl-32b-instruct-awq)

Chat · quanttrio

Q3\_K\_MTight

18.7 GB78% of RAM~11 tok/s33.36B params

Try Q2\_K (15.0 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 VL 32B Instruct](/fit/qwen-qwen2-5-vl-32b-instruct)

Chat · Alibaba

Q3\_K\_MTight

18.7 GB78% of RAM~11 tok/s33.45B params

Try Q2\_K (15.1 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[karakuri vl 32b instruct 2507](/fit/karakuri-ai-karakuri-vl-32b-instruct-2507)

Chat · karakuri-ai

Q3\_K\_MTight

18.7 GB78% of RAM~11 tok/s33.45B params

Try Q2\_K (15.1 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen2.5 32B Instruct AWQ](/fit/qwen-qwen2-5-32b-instruct-awq)

Chat · Alibaba

Q3\_K\_MTight

18.4 GB77% of RAM~11 tok/s32.76B params

Try Q2\_K (14.8 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[OLMo 2 0325 32B Instruct](/fit/allenai-olmo-2-0325-32b-instruct)

Chat · allenai · 2025-03-12

Q3\_K\_MTight

18.1 GB75% of RAM~11 tok/s32.23B params

Try Q2\_K (14.5 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Qwen3.5 40B Claude 4.6 Opus Deckard Heretic Uncensored Thinking](/fit/davidau-qwen3-5-40b-claude-4-6-opus-deckard-heretic-uncensored-thinking)

General · davidau

Q2\_KTight

17.7 GB74% of RAM~12 tok/s39.53B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[dolphin 2.9.1 yi 1.5 34b](/fit/dphn-dolphin-2-9-1-yi-1-5-34b)

General · dphn

Q3\_K\_MTight

19.3 GB80% of RAM~10 tok/s34.39B params

Try Q2\_K (15.5 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL3\_5 30B A3B](/fit/opengvlab-internvl3_5-30b-a3b)

General · opengvlab

Q4\_K\_MTight

20.4 GB85% of RAM~9 tok/s30.85B params

Try Q3\_K\_M (17.3 GB, ~11 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[CodeLlama 34b Instruct hf](/fit/meta-llama-codellama-34b-instruct-hf)

Coding · Meta · 2024-03-14

Q3\_K\_MTight

18.9 GB79% of RAM~10 tok/s33.74B params

Try Q2\_K (15.2 GB, ~14 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llava v1.6 34b](/fit/liuhaotian-llava-v1-6-34b)

General · liuhaotian

Q3\_K\_MTight

19.5 GB81% of RAM~10 tok/s34.75B params

Try Q2\_K (15.6 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[llava v1.6 34b hf](/fit/llava-hf-llava-v1-6-34b-hf)

General · llava-hf

Q3\_K\_MTight

19.5 GB81% of RAM~10 tok/s34.75B params

Try Q2\_K (15.6 GB, ~13 tok/s)

What Mac for this model? [Run with ToolPiper](/toolpiper)

[InternVL3 38B](/fit/opengvlab-internvl3-38b)

General · opengvlab

Q2\_KTight

17.2 GB72% of RAM~12 tok/s38.39B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Skywork R1V 38B](/fit/skywork-skywork-r1v-38b)

Reasoning · skywork

Q2\_KTight

17.2 GB72% of RAM~12 tok/s38.39B params

What Mac for this model? [Run with ToolPiper](/toolpiper)

[Kimi K2.5](/fit/moonshotai-kimi-k2-5)

General · moonshotai · 2026-01-01

Won't Fit

461.6 GB1,923% of RAM1058.59B params

Needs 462+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[gpt oss 120b](/fit/openai-gpt-oss-120b)

General · openai

Won't Fit

52.9 GB221% of RAM120.41B params

Needs 53+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[DeepSeek R1](/fit/deepseek-ai-deepseek-r1)

Reasoning · DeepSeek · 2025-01-20

Won't Fit

298.6 GB1,244% of RAM684.53B params

Needs 299+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[GLM 5 FP8](/fit/zai-org-glm-5-fp8)

General · zai-org

Won't Fit

328.9 GB1,370% of RAM753.91B params

Needs 329+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[NVIDIA Nemotron 3 Super 120B A12B NVFP4](/fit/nvidia-nvidia-nemotron-3-super-120b-a12b-nvfp4)

General · nvidia

Won't Fit

29.8 GB124% of RAM67.23B params

Needs 30+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[DeepSeek V3.2](/fit/deepseek-ai-deepseek-v3-2)

General · DeepSeek · 2025-12-01

Won't Fit

299.0 GB1,246% of RAM685.4B params

Needs 299+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen3 Coder Next FP8](/fit/qwen-qwen3-coder-next-fp8)

Coding · Alibaba

Won't Fit

35.2 GB147% of RAM79.68B params

Needs 35+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[NVIDIA Nemotron 3 Super 120B A12B FP8](/fit/nvidia-nvidia-nemotron-3-super-120b-a12b-fp8)

General · nvidia

Won't Fit

54.3 GB226% of RAM123.61B params

Needs 54+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Llama 3.1 70B Instruct](/fit/meta-llama-llama-3-1-70b-instruct)

Chat · Meta · 2024-07-16

Won't Fit

31.2 GB130% of RAM70.55B params

Needs 31+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen3.5 397B A17B](/fit/qwen-qwen3-5-397b-a17b)

General · Alibaba · 2026-02-16

Won't Fit

176.2 GB734% of RAM403.4B params

Needs 176+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen3.5 397B A17B FP8](/fit/qwen-qwen3-5-397b-a17b-fp8)

General · Alibaba

Won't Fit

176.2 GB734% of RAM403.42B params

Needs 176+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen3.5 122B A10B](/fit/qwen-qwen3-5-122b-a10b)

General · Alibaba · 2026-02-24

Won't Fit

55.0 GB229% of RAM125.09B params

Needs 55+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen3.5 122B A10B FP8](/fit/qwen-qwen3-5-122b-a10b-fp8)

General · Alibaba

Won't Fit

55.0 GB229% of RAM125.09B params

Needs 55+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[DeepSeek R1 0528](/fit/deepseek-ai-deepseek-r1-0528)

Reasoning · DeepSeek

Won't Fit

298.6 GB1,244% of RAM684.53B params

Needs 299+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen3 Coder Next](/fit/qwen-qwen3-coder-next)

Coding · Alibaba · 2026-01-30

Won't Fit

35.2 GB147% of RAM79.67B params

Needs 35+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen2.5 72B Instruct](/fit/qwen-qwen2-5-72b-instruct)

Chat · Alibaba · 2024-09-16

Won't Fit

32.2 GB134% of RAM72.71B params

Needs 32+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[L3.3 GeneticLemonade Final v2 70B](/fit/zerofata-l3-3-geneticlemonade-final-v2-70b)

General · zerofata

Won't Fit

31.2 GB130% of RAM70.55B params

Needs 31+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[DeepSeek V3](/fit/deepseek-ai-deepseek-v3)

General · DeepSeek · 2024-12-25

Won't Fit

298.6 GB1,244% of RAM684.53B params

Needs 299+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[MiniMax M2.5](/fit/minimaxai-minimax-m2-5)

General · minimaxai · 2026-02-12

Won't Fit

100.1 GB417% of RAM228.7B params

Needs 100+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen3 235B A22B](/fit/qwen-qwen3-235b-a22b)

General · Alibaba · 2025-04-27

Won't Fit

102.9 GB429% of RAM235.09B params

Needs 103+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[DeepSeek R1 0528 NVFP4 v2](/fit/nvidia-deepseek-r1-0528-nvfp4-v2)

Reasoning · nvidia

Won't Fit

171.9 GB716% of RAM393.63B params

Needs 172+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen3 Next 80B A3B Instruct](/fit/qwen-qwen3-next-80b-a3b-instruct)

Chat · Alibaba

Won't Fit

35.9 GB150% of RAM81.32B params

Needs 36+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Llama 3.3 70B Instruct AWQ](/fit/kosbu-llama-3-3-70b-instruct-awq)

Chat · kosbu

Won't Fit

31.2 GB130% of RAM70.55B params

Needs 31+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[DeepSeek V3 0324](/fit/deepseek-ai-deepseek-v3-0324)

General · DeepSeek

Won't Fit

298.6 GB1,244% of RAM684.53B params

Needs 299+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen3 Coder Next 8bit](/fit/nexveridian-qwen3-coder-next-8bit)

Coding · nexveridian

Won't Fit

35.2 GB147% of RAM79.67B params

Needs 35+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Mixtral 8x7B Instruct v0.1](/fit/mistralai-mixtral-8x7b-instruct-v0-1)

Chat · Mistral AI · 2023-12-10

Won't Fit

20.8 GB87% of RAM46.7B params

Needs 21+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[llama 3.3 70b instruct awq](/fit/casperhansen-llama-3-3-70b-instruct-awq)

Chat · casperhansen

Won't Fit

31.2 GB130% of RAM70.55B params

Needs 31+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen3 VL 235B A22B Instruct](/fit/qwen-qwen3-vl-235b-a22b-instruct)

Chat · Alibaba

Won't Fit

103.1 GB430% of RAM235.67B params

Needs 103+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen2.5 72B Instruct abliterated](/fit/huihui-ai-qwen2-5-72b-instruct-abliterated)

Chat · huihui-ai

Won't Fit

32.2 GB134% of RAM72.71B params

Needs 32+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Llama 3.3 70B Instruct](/fit/meta-llama-llama-3-3-70b-instruct)

Chat · Meta · 2024-11-26

Won't Fit

31.2 GB130% of RAM70.55B params

Needs 31+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[GLM 4.5 Air](/fit/zai-org-glm-4-5-air)

General · zai-org

Won't Fit

48.6 GB203% of RAM110.47B params

Needs 49+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[GLM 5](/fit/zai-org-glm-5)

General · zai-org · 2026-02-11

Won't Fit

328.8 GB1,370% of RAM753.86B params

Needs 329+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen3 VL 235B A22B Thinking](/fit/qwen-qwen3-vl-235b-a22b-thinking)

General · Alibaba

Won't Fit

103.1 GB430% of RAM235.67B params

Needs 103+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen3.5 122B A10B NVFP4](/fit/sehyo-qwen3-5-122b-a10b-nvfp4)

General · sehyo

Won't Fit

31.5 GB131% of RAM71.22B params

Needs 32+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Step 3.5 Flash FP8](/fit/stepfun-ai-step-3-5-flash-fp8)

General · stepfun-ai

Won't Fit

87.3 GB364% of RAM199.4B params

Needs 87+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[NVIDIA Nemotron 3 Super 120B A12B BF16](/fit/nvidia-nvidia-nemotron-3-super-120b-a12b-bf16)

General · nvidia

Won't Fit

54.3 GB226% of RAM123.61B params

Needs 54+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Llama 3.1 405B](/fit/meta-llama-llama-3-1-405b)

General · Meta

Won't Fit

177.3 GB739% of RAM405.85B params

Needs 177+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen3 235B A22B Instruct 2507 FP8](/fit/qwen-qwen3-235b-a22b-instruct-2507-fp8)

Chat · Alibaba

Won't Fit

102.9 GB429% of RAM235.11B params

Needs 103+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen3 Next 80B A3B Instruct FP8](/fit/qwen-qwen3-next-80b-a3b-instruct-fp8)

Chat · Alibaba

Won't Fit

35.9 GB150% of RAM81.33B params

Needs 36+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen3.5 122B A10B NVFP4](/fit/txn545-qwen3-5-122b-a10b-nvfp4)

General · txn545

Won't Fit

28.5 GB119% of RAM64.35B params

Needs 29+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Meta Llama 3 70B](/fit/meta-llama-meta-llama-3-70b)

General · Meta

Won't Fit

31.2 GB130% of RAM70.55B params

Needs 31+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Llama 3\_3 Nemotron Super 49B v1\_5](/fit/nvidia-llama-3_3-nemotron-super-49b-v1_5)

General · nvidia

Won't Fit

22.2 GB93% of RAM49.87B params

Needs 22+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Llama 3.1 405B Instruct](/fit/meta-llama-llama-3-1-405b-instruct)

Chat · Meta · 2024-07-16

Won't Fit

177.3 GB739% of RAM405.85B params

Needs 177+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[XORTRON.CriminalComputing.LARGE.2026.3](/fit/darkc0de-xortron-criminalcomputing-large-2026-3)

General · darkc0de

Won't Fit

53.9 GB225% of RAM122.61B params

Needs 54+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[jais adapted 70b chat 4bit bnb](/fit/inceptionai-jais-adapted-70b-chat-4bit-bnb)

Chat · inceptionai

Won't Fit

31.7 GB132% of RAM71.64B params

Needs 32+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[LLaDA2.1 flash](/fit/inclusionai-llada2-1-flash)

General · inclusionai

Won't Fit

45.3 GB189% of RAM102.89B params

Needs 45+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[GLM 4.7](/fit/zai-org-glm-4-7)

General · zai-org

Won't Fit

156.6 GB652% of RAM358.34B params

Needs 157+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[sarvam 105b uncensored](/fit/aoxo-sarvam-105b-uncensored)

General · aoxo

Won't Fit

24.8 GB103% of RAM55.73B params

Needs 25+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen3 VL 235B A22B Instruct FP8](/fit/qwen-qwen3-vl-235b-a22b-instruct-fp8)

Chat · Alibaba

Won't Fit

103.1 GB430% of RAM235.68B params

Needs 103+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[GLM 5 NVFP4](/fit/nvidia-glm-5-nvfp4)

General · nvidia

Won't Fit

190.1 GB792% of RAM435.24B params

Needs 190+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[step3](/fit/stepfun-ai-step3)

General · stepfun-ai

Won't Fit

140.3 GB585% of RAM320.97B params

Needs 140+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Step 3.5 Flash](/fit/stepfun-ai-step-3-5-flash)

General · stepfun-ai

Won't Fit

87.3 GB364% of RAM199.38B params

Needs 87+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[deepseek coder v2 instruct awq](/fit/casperhansen-deepseek-coder-v2-instruct-awq)

Coding · casperhansen

Won't Fit

103.2 GB430% of RAM235.74B params

Needs 103+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[gpt oss 120b heretic](/fit/kldzj-gpt-oss-120b-heretic)

General · kldzj

Won't Fit

51.4 GB214% of RAM116.83B params

Needs 51+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Meta Llama 3.3 70B Instruct AWQ INT4](/fit/ibnzterrell-meta-llama-3-3-70b-instruct-awq-int4)

Chat · ibnzterrell

Won't Fit

31.2 GB130% of RAM70.55B params

Needs 31+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Llama 3.1 70B](/fit/meta-llama-llama-3-1-70b)

General · Meta

Won't Fit

31.2 GB130% of RAM70.55B params

Needs 31+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Kimi K2 Instruct](/fit/moonshotai-kimi-k2-instruct)

Chat · moonshotai · 2025-07-11

Won't Fit

447.6 GB1,865% of RAM1026.47B params

Needs 448+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen3 Coder 480B A35B Instruct](/fit/qwen-qwen3-coder-480b-a35b-instruct)

Coding · Alibaba · 2025-07-22

Won't Fit

209.6 GB873% of RAM480.15B params

Needs 210+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen2.5 VL 72B Instruct](/fit/qwen-qwen2-5-vl-72b-instruct)

Chat · Alibaba

Won't Fit

32.5 GB135% of RAM73.41B params

Needs 32+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Kimi K2 Instruct 0905](/fit/moonshotai-kimi-k2-instruct-0905)

Chat · moonshotai

Won't Fit

447.6 GB1,865% of RAM1026.47B params

Needs 448+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen2 72B Instruct](/fit/qwen-qwen2-72b-instruct)

Chat · Alibaba

Won't Fit

32.2 GB134% of RAM72.71B params

Needs 32+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[MiniMax M2.5](/fit/lmstudio-community-minimax-m2-5-mlx-8bit)

General · lmstudio-community

Won't Fit

141.3 GB589% of RAM228.69B params

Needs 141+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[MiniMax M2](/fit/minimaxai-minimax-m2)

General · minimaxai

Won't Fit

100.1 GB417% of RAM228.7B params

Needs 100+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Llama 4 Maverick 17B 128E Instruct FP8](/fit/meta-llama-llama-4-maverick-17b-128e-instruct-fp8)

Chat · Meta

Won't Fit

175.4 GB731% of RAM401.65B params

Needs 175+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[GLM 4.5](/fit/zai-org-glm-4-5)

General · zai-org

Won't Fit

156.6 GB652% of RAM358.34B params

Needs 157+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Meta Llama 3 70B Instruct](/fit/meta-llama-meta-llama-3-70b-instruct)

Chat · Meta

Won't Fit

31.2 GB130% of RAM70.55B params

Needs 31+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen3.5 397B A17B](/fit/lmstudio-community-qwen3-5-397b-a17b-mlx-8bit)

General · lmstudio-community

Won't Fit

69.4 GB289% of RAM111.93B params

Needs 69+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Intern S1](/fit/internlm-intern-s1)

General · internlm

Won't Fit

105.3 GB439% of RAM240.71B params

Needs 105+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[gpt oss 120b](/fit/lmstudio-community-gpt-oss-120b-mlx-8bit)

General · lmstudio-community

Won't Fit

72.4 GB302% of RAM116.83B params

Needs 72+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen3.5 122B A10B AWQ](/fit/quanttrio-qwen3-5-122b-a10b-awq)

General · quanttrio

Won't Fit

55.0 GB229% of RAM125.09B params

Needs 55+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Kimi Linear 48B A3B Instruct](/fit/moonshotai-kimi-linear-48b-a3b-instruct)

Chat · moonshotai

Won't Fit

21.9 GB91% of RAM49.12B params

Needs 22+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[DeepSeek V3 0324 NVFP4](/fit/nvidia-deepseek-v3-0324-nvfp4)

General · nvidia

Won't Fit

173.3 GB722% of RAM396.77B params

Needs 173+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[DeepSeek V2.5 1210 FP8](/fit/redhatai-deepseek-v2-5-1210-fp8)

General · redhatai

Won't Fit

103.2 GB430% of RAM235.74B params

Needs 103+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[MiMo V2 Flash](/fit/xiaomimimo-mimo-v2-flash)

General · xiaomimimo · 2025-12-16

Won't Fit

135.4 GB564% of RAM309.79B params

Needs 135+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Kimi K2 Thinking](/fit/moonshotai-kimi-k2-thinking)

General · moonshotai

Won't Fit

461.3 GB1,922% of RAM1058.12B params

Needs 461+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen2.5 72B](/fit/qwen-qwen2-5-72b)

General · Alibaba

Won't Fit

32.2 GB134% of RAM72.71B params

Needs 32+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[MiniMax M2.5 AWQ](/fit/quanttrio-minimax-m2-5-awq)

General · quanttrio

Won't Fit

100.1 GB417% of RAM228.69B params

Needs 100+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[xLAM 8x7b r](/fit/salesforce-xlam-8x7b-r)

General · salesforce

Won't Fit

20.8 GB87% of RAM46.7B params

Needs 21+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen3 235B A22B FP8](/fit/qwen-qwen3-235b-a22b-fp8)

General · Alibaba

Won't Fit

102.9 GB429% of RAM235.11B params

Needs 103+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Seed OSS 36B Instruct](/fit/lmstudio-community-seed-oss-36b-instruct-mlx-8bit)

Chat · lmstudio-community

Won't Fit

22.8 GB95% of RAM36.15B params

Needs 23+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[LongCat Flash Chat](/fit/meituan-longcat-longcat-flash-chat)

Chat · meituan-longcat

Won't Fit

245.2 GB1,022% of RAM561.86B params

Needs 245+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[command a vision 07 2025](/fit/coherelabs-command-a-vision-07-2025)

General · coherelabs

Won't Fit

49.2 GB205% of RAM111.87B params

Needs 49+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[GLM 4.5V](/fit/zai-org-glm-4-5v)

General · zai-org

Won't Fit

47.4 GB198% of RAM107.71B params

Needs 47+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Llama 3\_3 Nemotron Super 49B v1\_5 FP8](/fit/nvidia-llama-3_3-nemotron-super-49b-v1_5-fp8)

General · nvidia

Won't Fit

22.2 GB93% of RAM49.87B params

Needs 22+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[DeepSeek V3.2 NVFP4](/fit/nvidia-deepseek-v3-2-nvfp4)

General · nvidia

Won't Fit

172.3 GB718% of RAM394.5B params

Needs 172+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Llama 4 Scout 17B 16E Instruct FP8 dynamic](/fit/redhatai-llama-4-scout-17b-16e-instruct-fp8-dynamic)

Chat · redhatai

Won't Fit

47.8 GB199% of RAM108.66B params

Needs 48+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen2 VL 72B Instruct](/fit/qwen-qwen2-vl-72b-instruct)

Chat · Alibaba

Won't Fit

32.5 GB135% of RAM73.41B params

Needs 32+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[MiniMax M2.1](/fit/minimaxai-minimax-m2-1)

General · minimaxai

Won't Fit

100.1 GB417% of RAM228.7B params

Needs 100+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Llama 3\_3 Nemotron Super 49B v1](/fit/nvidia-llama-3_3-nemotron-super-49b-v1)

General · nvidia

Won't Fit

22.2 GB93% of RAM49.87B params

Needs 22+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Mixtral 8x22B Instruct v0.1](/fit/mistralai-mixtral-8x22b-instruct-v0-1)

Chat · Mistral AI · 2024-04-16

Won't Fit

61.7 GB257% of RAM140.63B params

Needs 62+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Meta Llama 3.1 70B Instruct FP8](/fit/redhatai-meta-llama-3-1-70b-instruct-fp8)

Chat · redhatai

Won't Fit

31.2 GB130% of RAM70.55B params

Needs 31+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen3 Next 80B A3B Instruct](/fit/lmstudio-community-qwen3-next-80b-a3b-instruct-mlx-4bit)

Chat · lmstudio-community

Won't Fit

49.5 GB206% of RAM79.67B params

Needs 50+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen2 72B](/fit/qwen-qwen2-72b)

General · Alibaba

Won't Fit

32.2 GB134% of RAM72.71B params

Needs 32+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Llama 3.3 70B Instruct FP8 dynamic](/fit/redhatai-llama-3-3-70b-instruct-fp8-dynamic)

Chat · redhatai

Won't Fit

31.2 GB130% of RAM70.56B params

Needs 31+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[K EXAONE 236B A23B](/fit/lgai-exaone-k-exaone-236b-a23b)

General · lgai-exaone

Won't Fit

103.8 GB432% of RAM237.1B params

Needs 104+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[MiniMax M2.5 NVFP4](/fit/nvidia-minimax-m2-5-nvfp4)

General · nvidia

Won't Fit

51.2 GB213% of RAM116.35B params

Needs 51+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Llama 4 Maverick 17B 128E Instruct FP8](/fit/redhatai-llama-4-maverick-17b-128e-instruct-fp8)

Chat · redhatai

Won't Fit

175.4 GB731% of RAM401.65B params

Needs 175+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Ring 2.5 1T](/fit/inclusionai-ring-2-5-1t)

General · inclusionai

Won't Fit

441.5 GB1,839% of RAM1012.47B params

Needs 441+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Llama 4 Maverick 17B 128E Instruct](/fit/meta-llama-llama-4-maverick-17b-128e-instruct)

Chat · Meta · 2025-04-01

Won't Fit

175.4 GB731% of RAM401.58B params

Needs 175+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Llama 4 Scout 17B 16E Instruct](/fit/redhatai-llama-4-scout-17b-16e-instruct)

Chat · redhatai

Won't Fit

47.8 GB199% of RAM108.64B params

Needs 48+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[dots.llm1.inst](/fit/rednote-hilab-dots-llm1-inst)

General · rednote-hilab · 2025-05-14

Won't Fit

62.7 GB261% of RAM142.77B params

Needs 63+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Nous Hermes 2 Mixtral 8x7B DPO](/fit/nousresearch-nous-hermes-2-mixtral-8x7b-dpo)

General · NousResearch · 2024-01-11

Won't Fit

20.8 GB87% of RAM46.7B params

Needs 21+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Llama 3.2 90B Vision Instruct](/fit/meta-llama-llama-3-2-90b-vision-instruct)

Chat · Meta

Won't Fit

39.1 GB163% of RAM88.59B params

Needs 39+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[bloom](/fit/bigscience-bloom)

General · bigscience · 2022-05-19

Won't Fit

77.3 GB322% of RAM176.25B params

Needs 77+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Molmo 72B 0924](/fit/allenai-molmo-72b-0924)

General · allenai

Won't Fit

32.4 GB135% of RAM73.31B params

Needs 32+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[Qwen3.5 397B A17B MXFP4](/fit/amd-qwen3-5-397b-a17b-mxfp4)

General · amd

Won't Fit

97.3 GB405% of RAM222.2B params

Needs 97+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[GLM 4.6V NVFP4](/fit/gadflyii-glm-4-6v-nvfp4)

General · gadflyii

Won't Fit

27.3 GB114% of RAM61.52B params

Needs 27+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[falcon 180B chat](/fit/tiiuae-falcon-180b-chat)

Chat · TII · 2023-09-04

Won't Fit

78.7 GB328% of RAM179.52B params

Needs 79+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

[ERNIE 4.5 300B A47B Paddle](/fit/baidu-ernie-4-5-300b-a47b-paddle)

General · baidu · 2025-06-28

Won't Fit

131.4 GB547% of RAM300.47B params

Needs 131+ GB — try a Mac with more RAM

What Mac for this model? [Needs more RAM](/toolpiper)

Model not in the list? Paste a HuggingFace URL or ID for an instant fit check.

 Check Fit

## Frequently Asked Questions

### How much RAM do I need to run LLMs on a Mac?

It depends on the model size and quantization. A 7B parameter model at Q4 quantization needs about 5 GB of RAM, while a 70B model needs 40+ GB. Apple Silicon Macs use unified memory, so your entire RAM pool is available for model weights — no separate VRAM required.

### Can I run a 70B model on a MacBook Air?

Not comfortably. A 70B model at Q4 quantization needs about 40 GB of RAM. The MacBook Air maxes out at 24-32 GB depending on the generation. You'd need a Mac Studio or MacBook Pro with 48+ GB for a 70B model to run well.

### What's the fastest LLM I can run on my Mac?

Speed depends on your chip's memory bandwidth and the model size. Smaller models (3-7B) run fastest — expect 40-70+ tokens per second on M2 Pro or better. Use the calculator above to see estimated speeds for your specific Mac.

### What does quantization mean for model quality?

Quantization reduces model precision to use less memory. Q8 (8-bit) is nearly lossless. Q4 (4-bit) reduces memory by ~75% with minor quality loss — it's the sweet spot for most users. Q2 (2-bit) saves the most memory but noticeably degrades output quality.

### How is Apple Silicon different from NVIDIA for LLMs?

Apple Silicon uses unified memory — CPU and GPU share the same RAM pool. A Mac with 32 GB can load a 28 GB model directly. On NVIDIA systems, you're limited by GPU VRAM (typically 8-24 GB on consumer cards), even if the PC has 64 GB of system RAM.

### Does ToolPiper use GPU or CPU for inference?

ToolPiper uses Metal GPU acceleration via llama.cpp for LLM inference on Apple Silicon. The GPU and CPU share unified memory, so there's no data transfer overhead. The Neural Engine (ANE) is used for specific tasks like super-resolution and pose detection.

### Can I run multiple models at the same time?

Yes, if you have enough RAM. ToolPiper manages model loading and can keep multiple models in memory simultaneously. When memory gets tight, it automatically evicts the least recently used model to make room for a new one.

### What's the difference between GGUF and other formats?

GGUF is the standard format for running quantized models with llama.cpp (and ToolPiper). It supports all quantization levels and runs on CPU+GPU. MLX is Apple's format optimized for Apple Silicon. AWQ and GPTQ are NVIDIA-focused formats that don't run natively on Mac.

### Quick Stats

Total RAM24 GB

Bandwidth307 GB/s

Models that fit757

Best coding

Qwen3 Coder 30B A3B Instruct gptq 8bit

Largest that fits39.53B Q2\_K

[Get ToolPiper — Free](/toolpiper)

Model database updated: 2026-04-14 · 866 models

My Connections

### AI Providers

No providers yet. Click + to add one.

Services

ToolPiper

VisionPiper

AudioPiper
