Check which AI models your Mac can run locally. Apple Silicon optimized — M1, M2, M3, M4. See fit ratings, estimated speed, and recommended quantizations.
General · Alibaba · 2026-02-28
Chat · Alibaba
Chat · redhatai
General · Alibaba · 2026-02-28
General · Alibaba
Chat · redhatai
Reasoning · DeepSeek
Chat · cyankiwi
Coding · btbtyler09
Reasoning · lmstudio-community
General · paddlepaddle
Chat · Liquid AI · 2026-01-06
General · abhishekchohan
General · hmellor
Chat · lmstudio-community
Chat · lmstudio-community
General · Alibaba · 2026-02-28
General · Liquid AI · 2026-01-05
General · lkhl
Chat · stelterlab
General · hmellor
General · titanml
General · lmstudio-community
General · Alibaba · 2026-02-28
Coding · cyankiwi
General · ibm-granite · 2025-09-16
General · cyankiwi
General · Alibaba
Chat · lmstudio-community
Reasoning · lmstudio-community
General · Liquid AI · 2025-10-07
General · Liquid AI · 2025-07-10
General · winninghealth
General · Liquid AI · 2025-07-10
Chat · Alibaba
General · ahczhg
General · Liquid AI · 2025-10-28
General · abaryan
Chat · cyankiwi
Chat · vikhrmodels
General · Liquid AI · 2026-01-05
General · optimum-intel-internal-testing
Coding · shahriarferdoush
General · Liquid AI · 2026-01-20
General · ibm-granite
Reasoning · typhoon-ai
General · Liquid AI · 2025-12-25
General · quanttrio
General · adamlucek
General · lmstudio-community
General · lmstudio-community
General · z-lab
General · Liquid AI · 2025-08-12
Embedding · embedl
General · yujiepan
General · Liquid AI · 2025-07-10
General · apolo13x
General · optimum-intel-internal-testing
General · cyankiwi
General · huihui-ai
General · paddlepaddle
Chat · cyankiwi
General · Liquid AI · 2025-09-22
General · dmusingu
General · cyankiwi
General · Liquid AI · 2025-08-12
General · Liquid AI · 2026-01-04
General · Liquid AI · 2025-08-22
General · Liquid AI · 2025-09-03
General · Liquid AI · 2025-09-30
General · Liquid AI · 2025-09-03
General · Liquid AI · 2025-09-03
General · Liquid AI · 2025-08-25
General · Liquid AI · 2025-09-03
General · Liquid AI · 2026-01-05
General · rednote-hilab
Chat · ai-sage
General · ibm-granite
General · kristaller486
Chat · cyankiwi
General · Alibaba
General · rednote-hilab
General · z-lab
General · ibm-granite · 2025-09-16
General · lgai-exaone · 2025-07-11
General · prism-ml
Reasoning · jackrong
General · Liquid AI · 2025-10-22
General · redhatai
Reasoning · nvidia
General · Alibaba · 2025-04-27
Chat · Alibaba
General · Alibaba · 2025-04-27
Chat · Alibaba
Chat · Alibaba · 2025-01-26
Chat · Alibaba
Chat · Alibaba
General · Alibaba
General · Alibaba
General · Alibaba
General · huggingfacetb · 2025-07-08
General · redhatai
General · farbodtavakkoli
Chat · Microsoft · 2024-08-16
General · farbodtavakkoli
Chat · Alibaba
General · nanbeige
General · Alibaba
General · farbodtavakkoli
General · Alibaba
General · llava-hf
General · opengvlab
General · laap-ai
General · warshanks
General · farbodtavakkoli
Chat · junhowie
General · Alibaba
Chat · gensyn
Chat · Alibaba
General · tiger-lab
General · z-lab
General · jakobhuss
Chat · Alibaba
Chat · lgai-exaone
General · huggingfacetb
General · perceptronai
Chat · redhatai
General · zstanjj
General · perceptronai
General · Alibaba
General · redhatai
General · ibm-granite
General · lmstudio-community
General · salesforce
General · Alibaba
General · lmstudio-community
General · lmstudio-community
General · idea-research
General · lmstudio-community
General · nanonets
General · Alibaba
General · nvidia
General · primeintellect
Chat · redhatai
Chat · Alibaba
General · turing-motors
General · optimum-intel-internal-testing
Chat · namaa-space
Chat · redhatai
General · huihui-ai
General · opengvlab
Chat · huihui-ai
General · opengvlab
General · opengvlab
Chat · prithivmlmods
General · cyankiwi
General · stanfordaimi
General · opengvlab
General · numind
General · ibm-granite
General · thisisiron
Chat · Alibaba
General · Alibaba · 2026-02-27
Chat · Alibaba
Chat · Alibaba
General · lightonai
Coding · Alibaba · 2024-09-18
General · lovedheart
Coding · Alibaba
General · Google · 2026-03-02
General · stelterlab
General · ista-daslab
General · cyankiwi
Reasoning · vikhrmodels
Coding · lmstudio-community
General · opendatalab
General · Alibaba · 2026-02-27
Chat · nvidia
General · datalab-to
General · zju-ai4h
General · tristepin
General · TII
Coding · Alibaba
General · lmstudio-community
Reasoning · lmstudio-community
Chat · huggingfacetb
Chat · Alibaba
General · cyankiwi
General · salesforce
General · quanttrio
Chat · ista-daslab
General · infomaniak-ai
General · lightonai
Chat · huihui-ai
Chat · prithivmlmods
General · huihui-ai
General · dealignai
General · huggingfacetb
Chat · huggingfacetb
General · Alibaba
Chat · huggingfacetb
General · hmellor
Chat · kakaocorp
Coding · Alibaba
Chat · huggingfacetb
General · huggingfacetb
General · huggingfacetb
General · starvector
General · efficient-large-model
General · Alibaba
General · Alibaba
Coding · Alibaba
Chat · huggingfacetb
Chat · h2oai
General · nvidia
General · bytedance-seed
General · opengvlab
General · intel
General · DeepSeek
General · DeepSeek
Chat · Microsoft · 2025-02-24
Coding · cyankiwi
Coding · bullpoint
Coding · BigCode
Coding · lmstudio-community
Reasoning · DeepSeek · 2025-01-20
Reasoning · DeepSeek
General · cyankiwi
Chat · servicenow-ai
Coding · BigCode
Reasoning · stevenhh2000
Chat · Meta
General · Meta · 2024-09-18
General · opengvlab
General · Microsoft
General · h2oai
General · h2oai
General · Alibaba
General · ibm-research
Chat · Microsoft
General · Google
General · opengvlab
General · llava-hf
General · allenai
General · Google · 2024-07-16
Chat · opengvlab
General · state-spaces
General · bigscience
General · Google
General · opengvlab
Chat · Microsoft
Chat · Alibaba
General · Microsoft
General · generalanalysis
General · optimum-intel-internal-testing
Chat · zyphra
General · Google
General · redhatai
General · Google
General · kitefishai
General · opengvlab
Chat · allenai
General · Meta
General · bigscience
General · opengvlab
General · florence-community
Chat · allenai
Chat · cazzz307
General · Stability AI
General · dealignai
General · happypatrick
General · Google
General · stepfun-ai
General · TII
General · opengvlab
General · Google
General · fal
General · moondream
General · ahmed-masry
Reasoning · momix-44
Chat · DeepSeek
General · facebook
General · zai-org
General · 5cd-ai
Chat · Stability AI · 2024-04-08
General · Liquid AI · 2025-12-18
General · Liquid AI · 2025-08-28
Embedding · Nomic · 2024-02-10
Chat · Meta
Chat · Community · 2023-12-30
General · Meta · 2024-09-18
General · bigscience
General · eleutherai
Chat · DeepSeek
General · eleutherai
Coding · DeepSeek · 2024-06-14
General · DeepSeek
General · rinna
General · Alibaba
General · eleutherai
General · huggingfacetb
General · peft-internal-testing
General · eleutherai
General · ai-sweden-models
General · eleutherai
Reasoning · jackrong
General · eleutherai
General · eleutherai
Chat · nvfp4
Coding · redhatai
General · huggingfacetb
General · jackfram
General · ibm-research
General · Google
Reasoning · jackrong
General · axionml
General · coherelabs
General · kblueleaf
General · eleutherai
General · eleutherai
General · Google
General · Google
Chat · huggingfacetb
General · nm-testing
General · isotr0py
General · kblueleaf
General · eleutherai
General · redhatai
General · opengvlab
Chat · kaitchup
General · Microsoft
Chat · Microsoft · 2024-04-22
Chat · Microsoft
General · salesforce
General · eleutherai
General · eleutherai
General · Microsoft
General · Microsoft
General · Microsoft
General · liautoad
General · aidc-ai
General · gokaygokay
Chat · redhatai
General · gokaygokay
General · opengvlab
General · florence-community
General · opengvlab
General · 5cd-ai
General · opengvlab
Embedding · BAAI · 2023-09-12
Coding · Alibaba · 2024-09-17
General · ggml-org
General · nvidia
General · vamsi
Coding · Alibaba
Chat · opengvlab
General · gokaygokay
General · xenova
General · opengvlab
General · rhoninseiei
Coding · codellama
General · allenai
Coding · DeepSeek
Coding · codellama
Coding · DeepSeek
Chat · cyankiwi
Coding · BigCode · 2024-02-20
General · apolo13x
General · Mistral AI
General · Google · 2026-03-02
General · lmstudio-community
General · redhatai
General · nvidia
Coding · xlabs-ai
General · HuggingFace · 2023-10-26
General · NousResearch
General · salesforce
General · inclusionai
General · salesforce
General · prometheus-eval
General · mlabonne
General · parasail-ai
General · augmxnt
General · inclusionai
General · llava-hf
General · allenai
General · lgai-exaone
General · lmstudio-community
General · allenai
General · swe-bench
General · bytedance-seed
Chat · dream-org
Chat · Alibaba
General · Alibaba
Chat · moonshotai
General · numind
Chat · allenai
General · fancyfeast
Coding · BigCode
General · xiaomimimo
General · moonshotai
General · haochenwang
General · virtue-ai-hub
General · liuhaotian
General · Microsoft
General · llava-hf
General · typhoon-ai
General · Alibaba
General · singh8898
General · allenai
General · mbzuai
General · inclusionai · 2025-02-28
General · Alibaba · 2025-04-27
General · openai
Chat · Mistral AI
Chat · Mistral AI · 2024-05-22
General · Alibaba
General · nvidia
General · nvidia · 2025-08-12
Chat · ibm-granite
Chat · nvidia
General · nvidia
Chat · redhatai
Chat · NousResearch
General · Alibaba
General · cyankiwi
Try Q4_K_M (16.2 GB, ~151 tok/s)
General · Alibaba
General · lmms-lab
General · nvidia
General · xiaomimimo
General · nvidia
General · opengvlab
Chat · thesven
General · sanjiwatsuki
Chat · patronusai
General · Liquid AI · 2026-02-24
Try Q4_K_M (15.9 GB, ~127 tok/s)
Coding · ibm-granite
Chat · redhatai
General · nytopop
Chat · gradientai
Chat · redhatai
Chat · 01.ai · 2023-11-22
Chat · TII · 2024-11-29
General · zju-ai4h
Chat · Alibaba · 2024-09-16
General · Alibaba · 2026-02-27
Chat · Alibaba · 2025-01-26
Chat · Alibaba
General · quanttrio
General · cyankiwi
General · lovedheart
General · huggingfacem4
General · NousResearch
Chat · xcuros
General · openbmb
General · Alibaba · 2026-02-26
General · huihui-ai
General · allenai
Chat · internlm
General · opengvlab
General · tartunlp
General · openbmb
General · gaunernst
General · redhatai
General · jackrong
General · prithivmlmods
General · huihui-ai
General · davidau
General · bytedance-seed
Chat · asi992h
General · cyankiwi
General · allenai
Chat · Alibaba
General · llava-hf
Reasoning · DeepSeek
Chat · Alibaba
General · ilyagusev
General · NousResearch
Chat · xcuros
General · facebook
General · omni-research
General · liuhaotian
General · zai-org
General · nvidia
General · llava-hf
General · redhatai
Chat · thecluster
Chat · huihui-ai
Coding · Meta · 2024-03-13
Chat · Alibaba
General · zai-org
Chat · essentialai
General · stepfun-ai
Chat · thudm · 2024-06-04
General · dengcao
General · openbmb
Reasoning · nvidia
Chat · bsc-lt
General · tiger-lab
Chat · zai-org
General · xtuner
Chat · NousResearch
General · languagebind
Chat · lmms-lab
General · llava-hf
General · Alibaba
General · eleutherai
General · ytu-ce-cosmos
General · salesforce
General · openvla
General · openvla
General · Meta
General · Meta · 2024-07-14
General · opengvlab
General · Meta
Chat · nvidia
Chat · TII · 2023-04-25
General · llava-hf
General · opengvlab
Chat · TII
General · allenai
General · skywork
Chat · huihui-ai
Reasoning · harleywang
Chat · DeepSeek
Chat · Alibaba
General · coherelabs
Chat · lmstudio-community
General · solidrust
Reasoning · Microsoft
General · lightricks
General · redhatai
Chat · Meta · 2024-07-18
Chat · Meta
General · naver-hyperclovax
General · mistral-experimental
General · Google · 2024-06-24
General · Google
General · opengvlab
Chat · opengvlab
General · moondream
Coding · Alibaba · 2024-11-06
General · nvidia
Coding · lmstudio-community
General · nvidia
Coding · Alibaba
Chat · salesforce
Chat · konstantinoskk
Chat · casperhansen
Chat · vagosolutions
General · Meta
Chat · Alibaba
Try Q3_K_M (17.2 GB, ~105 tok/s)
Chat · junhowie
Try Q3_K_M (17.2 GB, ~105 tok/s)
Chat · quanttrio
Try Q3_K_M (17.4 GB, ~142 tok/s)
General · Alibaba
Try Q3_K_M (17.2 GB, ~105 tok/s)
General · aidc-ai
Try Q2_K (14.2 GB, ~133 tok/s)
General · typhoon-ai
Try Q3_K_M (17.2 GB, ~105 tok/s)
Coding · Alibaba
Try Q3_K_M (17.2 GB, ~105 tok/s)
General · k-compression
General · nvidia
Coding · Alibaba
Try Q3_K_M (17.2 GB, ~105 tok/s)
Chat · Meta · 2024-09-18
General · Alibaba
General · Alibaba
Try Q3_K_M (17.4 GB, ~142 tok/s)
General · aoxo
Try Q2_K (14.5 GB, ~150 tok/s)
General · sarvamai
Try Q2_K (14.5 GB, ~150 tok/s)
General · naver-hyperclovax
Chat · Upstage · 2023-12-12
Coding · quanttrio
Try Q3_K_M (17.2 GB, ~105 tok/s)
General · Alibaba
Reasoning · mconcat
General · browser-use
Try Q3_K_M (17.4 GB, ~142 tok/s)
Coding · Meta · 2024-03-13
Chat · speakleash
General · Alibaba
Try Q3_K_M (17.2 GB, ~105 tok/s)
General · Alibaba
Try Q3_K_M (17.2 GB, ~105 tok/s)
General · quixiai
Try Q3_K_M (17.2 GB, ~105 tok/s)
General · nytopop
Try Q3_K_M (17.2 GB, ~105 tok/s)
General · Google
General · cyankiwi
General · Alibaba
General · Alibaba
General · eleutherai
General · lmstudio-community
General · liuhaotian
General · cyankiwi
General · gaunernst
General · llava-hf
General · opengvlab
General · cais
General · redhatai
General · llm-jp
General · redhatai
Chat · Alibaba
Chat · openpipe
General · Alibaba · 2026-02-24
Try Q2_K (16.2 GB, ~152 tok/s)
General · Alibaba
Try Q2_K (16.2 GB, ~159 tok/s)
General · cyankiwi
General · quanttrio
Try Q2_K (16.2 GB, ~159 tok/s)
General · huihui-ai
Try Q2_K (16.2 GB, ~159 tok/s)
General · moonshotai
General · moonshotai
General · huihui-ai
Try Q2_K (16.2 GB, ~159 tok/s)
General · Alibaba
Try Q2_K (16.2 GB, ~159 tok/s)
General · cyankiwi
Reasoning · jackrong
Try Q3_K_M (15.7 GB, ~13 tok/s)
General · nvidia
General · apolo13x
Reasoning · jackrong
Try Q3_K_M (15.7 GB, ~13 tok/s)
Reasoning · mconcat
Try Q3_K_M (15.4 GB, ~13 tok/s)
General · cyankiwi
Try Q2_K (16.5 GB, ~156 tok/s)
Chat · llm-jp
Reasoning · harleywang
Try Q3_K_M (15.4 GB, ~13 tok/s)
Reasoning · codgician
Try Q3_K_M (15.7 GB, ~13 tok/s)
Reasoning · quanttrio
Try Q3_K_M (15.7 GB, ~13 tok/s)
Reasoning · oxzoid
Try Q3_K_M (15.7 GB, ~13 tok/s)
General · deaquay
General · opengvlab
General · axionml
Chat · moonshotai
Coding · coder3101
Try Q3_K_M (15.4 GB, ~13 tok/s)
General · nvidia
Reasoning · codgician
Try Q2_K (16.2 GB, ~159 tok/s)
General · dphn
Try Q4_K_M (15.7 GB, ~12 tok/s)
Reasoning · jackrong
Try Q2_K (16.2 GB, ~159 tok/s)
Chat · stelterlab
Try Q4_K_M (15.7 GB, ~12 tok/s)
General · cerebras
Try Q5_K_M (17.6 GB, ~11 tok/s)
Chat · gghfez
Try Q4_K_M (15.7 GB, ~12 tok/s)
General · Google
General · DeepSeek
General · m-ric
Try Q4_K_M (16.8 GB, ~12 tok/s)
General · rhymes-ai
Try Q4_K_M (16.8 GB, ~12 tok/s)
General · Google · 2026-03-11
Try Q4_K_M (17.6 GB, ~11 tok/s)
General · lmstudio-community
General · cyankiwi
Try Q4_K_M (17.6 GB, ~11 tok/s)
General · Google
Try Q4_K_M (17.6 GB, ~11 tok/s)
General · davidau
Try Q3_K_M (15.4 GB, ~13 tok/s)
Coding · coder3101
Try Q2_K (14.1 GB, ~15 tok/s)
General · jackrong
Try Q3_K_M (15.4 GB, ~13 tok/s)
General · davidau
Try Q3_K_M (15.4 GB, ~13 tok/s)
General · voidful
Try Q3_K_M (15.4 GB, ~13 tok/s)
General · voidful
Try Q3_K_M (15.4 GB, ~13 tok/s)
General · cyankiwi
Try Q4_K_M (17.6 GB, ~11 tok/s)
General · llmfan46
Try Q3_K_M (15.4 GB, ~13 tok/s)
General · Alibaba · 2026-02-24
Try Q3_K_M (15.7 GB, ~13 tok/s)
Reasoning · DeepSeek · 2025-01-20
Try Q2_K (14.8 GB, ~14 tok/s)
General · Alibaba
Try Q3_K_M (15.7 GB, ~13 tok/s)
General · quanttrio
Try Q3_K_M (15.7 GB, ~13 tok/s)
General · huihui-ai
Try Q3_K_M (15.7 GB, ~13 tok/s)
Reasoning · nvidia
Try Q2_K (14.8 GB, ~14 tok/s)
General · redhatai
Try Q3_K_M (15.5 GB, ~13 tok/s)
General · huihui-ai
Try Q3_K_M (15.7 GB, ~13 tok/s)
General · cyankiwi
Try Q3_K_M (16.0 GB, ~12 tok/s)
General · cyankiwi
Try Q3_K_M (16.0 GB, ~12 tok/s)
General · cyankiwi
Try Q3_K_M (16.1 GB, ~12 tok/s)
General · infomaniak-ai
Try Q3_K_M (16.2 GB, ~12 tok/s)
General · lmstudio-community
General · baidu
Try Q3_K_M (16.5 GB, ~12 tok/s)
General · zai-org
Try Q2_K (14.1 GB, ~15 tok/s)
General · quanttrio
Try Q2_K (14.1 GB, ~15 tok/s)
General · baidu
Try Q3_K_M (16.7 GB, ~12 tok/s)
General · momix-44
Try Q2_K (14.1 GB, ~15 tok/s)
General · quanttrio
Try Q2_K (14.1 GB, ~15 tok/s)
General · redhatai
Try Q2_K (14.1 GB, ~15 tok/s)
General · nvidia · 2025-12-04
Try Q2_K (14.3 GB, ~14 tok/s)
General · nvidia
Try Q2_K (14.3 GB, ~14 tok/s)
Coding · Alibaba · 2024-11-06
Try Q2_K (14.8 GB, ~14 tok/s)
General · nvidia
Try Q2_K (14.3 GB, ~14 tok/s)
General · nvidia
Try Q2_K (14.3 GB, ~14 tok/s)
General · lgai-exaone
Try Q2_K (14.4 GB, ~14 tok/s)
Coding · Alibaba
Try Q2_K (14.8 GB, ~14 tok/s)
General · lgai-exaone
Try Q2_K (14.4 GB, ~14 tok/s)
General · lgai-exaone · 2025-07-11
Try Q2_K (14.4 GB, ~14 tok/s)
General · Google · 2026-03-11
Try Q2_K (14.7 GB, ~14 tok/s)
General · Google · 2025-03-01
Try Q3_K_M (15.5 GB, ~13 tok/s)
General · Google · 2024-06-24
Try Q3_K_M (15.4 GB, ~13 tok/s)
General · cyankiwi
Try Q2_K (14.5 GB, ~14 tok/s)
General · cyankiwi
Try Q2_K (14.5 GB, ~14 tok/s)
General · Google
Try Q2_K (14.7 GB, ~14 tok/s)
General · cyankiwi
Try Q2_K (14.5 GB, ~14 tok/s)
General · Alibaba
Try Q2_K (14.8 GB, ~14 tok/s)
General · baichuan-inc
Try Q2_K (14.8 GB, ~14 tok/s)
General · allenai
Try Q2_K (14.5 GB, ~14 tok/s)
General · Alibaba
Try Q2_K (14.8 GB, ~14 tok/s)
General · Alibaba
Try Q2_K (14.8 GB, ~14 tok/s)
General · Alibaba
Try Q2_K (15.0 GB, ~14 tok/s)
General · redhatai
Try Q2_K (14.8 GB, ~14 tok/s)
General · naver-hyperclovax
Try Q2_K (15.0 GB, ~14 tok/s)
Coding · codellama
Try Q2_K (15.2 GB, ~14 tok/s)
General · karakuri-ai
Try Q2_K (15.1 GB, ~14 tok/s)
Chat · Alibaba
Try Q2_K (15.0 GB, ~14 tok/s)
Chat · quanttrio
Try Q2_K (15.0 GB, ~14 tok/s)
Chat · Alibaba
Try Q2_K (15.1 GB, ~14 tok/s)
General · salesforce
Try Q2_K (14.8 GB, ~14 tok/s)
Chat · karakuri-ai
Try Q2_K (15.1 GB, ~14 tok/s)
Chat · Alibaba
Try Q2_K (14.8 GB, ~14 tok/s)
General · dphn
Try Q2_K (15.5 GB, ~13 tok/s)
General · opengvlab
Try Q3_K_M (17.3 GB, ~11 tok/s)
General · davidau
Chat · allenai · 2025-03-12
Try Q2_K (14.5 GB, ~14 tok/s)
Coding · Meta · 2024-03-14
Try Q2_K (15.2 GB, ~14 tok/s)
General · liuhaotian
Try Q2_K (15.6 GB, ~13 tok/s)
General · llava-hf
Try Q2_K (15.6 GB, ~13 tok/s)
Reasoning · skywork
General · opengvlab
General · moonshotai · 2026-01-01
Needs 462+ GB — try a Mac with more RAM
General · openai
Needs 53+ GB — try a Mac with more RAM
Reasoning · DeepSeek · 2025-01-20
Needs 299+ GB — try a Mac with more RAM
General · zai-org
Needs 329+ GB — try a Mac with more RAM
General · nvidia
Needs 30+ GB — try a Mac with more RAM
General · DeepSeek · 2025-12-01
Needs 299+ GB — try a Mac with more RAM
Coding · Alibaba
Needs 35+ GB — try a Mac with more RAM
General · nvidia
Needs 54+ GB — try a Mac with more RAM
Chat · Meta · 2024-07-16
Needs 31+ GB — try a Mac with more RAM
General · Alibaba · 2026-02-16
Needs 176+ GB — try a Mac with more RAM
General · Alibaba
Needs 176+ GB — try a Mac with more RAM
General · Alibaba · 2026-02-24
Needs 55+ GB — try a Mac with more RAM
General · Alibaba
Needs 55+ GB — try a Mac with more RAM
Reasoning · DeepSeek
Needs 299+ GB — try a Mac with more RAM
Coding · Alibaba · 2026-01-30
Needs 35+ GB — try a Mac with more RAM
Chat · Alibaba · 2024-09-16
Needs 32+ GB — try a Mac with more RAM
General · zerofata
Needs 31+ GB — try a Mac with more RAM
General · DeepSeek · 2024-12-25
Needs 299+ GB — try a Mac with more RAM
General · minimaxai · 2026-02-12
Needs 100+ GB — try a Mac with more RAM
General · Alibaba · 2025-04-27
Needs 103+ GB — try a Mac with more RAM
Reasoning · nvidia
Needs 172+ GB — try a Mac with more RAM
Chat · Alibaba
Needs 36+ GB — try a Mac with more RAM
Chat · kosbu
Needs 31+ GB — try a Mac with more RAM
General · DeepSeek
Needs 299+ GB — try a Mac with more RAM
Coding · nexveridian
Needs 35+ GB — try a Mac with more RAM
Chat · Mistral AI · 2023-12-10
Needs 21+ GB — try a Mac with more RAM
Chat · casperhansen
Needs 31+ GB — try a Mac with more RAM
Chat · Alibaba
Needs 103+ GB — try a Mac with more RAM
Chat · huihui-ai
Needs 32+ GB — try a Mac with more RAM
Chat · Meta · 2024-11-26
Needs 31+ GB — try a Mac with more RAM
General · zai-org
Needs 49+ GB — try a Mac with more RAM
General · zai-org · 2026-02-11
Needs 329+ GB — try a Mac with more RAM
General · Alibaba
Needs 103+ GB — try a Mac with more RAM
General · sehyo
Needs 32+ GB — try a Mac with more RAM
General · stepfun-ai
Needs 87+ GB — try a Mac with more RAM
General · nvidia
Needs 54+ GB — try a Mac with more RAM
General · Meta
Needs 177+ GB — try a Mac with more RAM
Chat · Alibaba
Needs 103+ GB — try a Mac with more RAM
Chat · Alibaba
Needs 36+ GB — try a Mac with more RAM
General · txn545
Needs 29+ GB — try a Mac with more RAM
General · Meta
Needs 31+ GB — try a Mac with more RAM
General · nvidia
Needs 22+ GB — try a Mac with more RAM
Chat · Meta · 2024-07-16
Needs 177+ GB — try a Mac with more RAM
General · darkc0de
Needs 54+ GB — try a Mac with more RAM
Chat · inceptionai
Needs 32+ GB — try a Mac with more RAM
General · inclusionai
Needs 45+ GB — try a Mac with more RAM
General · zai-org
Needs 157+ GB — try a Mac with more RAM
General · aoxo
Needs 25+ GB — try a Mac with more RAM
Chat · Alibaba
Needs 103+ GB — try a Mac with more RAM
General · nvidia
Needs 190+ GB — try a Mac with more RAM
General · stepfun-ai
Needs 140+ GB — try a Mac with more RAM
General · stepfun-ai
Needs 87+ GB — try a Mac with more RAM
Coding · casperhansen
Needs 103+ GB — try a Mac with more RAM
General · kldzj
Needs 51+ GB — try a Mac with more RAM
Chat · ibnzterrell
Needs 31+ GB — try a Mac with more RAM
General · Meta
Needs 31+ GB — try a Mac with more RAM
Chat · moonshotai · 2025-07-11
Needs 448+ GB — try a Mac with more RAM
Coding · Alibaba · 2025-07-22
Needs 210+ GB — try a Mac with more RAM
Chat · Alibaba
Needs 32+ GB — try a Mac with more RAM
Chat · moonshotai
Needs 448+ GB — try a Mac with more RAM
Chat · Alibaba
Needs 32+ GB — try a Mac with more RAM
General · lmstudio-community
Needs 141+ GB — try a Mac with more RAM
General · minimaxai
Needs 100+ GB — try a Mac with more RAM
Chat · Meta
Needs 175+ GB — try a Mac with more RAM
General · zai-org
Needs 157+ GB — try a Mac with more RAM
Chat · Meta
Needs 31+ GB — try a Mac with more RAM
General · lmstudio-community
Needs 69+ GB — try a Mac with more RAM
General · internlm
Needs 105+ GB — try a Mac with more RAM
General · lmstudio-community
Needs 72+ GB — try a Mac with more RAM
General · quanttrio
Needs 55+ GB — try a Mac with more RAM
Chat · moonshotai
Needs 22+ GB — try a Mac with more RAM
General · nvidia
Needs 173+ GB — try a Mac with more RAM
General · redhatai
Needs 103+ GB — try a Mac with more RAM
General · xiaomimimo · 2025-12-16
Needs 135+ GB — try a Mac with more RAM
General · moonshotai
Needs 461+ GB — try a Mac with more RAM
General · Alibaba
Needs 32+ GB — try a Mac with more RAM
General · quanttrio
Needs 100+ GB — try a Mac with more RAM
General · salesforce
Needs 21+ GB — try a Mac with more RAM
General · Alibaba
Needs 103+ GB — try a Mac with more RAM
Chat · lmstudio-community
Needs 23+ GB — try a Mac with more RAM
Chat · meituan-longcat
Needs 245+ GB — try a Mac with more RAM
General · coherelabs
Needs 49+ GB — try a Mac with more RAM
General · zai-org
Needs 47+ GB — try a Mac with more RAM
General · nvidia
Needs 22+ GB — try a Mac with more RAM
General · nvidia
Needs 172+ GB — try a Mac with more RAM
Chat · redhatai
Needs 48+ GB — try a Mac with more RAM
Chat · Alibaba
Needs 32+ GB — try a Mac with more RAM
General · minimaxai
Needs 100+ GB — try a Mac with more RAM
General · nvidia
Needs 22+ GB — try a Mac with more RAM
Chat · Mistral AI · 2024-04-16
Needs 62+ GB — try a Mac with more RAM
Chat · redhatai
Needs 31+ GB — try a Mac with more RAM
Chat · lmstudio-community
Needs 50+ GB — try a Mac with more RAM
General · Alibaba
Needs 32+ GB — try a Mac with more RAM
Chat · redhatai
Needs 31+ GB — try a Mac with more RAM
General · lgai-exaone
Needs 104+ GB — try a Mac with more RAM
General · nvidia
Needs 51+ GB — try a Mac with more RAM
Chat · redhatai
Needs 175+ GB — try a Mac with more RAM
General · inclusionai
Needs 441+ GB — try a Mac with more RAM
Chat · Meta · 2025-04-01
Needs 175+ GB — try a Mac with more RAM
Chat · redhatai
Needs 48+ GB — try a Mac with more RAM
General · rednote-hilab · 2025-05-14
Needs 63+ GB — try a Mac with more RAM
General · NousResearch · 2024-01-11
Needs 21+ GB — try a Mac with more RAM
Chat · Meta
Needs 39+ GB — try a Mac with more RAM
General · bigscience · 2022-05-19
Needs 77+ GB — try a Mac with more RAM
General · allenai
Needs 32+ GB — try a Mac with more RAM
General · amd
Needs 97+ GB — try a Mac with more RAM
General · gadflyii
Needs 27+ GB — try a Mac with more RAM
Chat · TII · 2023-09-04
Needs 79+ GB — try a Mac with more RAM
General · baidu · 2025-06-28
Needs 131+ GB — try a Mac with more RAM
Model not in the list? Paste a HuggingFace URL or ID for an instant fit check.
How much RAM you need depends on the model size and quantization. A 7B parameter model at Q4 quantization needs about 5 GB of RAM, while a 70B model needs 40+ GB. Apple Silicon Macs use unified memory, so most of your RAM pool is available for model weights (macOS and other apps still need their share), with no separate VRAM required.
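As a rough illustration of the arithmetic behind those figures, the sketch below estimates RAM from parameter count and bits per weight. The 4.5 effective bits per weight for Q4 and the 1 GB of runtime overhead are assumptions chosen for illustration, not values taken from this tool, and `estimate_ram_gb` is a hypothetical helper name.

```python
def estimate_ram_gb(params_billions: float,
                    bits_per_weight: float,
                    overhead_gb: float = 1.0) -> float:
    """Rough RAM estimate in GB (1 GB = 1e9 bytes).

    params_billions * bits_per_weight / 8 gives the weight size in GB.
    overhead_gb is an assumed allowance for context cache and runtime buffers.
    """
    weights_gb = params_billions * bits_per_weight / 8
    return weights_gb + overhead_gb


# Assumption: Q4 variants average roughly 4.5 bits per weight once
# per-block scale factors are included.
Q4_BITS = 4.5

small = estimate_ram_gb(7, Q4_BITS)    # close to the "about 5 GB" figure
large = estimate_ram_gb(70, Q4_BITS)   # lands in the "40+ GB" range
print(f"7B @ Q4:  {small:.1f} GB")
print(f"70B @ Q4: {large:.1f} GB")
```

Real usage also grows with context length, so treat the result as a lower bound rather than a guarantee.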
A 70B model does not run comfortably on a MacBook Air. At Q4 quantization it needs about 40 GB of RAM, while the MacBook Air maxes out at 16-32 GB depending on the generation. You'd need a Mac Studio or MacBook Pro with 48+ GB for a 70B model to run well.
Speed depends on your chip's memory bandwidth and the model size, because generating each token requires reading the model's weights from memory. Smaller models (3-7B) run fastest: expect 40-70+ tokens per second on M2 Pro or better. Use the calculator above to see estimated speeds for your specific Mac.
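The relationship between bandwidth, model size, and speed can be sketched as a simple ratio. This is a back-of-the-envelope bound, not the calculator's actual formula: it assumes a dense model whose full weights are read once per generated token, and the `efficiency` factor and function name are hypothetical.

```python
def estimate_tokens_per_second(bandwidth_gb_s: float,
                               model_size_gb: float,
                               efficiency: float = 1.0) -> float:
    """Bandwidth-bound estimate of generation speed.

    Assumes every generated token reads all model weights once, so the
    ceiling is bandwidth / model size. efficiency (0..1) is an assumed
    fudge factor for real-world losses.
    """
    if model_size_gb <= 0:
        raise ValueError("model_size_gb must be positive")
    return efficiency * bandwidth_gb_s / model_size_gb


# M2 Pro has 200 GB/s of memory bandwidth; a 7B model at Q4 is about 4 GB.
ceiling = estimate_tokens_per_second(200, 4.0)
print(f"Upper bound: {ceiling:.0f} tok/s")
```

Mixture-of-experts models read only part of their weights per token, so they can exceed what this ratio predicts for their file size.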
Quantization reduces model precision to use less memory. Q8 (8-bit) is nearly lossless. Q4 (4-bit) reduces memory by ~75% relative to 16-bit weights with minor quality loss, which makes it the sweet spot for most users. Q2 (2-bit) saves the most memory but noticeably degrades output quality.
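To make the precision trade-off concrete, here is a minimal sketch that rounds one block of weights to 8, 4, and 2 bits and measures the error. It uses plain symmetric uniform quantization, which is a simplification and not the K-quant scheme that llama.cpp applies; the sample values and function names are made up for illustration.

```python
def quantize_roundtrip(values, bits):
    """Quantize one block to signed `bits`-bit integers and restore it."""
    levels = 2 ** (bits - 1) - 1          # e.g. 7 positive steps for 4-bit
    peak = max(abs(v) for v in values)
    if peak == 0:
        return list(values)
    scale = peak / levels                 # one shared scale per block
    return [round(v / scale) * scale for v in values]


def mean_abs_error(values, bits):
    """Average absolute difference between original and restored values."""
    restored = quantize_roundtrip(values, bits)
    return sum(abs(a - b) for a, b in zip(values, restored)) / len(values)


# Hypothetical block of weights, for illustration only.
block = [0.12, -0.53, 0.98, -0.06, 0.34, -0.81, 0.45, 0.02]

for bits in (8, 4, 2):
    saving = 1 - bits / 16                # relative to 16-bit weights
    err = mean_abs_error(block, bits)
    print(f"{bits}-bit: {saving:.0%} smaller, mean error {err:.4f}")
```

The error grows as the bit width shrinks, which mirrors the quality ordering described above.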
Apple Silicon uses unified memory — CPU and GPU share the same RAM pool. A Mac with 32 GB can load a 28 GB model directly. On NVIDIA systems, you're limited by GPU VRAM (typically 8-24 GB on consumer cards), even if the PC has 64 GB of system RAM.
ToolPiper uses Metal GPU acceleration via llama.cpp for LLM inference on Apple Silicon. The GPU and CPU share unified memory, so there's no data transfer overhead. The Neural Engine (ANE) is used for specific tasks like super-resolution and pose detection.
You can run multiple models at once if you have enough RAM. ToolPiper manages model loading and can keep multiple models in memory simultaneously. When memory gets tight, it automatically evicts the least recently used model to make room for a new one.
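Least-recently-used eviction of this kind can be sketched in a few lines. This is an illustrative model of the idea only, not ToolPiper's implementation; the `ModelCache` class, its methods, and the sizes below are all hypothetical.

```python
from collections import OrderedDict


class ModelCache:
    """Tracks loaded models within a memory budget, evicting by LRU order."""

    def __init__(self, budget_gb: float):
        self.budget_gb = budget_gb
        self._models = OrderedDict()      # name -> size in GB, oldest first

    def used_gb(self) -> float:
        return sum(self._models.values())

    def load(self, name: str, size_gb: float) -> list:
        """Load or touch a model; return the names evicted to make room."""
        if size_gb > self.budget_gb:
            raise MemoryError(f"{name} does not fit in {self.budget_gb} GB")
        if name in self._models:
            self._models.move_to_end(name)    # mark as most recently used
            return []
        evicted = []
        while self.used_gb() + size_gb > self.budget_gb:
            oldest, _ = self._models.popitem(last=False)
            evicted.append(oldest)
        self._models[name] = size_gb
        return evicted


cache = ModelCache(budget_gb=24)
cache.load("chat", 5)
cache.load("coder", 9)
cache.load("chat", 5)                     # touching "chat" makes "coder" the oldest
print(cache.load("vision", 12))           # evicts "coder" to stay within budget
```

An `OrderedDict` keeps insertion order and supports moving an entry to the end, which makes it a convenient way to track recency without a separate timestamp.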
GGUF is the standard format for running quantized models with llama.cpp (and ToolPiper). It supports all quantization levels and runs on CPU+GPU. MLX is Apple's machine-learning framework for Apple Silicon, and models converted for it use their own format. AWQ and GPTQ are NVIDIA-focused quantization formats that don't run natively on Mac.
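If you want to confirm that a downloaded file really is GGUF, its first bytes can be checked. The sketch below assumes the GGUF version 2 or later header layout (4-byte magic `GGUF`, then a little-endian 32-bit version, 64-bit tensor count, and 64-bit metadata count); `parse_gguf_header` is a hypothetical helper, and the demo header is synthesized rather than read from a real model.

```python
import struct

GGUF_HEADER = struct.Struct("<4sIQQ")     # magic, version, tensors, metadata KVs


def parse_gguf_header(data: bytes) -> dict:
    """Parse the fixed-size GGUF header from the first bytes of a file."""
    if len(data) < GGUF_HEADER.size:
        raise ValueError("not enough bytes for a GGUF header")
    magic, version, tensors, kv_count = GGUF_HEADER.unpack_from(data)
    if magic != b"GGUF":
        raise ValueError("missing GGUF magic bytes")
    return {
        "version": version,
        "tensor_count": tensors,
        "metadata_kv_count": kv_count,
    }


# Synthesized header for demonstration; with a real file you would pass
# the first 24 bytes read in binary mode instead.
demo = GGUF_HEADER.pack(b"GGUF", 3, 291, 24)
print(parse_gguf_header(demo))
```

Only the fixed header is read here; the metadata key-value pairs and tensor descriptors that follow it are not decoded.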
Model database updated: 2026-04-14 · 866 models