Showing 866 of 866 models
General · Alibaba · 2026-02-28
General · Alibaba · 2026-02-28
General · Alibaba · 2026-02-28
General · Alibaba · 2026-02-28
Chat · Liquid AI · 2026-01-06
General · Liquid AI · 2026-01-05
General · Liquid AI · 2026-01-05
General · Liquid AI · 2026-01-20
General · Liquid AI · 2025-12-25
General · Liquid AI · 2026-01-04
General · Liquid AI · 2026-01-05
General · Alibaba · 2026-02-27
General · Google · 2026-03-02
General · Alibaba · 2026-02-27
General · Liquid AI · 2025-10-28
General · ibm-granite · 2025-09-16
General · Liquid AI · 2025-10-07
General · Liquid AI · 2025-10-22
General · Liquid AI · 2025-09-22
General · Liquid AI · 2025-08-22
General · Liquid AI · 2025-09-03
General · Liquid AI · 2025-09-30
General · Liquid AI · 2025-09-03
General · Liquid AI · 2025-09-03
General · Liquid AI · 2025-08-25
General · Liquid AI · 2025-09-03
General · ibm-granite · 2025-09-16
General · Liquid AI · 2025-07-10
General · Liquid AI · 2025-07-10
General · Liquid AI · 2025-08-12
General · Liquid AI · 2025-07-10
General · Liquid AI · 2025-08-12
General · lgai-exaone · 2025-07-11
General · huggingfacetb · 2025-07-08
General · Alibaba · 2025-04-27
General · Alibaba · 2025-04-27
General · Alibaba
Reasoning · DeepSeek
Coding · btbtyler09
Reasoning · lmstudio-community
General · paddlepaddle
General · abhishekchohan
General · hmellor
General · lkhl
General · hmellor
General · titanml
General · lmstudio-community
Coding · cyankiwi
General · cyankiwi
General · Alibaba
Reasoning · lmstudio-community
General · winninghealth
General · ahczhg
General · abaryan
General · optimum-intel-internal-testing
Coding · shahriarferdoush
General · ibm-granite
Reasoning · typhoon-ai
General · quanttrio
General · adamlucek
General · lmstudio-community
General · lmstudio-community
General · z-lab
General · yujiepan
General · apolo13x
General · optimum-intel-internal-testing
General · cyankiwi
General · huihui-ai
General · paddlepaddle
General · dmusingu
General · cyankiwi
General · Liquid AI · 2025-12-18
General · rednote-hilab
General · ibm-granite
General · kristaller486
General · Alibaba
General · rednote-hilab
General · z-lab
General · zstanjj
General · prism-ml
General · nanonets
Reasoning · jackrong
General · redhatai
General · stanfordaimi
Reasoning · nvidia
General · Alibaba
General · Alibaba
General · Alibaba
General · redhatai
General · farbodtavakkoli
Reasoning · DeepSeek · 2025-01-20
General · farbodtavakkoli
General · Google · 2026-03-02
General · nanbeige
General · lovedheart
General · Alibaba
General · farbodtavakkoli
General · stelterlab
General · ista-daslab
General · Alibaba
General · llava-hf
General · cyankiwi
General · opengvlab
General · laap-ai
General · warshanks
General · farbodtavakkoli
General · Alibaba
General · tiger-lab
General · z-lab
General · jakobhuss
General · huggingfacetb
General · perceptronai
General · datalab-to
General · perceptronai
General · zju-ai4h
General · Alibaba
General · redhatai
General · ibm-granite
General · lmstudio-community
General · tristepin
General · lmstudio-community
General · salesforce
General · Alibaba
General · lmstudio-community
General · lmstudio-community
General · idea-research
General · cyankiwi
General · lmstudio-community
General · Alibaba
General · nvidia
General · primeintellect
General · quanttrio
General · turing-motors
General · optimum-intel-internal-testing
General · huihui-ai
General · opengvlab
General · infomaniak-ai
General · opengvlab
General · opengvlab
General · cyankiwi
General · huihui-ai
General · opengvlab
General · numind
General · ibm-granite
General · thisisiron
General · dealignai
General · Liquid AI · 2025-08-28
Chat · Alibaba · 2025-01-26
General · lightonai
Coding · Alibaba · 2024-09-18
Coding · Alibaba
Coding · Alibaba
Reasoning · vikhrmodels
Coding · lmstudio-community
General · opendatalab
Coding · Alibaba
General · TII
Coding · Alibaba
Reasoning · lmstudio-community
General · salesforce
General · lightonai
Chat · Alibaba
Chat · redhatai
General · huggingfacetb
Chat · redhatai
General · Alibaba
Chat · cyankiwi
Chat · Microsoft · 2025-02-24
General · hmellor
Chat · lmstudio-community
Chat · lmstudio-community
Reasoning · DeepSeek
Chat · stelterlab
General · huggingfacetb
Chat · cyankiwi
General · huggingfacetb
General · starvector
General · efficient-large-model
General · Alibaba
Chat · lmstudio-community
Coding · lmstudio-community
General · Alibaba
Chat · Alibaba
General · nvidia
Chat · cyankiwi
General · bytedance-seed
Chat · vikhrmodels
Reasoning · stevenhh2000
Embedding · embedl
Chat · cyankiwi
General · opengvlab
General · intel
General · Alibaba · 2026-02-27
General · DeepSeek
General · DeepSeek
Coding · cyankiwi
Chat · ai-sage
Coding · bullpoint
Reasoning · jackrong
General · Alibaba · 2026-02-26
Coding · BigCode
Chat · redhatai
Reasoning · jackrong
General · Liquid AI · 2026-02-24
Try Q4_K_M (15.9 GB, ~127 tok/s)
Chat · huihui-ai
Reasoning · momix-44
Chat · Alibaba
Chat · Alibaba
Chat · Alibaba
Chat · Alibaba
Chat · Alibaba
Chat · Alibaba
Chat · Microsoft · 2024-08-16
Chat · Alibaba
Chat · junhowie
General · cyankiwi
Chat · gensyn
Chat · Alibaba
Chat · nvidia
Chat · Alibaba
Chat · lgai-exaone
General · dealignai
Chat · Alibaba
Coding · BigCode
Chat · redhatai
Chat · Alibaba
Chat · ista-daslab
Chat · namaa-space
Chat · redhatai
Chat · huihui-ai
Chat · prithivmlmods
Chat · prithivmlmods
Chat · Alibaba
General · Meta · 2024-09-18
General · Meta · 2024-09-18
General · opengvlab
General · Microsoft
General · h2oai
General · h2oai
General · Alibaba
General · ibm-research
General · Google
General · opengvlab
General · llava-hf
General · nvidia · 2025-08-12
General · allenai
General · Google · 2024-07-16
Chat · kakaocorp
General · state-spaces
General · bigscience
General · Google
General · opengvlab
General · Microsoft
General · generalanalysis
General · optimum-intel-internal-testing
General · Google
General · redhatai
General · Google
General · kitefishai
General · opengvlab
General · Meta
General · bigscience
General · opengvlab
General · axionml
General · florence-community
General · Stability AI
Chat · huggingfacetb
General · happypatrick
General · Google
General · stepfun-ai
General · TII
General · opengvlab
General · Google
General · redhatai
General · fal
General · moondream
General · ahmed-masry
General · facebook
General · zai-org
General · 5cd-ai
Coding · Alibaba · 2024-09-17
General · bigscience
General · eleutherai
Chat · huggingfacetb
General · eleutherai
General · salesforce
Chat · huggingfacetb
General · DeepSeek
General · rinna
General · Alibaba
General · eleutherai
General · huggingfacetb
General · peft-internal-testing
Chat · huggingfacetb
General · eleutherai
General · ai-sweden-models
General · eleutherai
General · eleutherai
General · eleutherai
Coding · Alibaba
Coding · redhatai
General · huggingfacetb
General · jackfram
General · ibm-research
General · Google
Chat · huggingfacetb
General · coherelabs
General · kblueleaf
General · eleutherai
General · eleutherai
General · liautoad
Chat · h2oai
General · Google
General · Google
General · nm-testing
General · isotr0py
General · kblueleaf
General · eleutherai
General · opengvlab
General · opengvlab
General · opengvlab
General · 5cd-ai
General · rhoninseiei
General · Alibaba · 2025-04-27
General · Microsoft
Coding · DeepSeek · 2024-06-14
General · eleutherai
Coding · codellama
General · eleutherai
Coding · DeepSeek
Coding · codellama
General · Microsoft
Coding · DeepSeek
General · Microsoft
General · Microsoft
General · aidc-ai
General · gokaygokay
General · gokaygokay
General · florence-community
General · opengvlab
General · opengvlab
General · apolo13x
General · Mistral AI
General · ggml-org
General · allenai
General · redhatai
General · nvidia
General · allenai
General · nvidia
General · allenai
General · bytedance-seed
Chat · servicenow-ai
General · vamsi
General · NousResearch
General · numind
General · xiaomimimo
General · salesforce
General · prometheus-eval
General · haochenwang
General · gokaygokay
General · mlabonne
General · parasail-ai
General · augmxnt
General · xenova
General · typhoon-ai
General · singh8898
General · allenai
Chat · Meta
Chat · Microsoft
General · llava-hf
General · nvidia
General · lmstudio-community
General · lmstudio-community
Chat · opengvlab
General · swe-bench
General · HuggingFace · 2023-10-26
Chat · Microsoft
Chat · Alibaba
General · nvidia
General · nvidia
Chat · zyphra
General · Alibaba
General · salesforce
General · fancyfeast
Chat · allenai
General · virtue-ai-hub
Chat · allenai
Chat · cazzz307
Coding · BigCode · 2024-02-20
General · liuhaotian
General · Microsoft
General · llava-hf
General · Alibaba
Chat · DeepSeek
General · mbzuai
General · inclusionai · 2025-02-28
Chat · Meta
Chat · Alibaba · 2025-01-26
General · Alibaba
Chat · kaitchup
Chat · DeepSeek
Chat · Microsoft
Reasoning · DeepSeek
General · nvidia
General · lgai-exaone
General · Alibaba
General · Alibaba
General · lmms-lab
Coding · xlabs-ai
General · xiaomimimo
Chat · nvfp4
General · inclusionai
General · opengvlab
General · sanjiwatsuki
General · allenai
Chat · huggingfacetb
General · openbmb
General · nytopop
General · inclusionai
Chat · cyankiwi
General · prithivmlmods
General · davidau
General · bytedance-seed
General · zju-ai4h
Chat · Stability AI · 2024-04-08
General · openai
General · quanttrio
General · cyankiwi
General · lovedheart
General · huggingfacem4
Chat · dream-org
General · openbmb
Chat · Alibaba
General · huihui-ai
Coding · BigCode
General · moonshotai
Chat · opengvlab
General · zai-org
Coding · ibm-granite
General · opengvlab
Chat · redhatai
General · jackrong
Chat · TII · 2024-11-29
General · redhatai
General · huihui-ai
General · cyankiwi
General · allenai
Embedding · Nomic · 2024-02-10
Chat · Community · 2023-12-30
Chat · Mistral AI
Chat · Microsoft · 2024-04-22
Chat · ibm-granite
General · ilyagusev
Chat · nvidia
Chat · redhatai
General · stepfun-ai
General · NousResearch
Chat · NousResearch
General · cyankiwi
Try Q4_K_M (16.2 GB, ~151 tok/s)
General · NousResearch
Chat · thesven
Chat · allenai
Chat · patronusai
General · nvidia
Chat · redhatai
General · tartunlp
Chat · gradientai
General · gaunernst
Chat · redhatai
Chat · Alibaba · 2024-09-16
Embedding · BAAI · 2023-09-12
Chat · Alibaba
General · llava-hf
General · zai-org
Chat · Alibaba
Chat · Alibaba
Chat · xcuros
Chat · xcuros
General · dengcao
General · facebook
General · omni-research
General · openbmb
Reasoning · nvidia
General · liuhaotian
General · tiger-lab
Chat · internlm
General · xtuner
General · redhatai
General · llava-hf
General · llava-hf
Reasoning · harleywang
Chat · huihui-ai
Chat · asi992h
Chat · Mistral AI · 2024-05-22
Chat · Alibaba
Chat · essentialai
General · Alibaba
General · eleutherai
General · ytu-ce-cosmos
Chat · zai-org
General · salesforce
General · languagebind
General · openvla
Chat · thecluster
General · openvla
Coding · Meta · 2024-03-13
General · Meta
General · Meta · 2024-07-14
General · opengvlab
Chat · thudm · 2024-06-04
Chat · moonshotai
General · Meta
Chat · bsc-lt
General · llava-hf
General · opengvlab
General · solidrust
Reasoning · Microsoft
General · lightricks
General · allenai
Chat · lmms-lab
General · skywork
General · redhatai
Coding · Alibaba · 2024-11-06
General · naver-hyperclovax
General · mistral-experimental
General · coherelabs
Chat · NousResearch
Chat · 01.ai · 2023-11-22
General · Google
Chat · huihui-ai
General · nvidia
General · Google · 2024-06-24
General · nvidia
Chat · nvidia
Chat · TII
Coding · Alibaba
General · opengvlab
General · moondream
Chat · DeepSeek
Chat · Alibaba
Chat · Meta · 2024-07-18
Chat · Meta
Coding · lmstudio-community
Chat · casperhansen
Chat · TII · 2023-04-25
Chat · vagosolutions
General · Meta
Chat · opengvlab
General · Alibaba · 2026-02-24
Try Q2_K (16.2 GB, ~152 tok/s)
General · k-compression
General · naver-hyperclovax
Reasoning · mconcat
Chat · salesforce
Chat · konstantinoskk
General · nvidia
General · Alibaba
General · aidc-ai
Try Q2_K (14.2 GB, ~133 tok/s)
Chat · lmstudio-community
General · Alibaba
General · Alibaba
General · Alibaba
Try Q3_K_M (17.2 GB, ~105 tok/s)
Chat · Meta · 2024-09-18
General · aoxo
Try Q2_K (14.5 GB, ~150 tok/s)
General · sarvamai
Try Q2_K (14.5 GB, ~150 tok/s)
General · typhoon-ai
Try Q3_K_M (17.2 GB, ~105 tok/s)
General · lmstudio-community
General · Google
General · cyankiwi
Coding · Meta · 2024-03-13
Coding · Alibaba
Try Q3_K_M (17.2 GB, ~105 tok/s)
Coding · Alibaba
Try Q3_K_M (17.2 GB, ~105 tok/s)
General · Alibaba
Try Q3_K_M (17.4 GB, ~142 tok/s)
General · Alibaba
General · eleutherai
Coding · quanttrio
Try Q3_K_M (17.2 GB, ~105 tok/s)
General · browser-use
Try Q3_K_M (17.4 GB, ~142 tok/s)
Chat · speakleash
General · Alibaba
Try Q3_K_M (17.2 GB, ~105 tok/s)
General · liuhaotian
General · Alibaba
Try Q3_K_M (17.2 GB, ~105 tok/s)
Chat · Upstage · 2023-12-12
General · quixiai
Try Q3_K_M (17.2 GB, ~105 tok/s)
General · redhatai
General · nytopop
Try Q3_K_M (17.2 GB, ~105 tok/s)
General · cyankiwi
General · gaunernst
General · llava-hf
General · opengvlab
Chat · Alibaba
Try Q3_K_M (17.2 GB, ~105 tok/s)
General · cais
Chat · junhowie
Try Q3_K_M (17.2 GB, ~105 tok/s)
General · llm-jp
General · redhatai
Chat · quanttrio
Try Q3_K_M (17.4 GB, ~142 tok/s)
General · Google · 2026-03-11
Try Q4_K_M (17.6 GB, ~11 tok/s)
Chat · openpipe
General · moonshotai
General · moonshotai
Reasoning · mconcat
Try Q3_K_M (15.4 GB, ~13 tok/s)
Reasoning · harleywang
Try Q3_K_M (15.4 GB, ~13 tok/s)
General · Alibaba · 2026-02-24
Try Q3_K_M (15.7 GB, ~13 tok/s)
Chat · Alibaba
General · cyankiwi
Reasoning · jackrong
Try Q3_K_M (15.7 GB, ~13 tok/s)
General · nvidia
General · apolo13x
Reasoning · jackrong
Try Q3_K_M (15.7 GB, ~13 tok/s)
General · cyankiwi
Reasoning · codgician
Try Q3_K_M (15.7 GB, ~13 tok/s)
Reasoning · quanttrio
Try Q3_K_M (15.7 GB, ~13 tok/s)
Reasoning · oxzoid
Try Q3_K_M (15.7 GB, ~13 tok/s)
General · deaquay
General · Alibaba
Try Q2_K (16.2 GB, ~159 tok/s)
General · opengvlab
General · quanttrio
Try Q2_K (16.2 GB, ~159 tok/s)
General · huihui-ai
Try Q2_K (16.2 GB, ~159 tok/s)
General · huihui-ai
Try Q2_K (16.2 GB, ~159 tok/s)
General · Alibaba
Try Q2_K (16.2 GB, ~159 tok/s)
General · axionml
Coding · coder3101
Try Q3_K_M (15.4 GB, ~13 tok/s)
General · cyankiwi
Try Q2_K (16.5 GB, ~156 tok/s)
Chat · llm-jp
Chat · moonshotai
General · nvidia
General · dphn
Try Q4_K_M (15.7 GB, ~12 tok/s)
General · nvidia · 2025-12-04
Try Q2_K (14.3 GB, ~14 tok/s)
Reasoning · DeepSeek · 2025-01-20
Try Q2_K (14.8 GB, ~14 tok/s)
General · Google · 2026-03-11
Try Q2_K (14.7 GB, ~14 tok/s)
Reasoning · codgician
Try Q2_K (16.2 GB, ~159 tok/s)
Reasoning · jackrong
Try Q2_K (16.2 GB, ~159 tok/s)
General · DeepSeek
Chat · stelterlab
Try Q4_K_M (15.7 GB, ~12 tok/s)
General · cerebras
Try Q5_K_M (17.6 GB, ~11 tok/s)
General · m-ric
Try Q4_K_M (16.8 GB, ~12 tok/s)
General · rhymes-ai
Try Q4_K_M (16.8 GB, ~12 tok/s)
Chat · gghfez
Try Q4_K_M (15.7 GB, ~12 tok/s)
General · Google
Coding · coder3101
Try Q2_K (14.1 GB, ~15 tok/s)
General · lmstudio-community
Reasoning · nvidia
Try Q2_K (14.8 GB, ~14 tok/s)
General · cyankiwi
Try Q4_K_M (17.6 GB, ~11 tok/s)
General · redhatai
Try Q3_K_M (15.5 GB, ~13 tok/s)
General · Google
Try Q4_K_M (17.6 GB, ~11 tok/s)
General · davidau
Try Q3_K_M (15.4 GB, ~13 tok/s)
General · jackrong
Try Q3_K_M (15.4 GB, ~13 tok/s)
General · davidau
Try Q3_K_M (15.4 GB, ~13 tok/s)
General · voidful
Try Q3_K_M (15.4 GB, ~13 tok/s)
General · voidful
Try Q3_K_M (15.4 GB, ~13 tok/s)
General · cyankiwi
Try Q4_K_M (17.6 GB, ~11 tok/s)
General · llmfan46
Try Q3_K_M (15.4 GB, ~13 tok/s)
General · Alibaba
Try Q3_K_M (15.7 GB, ~13 tok/s)
General · quanttrio
Try Q3_K_M (15.7 GB, ~13 tok/s)
General · huihui-ai
Try Q3_K_M (15.7 GB, ~13 tok/s)
General · lgai-exaone · 2025-07-11
Try Q2_K (14.4 GB, ~14 tok/s)
General · huihui-ai
Try Q3_K_M (15.7 GB, ~13 tok/s)
General · cyankiwi
Try Q3_K_M (16.1 GB, ~12 tok/s)
General · cyankiwi
Try Q3_K_M (16.0 GB, ~12 tok/s)
General · cyankiwi
Try Q3_K_M (16.0 GB, ~12 tok/s)
General · lmstudio-community
General · infomaniak-ai
Try Q3_K_M (16.2 GB, ~12 tok/s)
General · nvidia
Try Q2_K (14.3 GB, ~14 tok/s)
Coding · Alibaba · 2024-11-06
Try Q2_K (14.8 GB, ~14 tok/s)
General · zai-org
Try Q2_K (14.1 GB, ~15 tok/s)
General · nvidia
Try Q2_K (14.3 GB, ~14 tok/s)
General · nvidia
Try Q2_K (14.3 GB, ~14 tok/s)
General · quanttrio
Try Q2_K (14.1 GB, ~15 tok/s)
General · baidu
Try Q3_K_M (16.5 GB, ~12 tok/s)
Coding · Alibaba
Try Q2_K (14.8 GB, ~14 tok/s)
General · baidu
Try Q3_K_M (16.7 GB, ~12 tok/s)
General · momix-44
Try Q2_K (14.1 GB, ~15 tok/s)
General · quanttrio
Try Q2_K (14.1 GB, ~15 tok/s)
General · redhatai
Try Q2_K (14.1 GB, ~15 tok/s)
General · Google · 2025-03-01
Try Q3_K_M (15.5 GB, ~13 tok/s)
General · cyankiwi
Try Q2_K (14.5 GB, ~14 tok/s)
General · cyankiwi
Try Q2_K (14.5 GB, ~14 tok/s)
General · lgai-exaone
Try Q2_K (14.4 GB, ~14 tok/s)
General · lgai-exaone
Try Q2_K (14.4 GB, ~14 tok/s)
General · cyankiwi
Try Q2_K (14.5 GB, ~14 tok/s)
General · Alibaba
Try Q2_K (14.8 GB, ~14 tok/s)
General · baichuan-inc
Try Q2_K (14.8 GB, ~14 tok/s)
General · Google
Try Q2_K (14.7 GB, ~14 tok/s)
General · Google · 2024-06-24
Try Q3_K_M (15.4 GB, ~13 tok/s)
General · Alibaba
Try Q2_K (15.0 GB, ~14 tok/s)
General · naver-hyperclovax
Try Q2_K (15.0 GB, ~14 tok/s)
General · allenai
Try Q2_K (14.5 GB, ~14 tok/s)
General · karakuri-ai
Try Q2_K (15.1 GB, ~14 tok/s)
General · Alibaba
Try Q2_K (14.8 GB, ~14 tok/s)
General · Alibaba
Try Q2_K (14.8 GB, ~14 tok/s)
General · redhatai
Try Q2_K (14.8 GB, ~14 tok/s)
Coding · codellama
Try Q2_K (15.2 GB, ~14 tok/s)
General · salesforce
Try Q2_K (14.8 GB, ~14 tok/s)
Chat · Alibaba
Try Q2_K (15.0 GB, ~14 tok/s)
Chat · quanttrio
Try Q2_K (15.0 GB, ~14 tok/s)
Chat · Alibaba
Try Q2_K (15.1 GB, ~14 tok/s)
Chat · karakuri-ai
Try Q2_K (15.1 GB, ~14 tok/s)
Chat · Alibaba
Try Q2_K (14.8 GB, ~14 tok/s)
Chat · allenai · 2025-03-12
Try Q2_K (14.5 GB, ~14 tok/s)
General · davidau
General · dphn
Try Q2_K (15.5 GB, ~13 tok/s)
General · opengvlab
Try Q3_K_M (17.3 GB, ~11 tok/s)
Coding · Meta · 2024-03-14
Try Q2_K (15.2 GB, ~14 tok/s)
General · liuhaotian
Try Q2_K (15.6 GB, ~13 tok/s)
General · llava-hf
Try Q2_K (15.6 GB, ~13 tok/s)
General · opengvlab
Reasoning · skywork
General · moonshotai · 2026-01-01
Needs 462+ GB — try a Mac with more RAM
General · openai
Needs 53+ GB — try a Mac with more RAM
Reasoning · DeepSeek · 2025-01-20
Needs 299+ GB — try a Mac with more RAM
General · zai-org
Needs 329+ GB — try a Mac with more RAM
General · nvidia
Needs 30+ GB — try a Mac with more RAM
General · DeepSeek · 2025-12-01
Needs 299+ GB — try a Mac with more RAM
Coding · Alibaba
Needs 35+ GB — try a Mac with more RAM
General · nvidia
Needs 54+ GB — try a Mac with more RAM
Chat · Meta · 2024-07-16
Needs 31+ GB — try a Mac with more RAM
General · Alibaba · 2026-02-16
Needs 176+ GB — try a Mac with more RAM
General · Alibaba
Needs 176+ GB — try a Mac with more RAM
General · Alibaba · 2026-02-24
Needs 55+ GB — try a Mac with more RAM
General · Alibaba
Needs 55+ GB — try a Mac with more RAM
Reasoning · DeepSeek
Needs 299+ GB — try a Mac with more RAM
Coding · Alibaba · 2026-01-30
Needs 35+ GB — try a Mac with more RAM
Chat · Alibaba · 2024-09-16
Needs 32+ GB — try a Mac with more RAM
General · zerofata
Needs 31+ GB — try a Mac with more RAM
General · DeepSeek · 2024-12-25
Needs 299+ GB — try a Mac with more RAM
General · minimaxai · 2026-02-12
Needs 100+ GB — try a Mac with more RAM
General · Alibaba · 2025-04-27
Needs 103+ GB — try a Mac with more RAM
Reasoning · nvidia
Needs 172+ GB — try a Mac with more RAM
Chat · Alibaba
Needs 36+ GB — try a Mac with more RAM
Chat · kosbu
Needs 31+ GB — try a Mac with more RAM
General · DeepSeek
Needs 299+ GB — try a Mac with more RAM
Coding · nexveridian
Needs 35+ GB — try a Mac with more RAM
Chat · Mistral AI · 2023-12-10
Needs 21+ GB — try a Mac with more RAM
Chat · casperhansen
Needs 31+ GB — try a Mac with more RAM
Chat · Alibaba
Needs 103+ GB — try a Mac with more RAM
Chat · huihui-ai
Needs 32+ GB — try a Mac with more RAM
Chat · Meta · 2024-11-26
Needs 31+ GB — try a Mac with more RAM
General · zai-org
Needs 49+ GB — try a Mac with more RAM
General · zai-org · 2026-02-11
Needs 329+ GB — try a Mac with more RAM
General · Alibaba
Needs 103+ GB — try a Mac with more RAM
General · sehyo
Needs 32+ GB — try a Mac with more RAM
General · stepfun-ai
Needs 87+ GB — try a Mac with more RAM
General · nvidia
Needs 54+ GB — try a Mac with more RAM
General · Meta
Needs 177+ GB — try a Mac with more RAM
Chat · Alibaba
Needs 103+ GB — try a Mac with more RAM
Chat · Alibaba
Needs 36+ GB — try a Mac with more RAM
General · txn545
Needs 29+ GB — try a Mac with more RAM
General · Meta
Needs 31+ GB — try a Mac with more RAM
General · nvidia
Needs 22+ GB — try a Mac with more RAM
Chat · Meta · 2024-07-16
Needs 177+ GB — try a Mac with more RAM
General · darkc0de
Needs 54+ GB — try a Mac with more RAM
Chat · inceptionai
Needs 32+ GB — try a Mac with more RAM
General · inclusionai
Needs 45+ GB — try a Mac with more RAM
General · zai-org
Needs 157+ GB — try a Mac with more RAM
General · aoxo
Needs 25+ GB — try a Mac with more RAM
Chat · Alibaba
Needs 103+ GB — try a Mac with more RAM
General · nvidia
Needs 190+ GB — try a Mac with more RAM
General · stepfun-ai
Needs 140+ GB — try a Mac with more RAM
General · stepfun-ai
Needs 87+ GB — try a Mac with more RAM
Coding · casperhansen
Needs 103+ GB — try a Mac with more RAM
General · kldzj
Needs 51+ GB — try a Mac with more RAM
Chat · ibnzterrell
Needs 31+ GB — try a Mac with more RAM
General · Meta
Needs 31+ GB — try a Mac with more RAM
Chat · moonshotai · 2025-07-11
Needs 448+ GB — try a Mac with more RAM
Coding · Alibaba · 2025-07-22
Needs 210+ GB — try a Mac with more RAM
Chat · Alibaba
Needs 32+ GB — try a Mac with more RAM
Chat · moonshotai
Needs 448+ GB — try a Mac with more RAM
Chat · Alibaba
Needs 32+ GB — try a Mac with more RAM
General · lmstudio-community
Needs 141+ GB — try a Mac with more RAM
General · minimaxai
Needs 100+ GB — try a Mac with more RAM
Chat · Meta
Needs 175+ GB — try a Mac with more RAM
General · zai-org
Needs 157+ GB — try a Mac with more RAM
Chat · Meta
Needs 31+ GB — try a Mac with more RAM
General · lmstudio-community
Needs 69+ GB — try a Mac with more RAM
General · internlm
Needs 105+ GB — try a Mac with more RAM
General · lmstudio-community
Needs 72+ GB — try a Mac with more RAM
General · quanttrio
Needs 55+ GB — try a Mac with more RAM
Chat · moonshotai
Needs 22+ GB — try a Mac with more RAM
General · nvidia
Needs 173+ GB — try a Mac with more RAM
General · redhatai
Needs 103+ GB — try a Mac with more RAM
General · xiaomimimo · 2025-12-16
Needs 135+ GB — try a Mac with more RAM
General · moonshotai
Needs 461+ GB — try a Mac with more RAM
General · Alibaba
Needs 32+ GB — try a Mac with more RAM
General · quanttrio
Needs 100+ GB — try a Mac with more RAM
General · salesforce
Needs 21+ GB — try a Mac with more RAM
General · Alibaba
Needs 103+ GB — try a Mac with more RAM
Chat · lmstudio-community
Needs 23+ GB — try a Mac with more RAM
Chat · meituan-longcat
Needs 245+ GB — try a Mac with more RAM
General · coherelabs
Needs 49+ GB — try a Mac with more RAM
General · zai-org
Needs 47+ GB — try a Mac with more RAM
General · nvidia
Needs 22+ GB — try a Mac with more RAM
General · nvidia
Needs 172+ GB — try a Mac with more RAM
Chat · redhatai
Needs 48+ GB — try a Mac with more RAM
Chat · Alibaba
Needs 32+ GB — try a Mac with more RAM
General · minimaxai
Needs 100+ GB — try a Mac with more RAM
General · nvidia
Needs 22+ GB — try a Mac with more RAM
Chat · Mistral AI · 2024-04-16
Needs 62+ GB — try a Mac with more RAM
Chat · redhatai
Needs 31+ GB — try a Mac with more RAM
Chat · lmstudio-community
Needs 50+ GB — try a Mac with more RAM
General · Alibaba
Needs 32+ GB — try a Mac with more RAM
Chat · redhatai
Needs 31+ GB — try a Mac with more RAM
General · lgai-exaone
Needs 104+ GB — try a Mac with more RAM
General · nvidia
Needs 51+ GB — try a Mac with more RAM
Chat · redhatai
Needs 175+ GB — try a Mac with more RAM
General · inclusionai
Needs 441+ GB — try a Mac with more RAM
Chat · Meta · 2025-04-01
Needs 175+ GB — try a Mac with more RAM
Chat · redhatai
Needs 48+ GB — try a Mac with more RAM
General · rednote-hilab · 2025-05-14
Needs 63+ GB — try a Mac with more RAM
General · NousResearch · 2024-01-11
Needs 21+ GB — try a Mac with more RAM
Chat · Meta
Needs 39+ GB — try a Mac with more RAM
General · bigscience · 2022-05-19
Needs 77+ GB — try a Mac with more RAM
General · allenai
Needs 32+ GB — try a Mac with more RAM
General · amd
Needs 97+ GB — try a Mac with more RAM
General · gadflyii
Needs 27+ GB — try a Mac with more RAM
Chat · TII · 2023-09-04
Needs 79+ GB — try a Mac with more RAM
General · baidu · 2025-06-28
Needs 131+ GB — try a Mac with more RAM
Model not in the list? Paste a HuggingFace URL or ID for an instant fit check.
It depends on the model size and quantization. A 7B parameter model at Q4 quantization needs about 5 GB of RAM, while a 70B model needs 40+ GB. Apple Silicon Macs use unified memory, so your entire RAM pool is available for model weights — no separate VRAM required.
Not comfortably. A 70B model at Q4 quantization needs about 40 GB of RAM. The MacBook Air maxes out at 24-32 GB depending on the generation. You'd need a Mac Studio or MacBook Pro with 48+ GB for a 70B model to run well.
Speed depends on your chip's memory bandwidth and the model size. Smaller models (3-7B) run fastest — expect 40-70+ tokens per second on M2 Pro or better. Use the calculator above to see estimated speeds for your specific Mac.
Quantization reduces model precision to use less memory. Q8 (8-bit) is nearly lossless. Q4 (4-bit) reduces memory by ~75% with minor quality loss — it's the sweet spot for most users. Q2 (2-bit) saves the most memory but noticeably degrades output quality.
Apple Silicon uses unified memory — CPU and GPU share the same RAM pool. A Mac with 32 GB can load a 28 GB model directly. On NVIDIA systems, you're limited by GPU VRAM (typically 8-24 GB on consumer cards), even if the PC has 64 GB of system RAM.
ToolPiper uses Metal GPU acceleration via llama.cpp for LLM inference on Apple Silicon. The GPU and CPU share unified memory, so there's no data transfer overhead. The Neural Engine (ANE) is used for specific tasks like super-resolution and pose detection.
Yes, if you have enough RAM. ToolPiper manages model loading and can keep multiple models in memory simultaneously. When memory gets tight, it automatically evicts the least recently used model to make room for a new one.
GGUF is the standard format for running quantized models with llama.cpp (and ToolPiper). It supports all quantization levels and runs on CPU+GPU. MLX is Apple's format optimized for Apple Silicon. AWQ and GPTQ are NVIDIA-focused formats that don't run natively on Mac.
Model database updated: 2026-04-14 · 866 models