John Leimgruber III
PRO
ubergarm
421 followers · 65 following
https://blog.aifoundry.org/p/adventures-in-model-quantization
ubergarm
john-leimgruber
AI & ML interests
Open LLMs and Astrophotography image processing.
ubergarm's activity
New activity in RDson/Qwen3.6-27B-MTP-Q4_K_M-GGUF · about 10 hours ago
Speed-up is underwhelming
7 · #1 opened 6 days ago by androidli
liked a model · about 11 hours ago
Lorbus/Qwen3.6-27B-int4-AutoRound
Image-Text-to-Text • 6B • Updated 14 days ago • 320k • 84
New activity in Lorbus/Qwen3.6-27B-int4-AutoRound · about 11 hours ago
How does this fare against other quants without MTP like unsloth?
👍 1 · 1 · #4 opened 11 days ago by Crigges
New activity in ubergarm/Qwen3.6-27B-GGUF · about 13 hours ago
How to use MTP in GGUF?
16 · #2 opened 6 days ago by Friedland
liked a model · about 16 hours ago
google/gemma-4-31B-it-assistant
Any-to-Any • 0.5B • Updated about 18 hours ago • 4.24k • 93
liked a model · about 17 hours ago
RDson/Qwen3.6-27B-MTP-Q4_K_M-GGUF
27B • Updated 7 days ago • 1.21k • 8
updated a model · 1 day ago
ubergarm/Qwen3.6-27B-GGUF
Text Generation • 27B • Updated 1 day ago • 8.55k • 18
New activity in ubergarm/Qwen3.6-27B-GGUF · 5 days ago
Q6_0 use over Q6_K?
2 · #3 opened 5 days ago by resynth
New activity in ubergarm/Qwen3.6-27B-GGUF · 7 days ago
Great model for single GPU use cases.
🔥 4 · 16 · #1 opened 11 days ago by phakio
liked a model · 8 days ago
XiaomiMiMo/MiMo-V2.5-Pro
Text Generation • 1T • Updated 8 days ago • 16k • 448
New activity in ubergarm/Qwen3.5-122B-A10B-GGUF · 9 days ago
How to split this model between 2 (3) GPUs and CPU/RAM?
30 · #12 opened about 2 months ago by mancub
updated a model · 9 days ago
ubergarm/Kimi-K2.6-GGUF
Text Generation • 1T • Updated 9 days ago • 5.62k • 35
liked a model · 11 days ago
bartowski/Qwen_Qwen3.6-27B-GGUF
Image-Text-to-Text • 27B • Updated 13 days ago • 81.9k • 30
New activity in ubergarm/Qwen3.5-35B-A3B-GGUF · 12 days ago
Any plans to release Qwen3.6?
1 · #3 opened 12 days ago by ittokasso
updated a model · 12 days ago
ubergarm/Qwen3.5-27B-GGUF
Text Generation • 27B • Updated 12 days ago • 1.98k • 16
New activity in ubergarm/Qwen3.5-27B-GGUF · 12 days ago
smol-IQ4_NL with turboquant is outstanding for 24GB VRAM
🔥 1 · 3 · #7 opened 12 days ago by IHaveNoClueAndIMustPost
published a model · 12 days ago
ubergarm/Qwen3.6-27B-GGUF
Text Generation • 27B • Updated 1 day ago • 8.55k • 18
New activity in ubergarm/Kimi-K2.6-GGUF · 12 days ago
Much faster IQ2_KS quantization PPL results?
3 · #10 opened 13 days ago by gghfez
really awesome speeds! running at 256k context.
🔥 1 · 5 · #11 opened 13 days ago by mtcl
Is IQ3_K mainline compatible?
1 · #12 opened 12 days ago by TimothyRoo