Qwen3.6 12B IQ Ultra Heretic Uncensored Thinking V2 Hightop - GGUF

GGUF quantizations of DavidAU/Qwen3.6-12B-IQ-Ultra-Heretic-Uncensored-Thinking-V2-Hightop.

Converted and quantized using llama.cpp b9192.

Available Quants

| Quant  | Size     | Quality         |
|--------|----------|-----------------|
| Q8_0   | 11.57 GB | Near-perfect    |
| Q6_K   | 8.93 GB  | Excellent       |
| Q5_K_M | 7.84 GB  | Very good       |
| Q5_K_S | 7.65 GB  | Very good       |
| Q5_0   | 7.65 GB  | Good            |
| Q4_K_M | 6.82 GB  | Best balance    |
| IQ4_NL | 6.58 GB  | Very good (IQ)  |
| Q4_K_S | 6.49 GB  | Good            |
| Q4_0   | 6.43 GB  | Good            |
| IQ4_XS | 6.31 GB  | Good (IQ)       |
| Q3_K_L | 5.94 GB  | Acceptable      |
| Q3_K_M | 5.58 GB  | Acceptable      |
| IQ3_M  | 5.33 GB  | Acceptable (IQ) |
| IQ3_S  | 5.27 GB  | Acceptable (IQ) |
| Q3_K_S | 5.15 GB  | Fair            |
| Q2_K   | 4.60 GB  | Minimal usable  |
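
As a rough aid for choosing among the quants listed above, the sketch below picks the largest file that fits a given memory budget. The sizes are copied from the list; the `pick_quant` helper and the 1.5 GB headroom reserved for context and runtime overhead are illustrative assumptions, not measured requirements, so actual memory use will vary with context length and backend.

```python
# File sizes in GB, copied from the quant list above (largest first).
QUANTS = [
    ("Q8_0", 11.57), ("Q6_K", 8.93), ("Q5_K_M", 7.84), ("Q5_K_S", 7.65),
    ("Q5_0", 7.65), ("Q4_K_M", 6.82), ("IQ4_NL", 6.58), ("Q4_K_S", 6.49),
    ("Q4_0", 6.43), ("IQ4_XS", 6.31), ("Q3_K_L", 5.94), ("Q3_K_M", 5.58),
    ("IQ3_M", 5.33), ("IQ3_S", 5.27), ("Q3_K_S", 5.15), ("Q2_K", 4.60),
]


def pick_quant(budget_gb, headroom_gb=1.5):
    """Return the name of the largest quant whose file fits the budget.

    headroom_gb is an assumed allowance for the KV cache and runtime
    overhead; it is a placeholder value, not a measured figure.
    Returns None when no listed quant fits.
    """
    usable = budget_gb - headroom_gb
    for name, size_gb in QUANTS:  # list is ordered largest to smallest
        if size_gb <= usable:
            return name
    return None


if __name__ == "__main__":
    for budget in (8.0, 12.0, 24.0):
        print(budget, "GB ->", pick_quant(budget))
```

The helper ranks by file size only. Where two quants are close in size, the quality notes in the list (for example, Q4_K_M as "Best balance") may be a better guide than the size ordering alone.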

Original Model

DavidAU/Qwen3.6-12B-IQ-Ultra-Heretic-Uncensored-Thinking-V2-Hightop

Usage

Use with LM Studio, llama.cpp, Ollama, or any other GGUF-compatible inference engine.
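
For llama.cpp, a minimal sketch of assembling a `llama-cli` invocation from Python is shown below. The flags used (`-m`, `-p`, `-c`, `-n`, `-ngl`) are standard `llama-cli` options, but the helper name `llama_cli_command`, the default values, and the file name `model-Q4_K_M.gguf` are hypothetical placeholders; substitute the actual file name of the quant you downloaded.

```python
import shlex


def llama_cli_command(model_path, prompt, ctx_size=4096, n_predict=256, gpu_layers=0):
    """Build an argument list for llama.cpp's llama-cli binary.

    All defaults here are illustrative assumptions, not recommended
    settings for this model.
    """
    return [
        "llama-cli",
        "-m", model_path,          # path to the GGUF file
        "-p", prompt,              # prompt text
        "-c", str(ctx_size),       # context size in tokens
        "-n", str(n_predict),      # maximum number of tokens to generate
        "-ngl", str(gpu_layers),   # number of layers to offload to the GPU
    ]


if __name__ == "__main__":
    # "model-Q4_K_M.gguf" is a placeholder file name, not a name taken from this repository.
    cmd = llama_cli_command("model-Q4_K_M.gguf", "Hello")
    print(shlex.join(cmd))
    # To actually run it: subprocess.run(cmd, check=True), with llama-cli on PATH.
```

Building the command as a list rather than a single string avoids shell-quoting problems when the prompt contains spaces or quotes.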

Format: GGUF
Model size: 12B params
Architecture: qwen35