Inference Providers
Active filters: vLLM
QuantTrio/Qwen3-30B-A3B-Thinking-2507-AWQ-BF16Mix
Text Generation
• 31B • Updated • 13
• 4
QuantTrio/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int8
Text Generation
• 31B • Updated • 174
• 2
QuantTrio/Qwen3-30B-A3B-Thinking-2507-AWQ
Text Generation
• 31B • Updated • 4.73k
• 4
QuantTrio/KAT-V1-40B-GPTQ-Int4-Int8Mix
Text Generation
• 47B • Updated • 1
QuantTrio/Qwen3-Coder-30B-A3B-Instruct-GPTQ-Int8
Text Generation
• 31B • Updated • 233
• 8
EliovpAI/Qwen3-14B-FP8-KV
Text Generation
• 15B • Updated • 24
• 2
Image-Text-to-Text
• 17B • Updated • 3.53k
• 19
QuantTrio/Seed-OSS-36B-Instruct-AWQ
Text Generation
• 36B • Updated • 355
• 8
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int8
Text Generation
• 36B • Updated • 85
• 4
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int4
Text Generation
• 36B • Updated • 11
• 5
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int3
Text Generation
• 34B • Updated • 5
• 3
amakhov/tiny-random-llama
Text Generation
• 4.18M • Updated • 65
Text Generation
• 41B • Updated • 2
QuantTrio/DeepSeek-V3.1-AWQ
Text Generation
• 684B • Updated • 330
• 5
QuantTrio/DeepSeek-V3.1-AWQ-Fp16Mix
Text Generation
• 684B • Updated • 18
• 1
QuantTrio/DeepSeek-V3.1-AWQ-Lite
Text Generation
• 684B • Updated • 2.25k
• 3
JunHowie/Qwen3-4B-Instruct-2507-GPTQ-Int4
Text Generation
• 4B • Updated • 4.96k
• 4
JunHowie/Qwen3-4B-Instruct-2507-GPTQ-Int8
Text Generation
• 4B • Updated • 261
JunHowie/Qwen3-4B-Thinking-2507-GPTQ-Int4
Text Generation
• 4B • Updated • 191
• 1
JunHowie/Qwen3-4B-Thinking-2507-GPTQ-Int8
Text Generation
• 4B • Updated • 38
• 2
JunHowie/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int4
Text Generation
• 31B • Updated • 1.65k
JunHowie/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int8
Text Generation
• 31B • Updated • 36
JunHowie/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int4
Text Generation
• 31B • Updated • 3
JunHowie/Qwen2-7B-Instruct-GPTQ-Int4
Text Generation
• 8B • Updated • 4.35k
JunHowie/Qwen2-7B-Instruct-GPTQ-Int8
Text Generation
• 8B • Updated • 4
EliovpAI/Deepseek-R1-0528-Qwen3-8B-FP8-KV
Text Generation
• 8B • Updated • 7
JunHowie/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int8
Text Generation
• 31B • Updated • 11
JunHowie/Seed-OSS-36B-Instruct-GPTQ-Int4
Text Generation
• 36B • Updated • 1
JunHowie/Seed-OSS-36B-Instruct-GPTQ-Int8
Text Generation
• 36B • Updated QuantTrio/Qwen3-VL-235B-A22B-Instruct-AWQ
Text Generation
• 236B • Updated • 3.6k
• 13