Active filters: quant
AngelSlim/Hy-MT1.5-1.8B-1.25bit
Translation
• 2B • Updated • 17.6k
• 190
tencent/Hy-MT1.5-1.8B-1.25bit
Translation
• 2B • Updated • 341
• 31
Text-to-Image
• Updated • 13.6k
• 48
3ndetz/LTX2-Rapid-Merges-GGUF
Image-Text-to-Video
• 19B • Updated • 2.22k
• 24
2B • Updated • 49
• 59
AngelSlim/Hy-MT1.5-1.8B-1.25bit-GGUF
Translation
• 2B • Updated • 7.18k
• 41
AngelSlim/Hy-MT1.5-1.8B-2bit-GGUF
Translation
• 2B • Updated • 4.52k
• 21
tencent/Hy-MT1.5-1.8B-2bit
Translation
• 2B • Updated • 47.2k
• 35
tencent/Hy-MT1.5-1.8B-1.25bit-GGUF
Translation
• 2B • Updated • 4.23k
• 17
digitous/13B-HyperMantis_GPTQ_4bit-128g
Text Generation
• Updated • 11
• 12
pszemraj/nougat-small-onnx-quant_avx2
Image-Text-to-Text
• Updated • 7
pszemraj/nougat-base-onnx-quant_avx2
Image-Text-to-Text
• Updated • 5
fhai50032/RolePlayLake-7B-GGUF
7B • Updated • 48
• 3
oldbridge/latxa-7b-instruct-q8
Text Generation
• 7B • Updated • 16
pszemraj/nougat-small-onnx-quant_avx512_vnni
Image-Text-to-Text
• Updated • 4
RDson/Llama-3-Magenta-Instruct-4x8B-MoE-GGUF
25B • Updated • 215
• 1
TroyDoesAI/Codestral-21B-Pruned
Text Generation
• 21B • Updated • 8
• 2
mradermacher/Codestral-21B-Pruned-GGUF
21B • Updated • 273
mradermacher/Codestral-21B-Pruned-i1-GGUF
21B • Updated • 426
pszemraj/candle-flanUL2-quantized
Text Generation
• 19B • Updated • 37
byroneverson/gemma-2-27b-it-abliterated-gguf
Text Generation
• 27B • Updated • 269
• 12
QuantFactory/gemma-2-27b-it-abliterated-GGUF
Text Generation
• 27B • Updated • 951
• 7
EmperorKronos/gemma-2-27b-it-abliterated-exl2
Text Generation
• Updated • 1
byroneverson/LongWriter-glm4-9b-abliterated-gguf
Text Generation
• 9B • Updated • 28
• 3
Question Answering
• 8B • Updated • 8
• 4
mradermacher/FinShibainu-GGUF
8B • Updated • 106
• 1
eaddario/Hammer2.1-7b-GGUF
Text Generation
• 8B • Updated • 830
• 2
eaddario/DeepSeek-R1-Distill-Qwen-7B-GGUF
Text Generation
• 8B • Updated • 822
• 3