nm-testing/qwen3-80b-fp8-dynamic
80B • Updated • 4
nm-testing/gemma-3-4b-it-s_q-W4A8-G512
5B • Updated • 13
nm-testing/llama3.3-70B-speculators.09-10-2025-eagle3
2B • Updated • 4
nm-testing/Llama-3.2-1B-Instruct-quipv-w4a16
2B • Updated • 4
nm-testing/Llama-3.2-1B-Instruct-quip
2B • Updated • 3
nm-testing/Llama-3.2-1B-Instruct-spinquantR1R2-online
2B • Updated • 4
nm-testing/Qwen3-Coder-30B-A3B-Instruct-W4A16-awq
5B • Updated • 167k
• 4
nm-testing/llama4-scout-17b-eagle3-dummy-drafter
Updated • 53
nm-testing/Llama-3.2-1B-Instruct-spinquantR1R2R4-w4a16
2B • Updated • 10.2k
nm-testing/Llama-3.1-8B-Instruct-quip-w4a16
2B • Updated • 5
nm-testing/Meta-Llama-3-8B-Instruct-spinquantR3-FP8_asym-attn
8B • Updated • 3
nm-testing/Meta-Llama-3-8B-Instruct-spinquantR3
8B • Updated • 3
nm-testing/gemma-3n-2b-quantized.w4a16-test
4B • Updated • 4
nm-testing/Meta-Llama-3-8B-Instruct-NVFP4-FP8-Dynamic
6B • Updated • 9
nm-testing/TinyLlama-1.1B-Chat-v1.0-NVFP4-FP8-Dynamic
0.8B • Updated • 4
nm-testing/Llama-3.2-1B-Instruct-lc_min_hack-hadamard-w4a16
2B • Updated • 4
nm-testing/Llama-3.2-1B-Instruct-sq_min_hack-hadamard-w4a16
2B • Updated • 4
nm-testing/Llama-3.2-1B-Instruct-sq_min_hack-eye-w4a16
2B • Updated • 4
nm-testing/Llama-3.2-1B-Instruct-lc_min_hack-eye-w4a16
2B • Updated • 4
nm-testing/Meta-Llama-3-8B-Instruct-quip-w4a16
2B • Updated • 4
nm-testing/gemma-3n-E2B-it-W4A16-G128
4B • Updated • 7
nm-testing/block-quantization-fp8-qwen3-0.6B
0.8B • Updated • 5
nm-testing/Llama-3.1-8B-Instruct-speculator.eagle3-converted
Text Generation
• 1.0B • Updated • 350
nm-testing/gemma-3n-2B-it-w4a16
4B • Updated • 5
nm-testing/Speculator-Qwen3-8B-Eagle3-converted-071-quantized
1B • Updated • 13.3k
nm-testing/granite-20b-code-instruct-8k-quantized.w4a16
3B • Updated • 5
nm-testing/SpeculatorLlama3-1-8B-Eagle3-converted-0717-quantized
1.0B • Updated • 13.9k
nm-testing/Llama-3.1-8B-Instruct-bearester-quant
8B • Updated • 2
nm-testing/Llama-3.1-8B-Instruct-bearest-quant
8B • Updated • 1
nm-testing/Llama-3.1-8B-Instruct-bare-bones
8B • Updated • 2