inference-optimization/Llama-3.1-8B-Instruct-7-bits-mode-heuristic-per-tensor 7B • Updated Apr 22 • 20