Inference Optimized Checkpoints (with Model Optimizer) Collection A collection of generative models quantized and optimized for inference with Model Optimizer. • 65 items • Updated 7 days ago • 157
view reply Have you compared models served by an inference provider to evals run direct against the model? Some data to support the crux of your argument would be good.