models | 885d ago

Accelerating SD Turbo and SDXL Turbo Inference with ONNX Runtime and Olive

Hugging Face Blog

Hugging Face and Microsoft collaborated to optimize SD Turbo and SDXL Turbo inference using ONNX Runtime and Olive. The integration reduces latency by up to 30% and improves throughput, and the optimized models can be deployed on Hugging Face's Inference API or run locally.
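
To make the latency figure concrete, here is a minimal sketch of how a latency reduction translates into throughput. It assumes requests are processed one at a time with a fixed batch size, so throughput is simply the inverse of per-image latency. The function name and the 1.0 s baseline are hypothetical and chosen only for illustration; they are not measurements from the original post.

```python
def throughput_gain(latency_reduction: float) -> float:
    """Return the throughput multiplier implied by a fractional latency reduction.

    Assumes sequential processing, where throughput = 1 / latency.
    `latency_reduction` is a fraction in [0, 1), e.g. 0.30 for 30%.
    """
    if not 0.0 <= latency_reduction < 1.0:
        raise ValueError("latency_reduction must be in [0, 1)")
    return 1.0 / (1.0 - latency_reduction)


# Hypothetical baseline: 1.0 s per image (illustrative only, not a benchmark).
baseline_latency_s = 1.0
optimized_latency_s = baseline_latency_s * (1.0 - 0.30)  # 0.7 s per image

gain = throughput_gain(0.30)
print(f"optimized latency: {optimized_latency_s:.2f} s")
print(f"throughput gain:   {gain:.2f}x")  # about 1.43x
```

Under these assumptions, a 30% latency cut corresponds to roughly 43% more images per second, not 30%, because throughput scales with the reciprocal of latency.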

Key takeaways

Up to 30% latency reduction with ONNX Runtime and Olive.
Optimized models deployable via Hugging Face's Inference API or locally.
Faster image generation for applications.

#inference-optimization #onnx #image-generation

