1sec.ai
Back to feed
models885d ago

Accelerating SD Turbo and SDXL Turbo Inference with ONNX Runtime and Olive

Hugging Face and Microsoft collaborated to optimize SD Turbo and SDXL Turbo inference using ONNX Runtime and Olive. This integration reduces latency by up to 30% and improves throughput. You can deploy these optimized models on Hugging Face's Inference API or use them locally. The optimization enables faster and more efficient image generation.

Key takeaways

  • Up to 30% latency reduction with ONNX Runtime and Olive.
  • Optimized models deployable via Hugging Face's Inference API or locally.
  • Faster image generation for applications.
models885d ago

Accelerating SD Turbo and SDXL Turbo Inference with ONNX Runtime and Olive

Hugging Face and Microsoft collaborated to optimize SD Turbo and SDXL Turbo inference using ONNX Runtime and Olive. This integration reduces latency by up to 30% and improves throughput. You can deploy these optimized models on Hugging Face's Inference API or use them locally. The optimization enables faster and more efficient image generation.

Key takeaways

  • Up to 30% latency reduction with ONNX Runtime and Olive.
  • Optimized models deployable via Hugging Face's Inference API or locally.
  • Faster image generation for applications.