Tag

#intel

Every item tagged intel, newest first.

6 items

Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models

Intel optimized the Qwen3-8B agent on Core Ultra CPUs using depth-pruned draft models, achieving 2.4x faster inference. This tech enables faster, more efficient AI on consumer hardware. You can deploy optimized models like these to improve performance in resource-constrained environments.

Key takeaways

2.4x faster inference on Intel Core Ultra CPUs.
Optimized using depth-pruned draft models.
Enables efficient AI on consumer hardware.

HHugging Face Blog#model-optimization #intel #consumer-hardware

toolsSep 20

Optimize and deploy with Optimum-Intel and OpenVINO GenAI

Intel and Hugging Face collaborated on Optimum-Intel, a deployment optimization tool that integrates with OpenVINO GenAI for efficient model serving. This integration enables developers to optimize and deploy AI models more efficiently across Intel hardware. You can use Optimum-Intel to streamline model deployment on Intel-based infrastructure. The collaboration aims to make AI model deployment faster and more cost-effective.

Key takeaways

Optimum-Intel integrates with OpenVINO GenAI for optimized model deployment.
Targets efficient serving on Intel hardware.
Aims to reduce deployment costs and increase speed.

HHugging Face Blog#model-deployment #intel #openvino

modelsApr 3

Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon

Hugging Face and Intel collaborated on optimizing SetFit inference for Intel Xeon processors using Hugging Face's Optimum library. The result is a 2.2x speedup in SetFit inference performance. You can integrate this optimized solution into your applications for faster and more efficient processing.

Key takeaways

2.2x speedup in SetFit inference on Intel Xeon processors.
Optimized using Hugging Face's Optimum library.
Solution available for integration into applications.

HHugging Face Blog#setfit #optimum #intel #inference-optimization

modelsMar 28

Accelerating Stable Diffusion Inference on Intel CPUs

Intel and Hugging Face collaborated to optimize Stable Diffusion inference on Intel CPUs, achieving up to 2x faster performance. The optimization leverages Intel's AVX-512 and VNNI instructions. This work enables faster and more efficient image generation on widely available hardware, benefiting developers who deploy Stable Diffusion models in production.

Key takeaways

Up to 2x faster Stable Diffusion inference on Intel CPUs.
Optimization uses Intel's AVX-512 and VNNI instructions.
Faster inference on widely available hardware reduces deployment costs.

HHugging Face Blog#stable-diffusion #intel #optimization #inference

modelsJan 2

Accelerating PyTorch Transformers with Intel Sapphire Rapids - part 1

Intel and Hugging Face collaborated to optimize PyTorch transformer performance on Intel Sapphire Rapids CPUs. The work resulted in significant speedups for transformer inference, making it more efficient for builders to deploy AI models. This optimization enables faster and more cost-effective model serving. You can leverage these improvements in your own applications.

Key takeaways

PyTorch transformer inference sped up on Intel Sapphire Rapids.
Optimization achieved through Intel and Hugging Face collaboration.
Faster inference enables more efficient model deployment.

HHugging Face Blog#pytorch #transformers #optimization #intel

toolsNov 19

Accelerating PyTorch distributed fine-tuning with Intel technologies

Intel and Hugging Face collaborated to optimize PyTorch distributed fine-tuning on Intel hardware. The work improves training speed and efficiency for large models. You can now fine-tune models faster and more efficiently on Intel-based infrastructure. This acceleration enables builders to explore more model configurations and experiment with larger models.

Key takeaways

PyTorch distributed fine-tuning optimized for Intel hardware.
Faster training speeds and improved efficiency for large models.
Enables exploration of more model configurations and larger models.

HHugging Face Blog#pytorch #distributed-training #fine-tuning #intel