1sec.ai

Tag

#serverless

Every item tagged serverless, newest first.

2 items

otherApr 2

Bringing serverless GPU inference to Hugging Face users

Cloudflare partners with Hugging Face to enable serverless GPU inference on the Hugging Face platform. This integration allows users to deploy and run AI models at the edge, reducing latency and costs. Builders can now access GPU-accelerated inference without managing infrastructure. The partnership aims to make AI model deployment more accessible and efficient.

Key takeaways
  • Serverless GPU inference now available on Hugging Face via Cloudflare.
  • Deploy AI models at the edge to reduce latency and costs.
  • No infrastructure management required for users.
tutorialsMar 18

My Journey to a serverless transformers pipeline on Google Cloud

A developer detailed building a serverless transformers pipeline on Google Cloud using Hugging Face Transformers and Cloud Run. The pipeline leverages pre-trained models and handles inference workloads efficiently. You can deploy similar pipelines using these steps. This approach allows for scalable and cost-effective model serving.

Key takeaways
  • Uses Hugging Face Transformers and Cloud Run for deployment.
  • Pipeline handles inference workloads efficiently.
  • Deployment process is scalable and cost-effective.