Making automatic speech recognition work on large files with Wav2Vec2 in 🤗 Transformers
The Hugging Face Transformers library now supports chunking for automatic speech recognition (ASR) with Wav2Vec2. A large audio file is split into smaller chunks, which makes it possible to process files more than 1 minute long. You can integrate this feature into your ASR workflows to handle longer audio inputs.
- Wav2Vec2 in Transformers supports chunking for large audio files.
- Chunking enables processing audio over 1 minute long.
- Chunking makes ASR more usable for longer audio inputs.
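
To make the chunking idea above concrete, here is a minimal sketch of one way to split a long recording into overlapping windows. It assumes fixed-size chunks with a small overlap (stride) on each side, where only the central part of each chunk is kept. The names `chunk_spans`, `chunk_len`, and `stride` are illustrative and do not come from the library, and this is not the library's actual implementation.

```python
from typing import List, Tuple


def chunk_spans(num_samples: int, chunk_len: int, stride: int) -> List[Tuple[int, int, int, int]]:
    """Plan overlapping chunks over an audio signal of `num_samples` samples.

    Hypothetical helper for illustration only. Each tuple is
    (start, end, keep_start, keep_end): the chunk covers [start, end),
    and only [keep_start, keep_end) of its output would be kept, so the
    kept regions tile the whole signal without gaps or overlap.
    """
    if stride < 0 or chunk_len <= 2 * stride:
        raise ValueError("chunk_len must be larger than 2 * stride")

    step = chunk_len - 2 * stride  # how far each new chunk advances
    spans = []
    start = 0
    while True:
        end = min(start + chunk_len, num_samples)
        is_first = start == 0
        is_last = end == num_samples
        # The first chunk has no left overlap and the last has no right overlap.
        keep_start = start if is_first else start + stride
        keep_end = end if is_last else end - stride
        spans.append((start, end, keep_start, keep_end))
        if is_last:
            break
        start += step
    return spans


if __name__ == "__main__":
    # 100 samples, chunks of 40 samples, 5 samples of overlap on each side.
    for span in chunk_spans(100, 40, 5):
        print(span)
```

In Transformers itself you would normally not write this by hand: the automatic speech recognition pipeline accepts `chunk_length_s` and `stride_length_s` arguments when it is called on an audio file. The values to use depend on the model and the audio, so treat any specific numbers as something to tune.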