Tools that divide long-form audio into smaller, fixed-length segments so that each piece can be processed with a bounded amount of memory.
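As a rough illustration of the idea, the sketch below splits a sequence of audio samples into consecutive fixed-length segments. It is not taken from any listed repository: the function name `split_fixed_length`, its parameters, and the zero-padding of the final segment are all assumptions made for this example.

```python
from __future__ import annotations

from typing import Iterator, Sequence


def split_fixed_length(
    samples: Sequence[float],
    sample_rate: int,
    segment_seconds: float,
    pad_last: bool = True,
) -> Iterator[list[float]]:
    """Yield consecutive segments of `segment_seconds` each (hypothetical helper).

    If `pad_last` is True, the final segment is zero-padded to full length;
    otherwise it is yielded shorter than the others.
    """
    if sample_rate <= 0 or segment_seconds <= 0:
        raise ValueError("sample_rate and segment_seconds must be positive")

    # Number of samples in one full segment.
    segment_len = int(round(sample_rate * segment_seconds))
    if segment_len == 0:
        raise ValueError("segment is shorter than one sample")

    # Walk the input in non-overlapping strides of segment_len samples.
    for start in range(0, len(samples), segment_len):
        segment = list(samples[start:start + segment_len])
        if pad_last and len(segment) < segment_len:
            # Pad the trailing partial segment with silence (zeros).
            segment.extend([0.0] * (segment_len - len(segment)))
        yield segment


# Toy usage: 10 samples at a 4 Hz "sample rate", cut into 1-second segments.
audio = [0.1] * 10
segments = list(split_fixed_length(audio, sample_rate=4, segment_seconds=1.0))
```

Because the function is a generator, only one segment needs to be held in memory at a time, which is where the memory saving comes from. Real tools may also overlap adjacent segments or cut at silence boundaries; this sketch does neither.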
1 GitHub repository matching Graphics & Multimedia · Audio Segmentation Utilities.
This project is a speech recognition and translation engine that uses a sequence-to-sequence transformer architecture to convert audio into text. It is built on a weakly supervised learning framework, which leverages large-scale, weakly labelled audio-transcript data to create generalized speech representations capabl