This video is a full code walkthrough of developing a GUI for OpenAI Whisper and hosting it on Hugging Face Spaces. The OpenAI Whisper model is trained on a large, diverse audio dataset. It is a multi-task model that can perform multilingual speech recognition, speech translation, and language identification.
GitHub Resources:
https://github.com/prodramp/DeepWorks...
Hugging Face Space:
https://huggingface.co/spaces/Avkash/...
▬▬▬▬▬▬ ⏰ TUTORIAL TIME STAMPS ⏰ ▬▬▬▬▬▬
(00:00) Content Introduction
(01:15) Reusing Python Project
(03:10) Code Walkthrough
(03:35) Loading Model from UI
(04:35) Loading YouTube video
(04:45) Loading data from disk
(06:10) Reading MP4/MP3
(07:35) Whisper Transcribe
(10:50) Decode with Translate
(12:55) Hosting on Hugging Face Spaces
(14:15) Source Code at GitHub
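For readers who want a feel for the model-loading, transcribe, and translate steps listed above before watching, here is a minimal sketch. It is not the code from the video. It assumes the open-source `openai-whisper` Python package is installed and uses its `load_model` and `transcribe` calls. The helper names (`format_timestamp`, `segments_to_lines`, `run_whisper`) and the file name `sample.mp3` are hypothetical, chosen only for illustration.

```python
from typing import Any, Dict


def format_timestamp(seconds: float) -> str:
    """Render an offset in seconds as MM:SS, like the chapter timestamps above."""
    total = int(seconds)
    return f"{total // 60:02d}:{total % 60:02d}"


def segments_to_lines(result: Dict[str, Any]) -> str:
    """Turn a Whisper-style result dict into one '(MM:SS) text' line per segment."""
    lines = []
    for seg in result.get("segments", []):
        lines.append(f"({format_timestamp(seg['start'])}) {seg['text'].strip()}")
    return "\n".join(lines)


def run_whisper(audio_path: str, model_name: str = "base", translate: bool = False) -> Dict[str, Any]:
    """Load a Whisper model and transcribe one audio file.

    With translate=True the model is asked to output English text instead of
    a transcript in the spoken language.
    """
    # Imported lazily so the formatting helpers work without the package installed.
    import whisper

    model = whisper.load_model(model_name)
    task = "translate" if translate else "transcribe"
    return model.transcribe(audio_path, task=task)
```

Calling `run_whisper("sample.mp3", translate=True)` should return a dict whose `"language"` key holds the detected language and whose `"segments"` list can be passed to `segments_to_lines` for display in a GUI text box. The GUI framework itself is left out, since the choice of framework is not stated here.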
Connect
------------------
Prodramp By Avkash (@prodramp)
Website - https://prodramp.com
LinkedIn - / prodramp
GitHub - https://github.com/prodramp/
AngelList - https://angel.co/company/prodramp
Facebook - / prodramp
Content Creator: Avkash Chauhan (@avkashchauhan)
/ avkashchauhan
Tags:
#openai #whisper #pytorch #tensorflow #prodramp #avkashchauhan