Best FREE Speech to Text AI – Whisper AI

WHISPER AI

In this step-by-step tutorial, learn how to use OpenAI’s Whisper AI to transcribe and convert speech or audio into text. Whisper AI performs extremely well and better than most human transcribers. It also outperforms most other speech to text tools in most environments.

  1. WHISPER AI
  2. INSTALL GOOGLE COLABORATORY
  3. CONFIGURE GOOGLE COLABORATORY
  4. INSTALL WHISPER AI ON GOOGLE COLABORATORY
  5. RUN WHISPER AI
  6. VIDEO STEPS
  7. RESOURCES

INSTALL GOOGLE COLABORATORY

  1. Visit Google Drive and setup your Google account if you don’t already have one setup.
  2. In the top left hand corner, click the New button-> More->Connect more apps.
  3. In the search field at the top of the dialog, type in Google Colaboratory and search.
  4. Select the first option “Colaboratory”
  5. Click the Install button, then Click Continue and hit OK to the button that Google Colaboratory is connected to Google Drive.
  6. Colaboratory has been installed.
  7. Click the Done button and close out the “Connect more apps” window.
  8. You have now installed Google Colaboratory.

CONFIGURE GOOGLE COLABORATORY

  1. Visit Google Drive and setup your Google account if you don’t already have one setup.
  2. In the top left hand corner, click the New button-> More->Colaboratory.
  3. This opens Colaboratory.
  4. In the top left hand corner, give the file a name by selecting Untitled.ipynb and renaming it to something more useful.
  5. Click the “Runtime” menu and select “Change runtime type” to open the “Notebook settings” dialog
  6. Set the “Hardware accelerator” to “GPU”. This will set it to use the graphics card where Whisper AI runs best.
  7. You have now configured Google Colaboratory.

INSTALL WHISPER AI ON GOOGLE COLABORATORY

  1. After following the previous steps in Google Colaboratory, open Colaboratory.
  2. Paste in the following code into the Colaboratory editor to install whisper and ffmpeg(support for audio and video files) to Colaboratory:
    !pip install git+https://github.com/openai/whisper.git
    !sudo apt update && sudo apt install ffmpeg
  3. Select Run icon to run the code to install Whisper and ffmpeg. It should take ~20 seconds.

RUN WHISPER AI

  1. After following the previous steps in Google Colaboratory, open Colaboratory.
  2. Click the Folder icon on the left hand navigation menu
  3. Drag and drop in the audio or video you want to transcribe.
  4. Click “OK” to the “Reminder, uploaded files will get deleted when this runtime is recycled.” dialog box.
  5. The file has been uploaded and you should see it under the Folder menu in the left navigation menu.
  6. Click to the code menu and paste in the following code to run Whisper on the file :
    !whisper "ENTER FILE NAME HERE" --model medium.en
    • Replace “ENTER FILE NAME HERE” with the name of the file you want to transcribe.
    • Replace medium.en with the model you would like to use- tiny, base, small, medium or large where tiny is the fastest, smallest and with the least accuracy and large takes longer, is a larger file and with highest quality model.
  7. Click the Run icon to run the code.
  8. You can see the transcript. You can also see 3 files added to the Folder- FILE.mp3.srt, FILE.mp3.txt and FILE.mp3.vtt files
    • FILE.mp3.txt contains all the text from the audio
    • FILE.mp3.vtt and FILE.mp3.srt are caption formats with timestamps
  9. To download the files, hover over the FILE.mp3.*, select the ellipsis menu and select Download.

VIDEO STEPS

  • 0:00 Introduction
  • 0:34 Whisper AI background
  • 1:20 Install Google Colaboratory
  • 2:10 Configure Google Colaboratory
  • 3:09 Install Whisper AI
  • 3:54 Upload audio or video
  • 4:22 Run Whisper AI
  • 6:06 Review results
  • 6:31 Transcribe another file
  • 6:42 Additional parameters
  • 7:35 Wrap up

RESOURCES

  • đź’ĄSPECIAL OFFER Get 99% accurate transcripts, captions and subtitles with Rev — the #1 speech-to-text service in the world. https://rev.pxf.io/DVGe7G (Disclosure: Signing up through this link gives me a small commission to support videos on this channel. The price to you is the same.)
  • Whisper GitHub page
  • Google Drive

One thought on “Best FREE Speech to Text AI – Whisper AI

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s