Best FREE Speech to Text AI – Whisper AI

kerrystratvertJanuary 19, 2023November 1, 2023Whisper AI

WHISPER AI

In this step-by-step tutorial, learn how to use OpenAI’s Whisper AI to transcribe and convert speech or audio into text. Whisper AI performs extremely well and better than most human transcribers. It also outperforms most other speech to text tools in most environments.

INSTALL GOOGLE COLABORATORY

Visit Google Drive and setup your Google account if you don’t already have one setup.
In the top left hand corner, click the New button-> More->Connect more apps.
In the search field at the top of the dialog, type in Google Colaboratory and search.
Select the first option “Colaboratory”
Click the Install button, then Click Continue and hit OK to the button that Google Colaboratory is connected to Google Drive.
Colaboratory has been installed.
Click the Done button and close out the “Connect more apps” window.
You have now installed Google Colaboratory.

CONFIGURE GOOGLE COLABORATORY

Visit Google Drive and setup your Google account if you don’t already have one setup.
In the top left hand corner, click the New button-> More->Colaboratory.
This opens Colaboratory.
In the top left hand corner, give the file a name by selecting Untitled.ipynb and renaming it to something more useful.
Click the “Runtime” menu and select “Change runtime type” to open the “Notebook settings” dialog
Set the “Hardware accelerator” to “GPU”. This will set it to use the graphics card where Whisper AI runs best.
You have now configured Google Colaboratory.

INSTALL WHISPER AI ON GOOGLE COLABORATORY

After following the previous steps in Google Colaboratory, open Colaboratory.
Paste in the following code into the Colaboratory editor to install whisper and ffmpeg(support for audio and video files) to Colaboratory:
!pip install git+https://github.com/openai/whisper.git
!sudo apt update && sudo apt install ffmpeg
Select Run icon to run the code to install Whisper and ffmpeg. It should take ~20 seconds.

RUN WHISPER AI

After following the previous steps in Google Colaboratory, open Colaboratory.
Click the Folder icon on the left hand navigation menu
Drag and drop in the audio or video you want to transcribe.
Click “OK” to the “Reminder, uploaded files will get deleted when this runtime is recycled.” dialog box.
The file has been uploaded and you should see it under the Folder menu in the left navigation menu.
Click to the code menu and paste in the following code to run Whisper on the file :
!whisper "ENTER FILE NAME HERE" --model medium.en
- Replace “ENTER FILE NAME HERE” with the name of the file you want to transcribe.
- Replace medium.en with the model you would like to use- tiny, base, small, medium or large where tiny is the fastest, smallest and with the least accuracy and large takes longer, is a larger file and with highest quality model.
Click the Run icon to run the code.
You can see the transcript. You can also see 3 files added to the Folder- FILE.mp3.srt, FILE.mp3.txt and FILE.mp3.vtt files
- FILE.mp3.txt contains all the text from the audio
- FILE.mp3.vtt and FILE.mp3.srt are caption formats with timestamps
To download the files, hover over the FILE.mp3.*, select the ellipsis menu and select Download.

VIDEO STEPS

0:00 Introduction
0:34 Whisper AI background
1:20 Install Google Colaboratory
2:10 Configure Google Colaboratory
3:09 Install Whisper AI
3:54 Upload audio or video
4:22 Run Whisper AI
6:06 Review results
6:31 Transcribe another file
6:42 Additional parameters
7:35 Wrap up

RESOURCES

💥SPECIAL OFFER Get 99% accurate transcripts, captions and subtitles with Rev — the #1 speech-to-text service in the world. https://rev.pxf.io/DVGe7G (Disclosure: Signing up through this link gives me a small commission to support videos on this channel. The price to you is the same.)
Whisper GitHub page
Google Drive

29 thoughts on “Best FREE Speech to Text AI – Whisper AI”

john2515 says:

January 21, 2023 at 5:55 pm

Hi Brian,

I have followed these instructions to the letter and cannot get a result. See the two attachments. Any ideas?

Cheers John

John Devenish 0434 389 068

LikeLike

Reply
Juan Elevancini says:

February 3, 2023 at 9:07 am

Thank you Kevin for the excellent article!

LikeLiked by 1 person

Reply
1. kerrystratvert says:
  
  November 1, 2023 at 9:55 am
  
  Glad it was helpful!
  
  LikeLike
  
  Reply
Fabio Vieira Santos says:

February 8, 2023 at 3:05 am

Great explanation Kevin. I wonder how to apply it for a Portuguese transcription. Could you give me some hints on that? Thank you very much!!!

LikeLike

Reply
1. Jairo Iván González Linares says:
  
  November 24, 2023 at 1:55 pm
  
  Hi Fabio,
  The following code worked for me, in my case, in Spanish.
  
  !whisper “audio1561784797.m4a” –model medium –language {“es”}
  
  –language {af,am,ar,as,az,ba,be,bg,bn,bo,br,bs,ca,cs,cy,da,de,el,en,es,et,eu,fa,fi,fo,fr,gl,gu,ha,haw,he,hi,hr,ht,hu,hy,id,is,it,ja,jw,ka,kk,km,kn,ko,la,lb,ln,lo,lt,lv,mg,mi,mk,ml,mn,mr,ms,mt,my,ne,nl,nn,no,oc,pa,pl,ps,pt,ro,ru,sa,sd,si,sk,sl,sn,so,sq,sr,su,sv,sw,ta,te,tg,th,tk,tl,tr,tt,uk,ur,uz,vi,yi,yo,yue,zh,
  Afrikaans,Albanian,Amharic,Arabic,Armenian,Assamese,Azerbaijani,Bashkir,Basque,Belarusian,Bengali,Bosnian,Breton,Bulgarian,Burmese,Cantonese,Castilian,Catalan,Chinese,Croatian,Czech,Danish,Dutch,English,Estonian,Faroese,Finnish,Flemish,French,Galician,Georgian,German,Greek,Gujarati,Haitian,Haitian Creole,Hausa,Hawaiian,Hebrew,Hindi,Hungarian,Icelandic,Indonesian,Italian,Japanese,Javanese,Kannada,Kazakh,Khmer,Korean,Lao,Latin,Latvian,Letzeburgesch,Lingala,Lithuanian,Luxembourgish,Macedonian,Malagasy,Malay,Malayalam,Maltese,Mandarin,Maori,Marathi,Moldavian,Moldovan,Mongolian,Myanmar,Nepali,Norwegian,Nynorsk,Occitan,Panjabi,Pashto,Persian,Polish,Portuguese,Punjabi,Pushto,Romanian,Russian,Sanskrit,Serbian,Shona,Sindhi,Sinhala,Sinhalese,Slovak,Slovenian,Somali,Spanish,Sundanese,Swahili,Swedish,Tagalog,Tajik,Tamil,Tatar,Telugu,Thai,Tibetan,Turkish,Turkmen,Ukrainian,Urdu,Uzbek,Valencian,Vietnamese,Welsh,Yiddish,Yoruba}
  
  LikeLike
  
  Reply
sjgamalath says:

February 13, 2023 at 10:52 pm

No words to say Kevin…You are a gifted person…….

LikeLike

Reply
HB says:

March 1, 2023 at 7:44 pm

Been using it this week, and I got you to thank for it.

LikeLike

Reply
Mike Nguyen says:

March 2, 2023 at 10:08 pm

Would like to learn more

LikeLike

Reply
Mehmet YASAR says:

March 8, 2023 at 8:57 am

It’s a great job!
Thanks Kevin!

LikeLike

Reply
Andreas Stangenes says:

March 11, 2023 at 6:24 am

Thanks for this very helpful guide! However, using Colaborative to upload mp4 files (or any files of some size) takes forever. Could you help figure out how to redirect Whisper to a google directory where I can upload my files much faster?

Much love,
Andreas, Norway

LikeLike

Reply
apple_wang says:

April 26, 2023 at 12:59 am

Thanks your sharing!
I have a question.
When I upload an audio file that is around 50MB and cannot work.
Would you like to share some ideas to fix it?

By the way, the transcribing process costs over 10 minutes, I saw you just spent 58 seconds and get done! Is the computer memory problem?

LikeLike

Reply
Asad Aleem says:

May 7, 2023 at 11:39 am

Hi Kevin. Thanks a million. Please do tell about the code of translation into Spanish and for the Large size. Thanks

LikeLike

Reply
fraktal says:

May 8, 2023 at 3:18 pm

Funktioniert nicht mehr es kommt immer eine Fehlermeldung.
/bin/bash: whisper: command not found

LikeLike

Reply
Thomas Wiemann says:

May 13, 2023 at 11:31 am

When i try using the run command, i get an error: /bin/bash: whisper: command not found
Any idea why?

LikeLike

Reply
1. Kovács Ervin says:
  
  October 20, 2023 at 3:51 am
  
  You have to install the Whisper program first
  
  LikeLike
  
  Reply
Tiyana says:

May 17, 2023 at 2:04 am

Thank you so much this was very helpful. Just wondering how should be the new code if the recording is in another language like Italian or Spanish?

LikeLike

Reply
Esam says:

May 19, 2023 at 9:15 am

thanks, Keven, for the great articles. Don’t you think we need to have How to install Whisper on your pc blog or article? thank you in advance!

LikeLike

Reply
Create says:

May 25, 2023 at 12:14 am

This is a great article. After watching the video. You are a great guy

LikeLike

Reply
Dan says:

July 2, 2023 at 12:17 am

I’m not able to paste the code : /

LikeLike

Reply
Soraya says:

July 10, 2023 at 1:45 pm

Thank you so, so much! This is helping me writing my papers in so much less time.

LikeLike

Reply
Dj Bouma says:

July 20, 2023 at 3:01 am

Excellent article! and great youtube tutorial. One thing is missing. how to expand the line with the propper language. Its now based on en and would be nice to describe the parameters that can be used. Overall verry good tutorial!

LikeLike

Reply
George Pang says:

July 21, 2023 at 9:14 am

Hi Kevin, Can you please help. The code was brilliant. Ive been using it for the past few weeks and it as woking well. But recently it stop transcribing, whenever I try to use it I get this (/bin/bash: line 1: whisper: command not found).

LikeLike

Reply
Tarun Saxena says:

July 26, 2023 at 1:48 am

Thanks Kevin. How could we use this to transcribe a video with two languages in it (e.g. English and Hindi)

LikeLike

Reply
Aaron says:

August 1, 2023 at 8:28 am

I have a problem: When i transcribe a video, ffmpeg fails to load because of an outdated version, can anyone help me?

LikeLike

Reply
Pingback: Best FREE Speech to Text AI – Whisper AI – Everything AI Possible: Unlocking the Future
Xihan says:

April 5, 2024 at 5:56 am

hi

I want to transfer Dutch video, the code is

!”Cam_1_-_2024.01.21_13.27.12.mp4″ –model medium –language {“de”}

but said “whisper: error: unrecognized arguments: –language {“de”}”,

how can I solve it?

LikeLike

Reply
Xihan says:

April 5, 2024 at 5:57 am

hi

I want to transfer Dutch video, the code is

!whisper ”Cam_1_-_2024.01.21_13.27.12.mp4″ –model medium –language {“de”}

but said “whisper: error: unrecognized arguments: –language {“de”}”,

how can I solve it?

LikeLike

Reply
Akeeeeem says:

April 9, 2024 at 10:36 am

Does Whisper send data back to the OpenAI servers while/after transcribing? I’m asking because of the confidentiality of the data used.

LikeLike

Reply
1. kerrystratvert says:
  
  April 22, 2024 at 11:55 am
  
  It should all run locally. If you’re concerned, you can always disconnect from the network while running Whisper.
  
  LikeLike
  
  Reply