Jump to content
JWTalk - Jehovah's Witnesses Online Community

Transcribing audio using Whisper AI


Recommended Posts

Kevin Stratvert describes how to use whisper AI to run on Google Colaboratory (a free service) to transcribe audio files to text.

 

The Results:

Transcribing a 40 min talk: "2011-MemorialTalk-HarfmannMartin.mp3" took about 15 mins.

The results were excellent. There were very few errors in transcription. I'm not sure if its because the audio quality was good and the speaker also spoke clearly.

 

Best FREE Speech to Text AI - Whisper AI

 

 

Instructions for transcribing are summarised here:
https://kevinstratvert.com/2023/01/19/best-free-speech-to-text-ai-whisper-ai/


 

Link to comment
Share on other sites

ThioJoe describes how he created a tool called "Auto Synced and Translated Dubs", to create YouTube videos that will let you switch the audio track to one of several languages, and hear a dubbed spoken version, instead of just having to read translated subtitles like you'd usually do.

 

 

Link to comment
Share on other sites

21 hours ago, luisenriquereyes said:

ThioJoe describes how he created a tool called "Auto Synced and Translated Dubs", to create YouTube videos that will let you switch the audio track to one of several languages, and hear a dubbed spoken version, instead of just having to read translated subtitles like you'd usually do.

 

 

 

At 12:37, ThioJoe talks about how impressed he is with whisper AI:

 

Now, if you're wondering about my whole workflow, like if you're looking to do this yourself, to transcribe the videos, I actually use OpenAI's "Whisper" model, which is free and you can run it locally on your own GPU if you have a powerful enough one. And I have found this model to be way more accurate than anything else I've seen,even Google's transcription API.
 
The Whisper thing even does punctuation pretty decently. I mean the first time I tried it, I put in a 4,500 word video that was 22 minutes long and it missed three words. I was astounded.
 
It's not always that good, but it's a really good starting point from which you can just play the video and then correct any minor changes you see as you go through it.

 

Link to comment
Share on other sites

I have been using WhisperAI for months.  I've converted most English talks and even some Tagalog ones from jwstream to text.  It is incredible.

Injustice is when a little girl in China is deprived of her childhood because she needs to glue the soles of your son’s latest Nike.

Link to comment
Share on other sites

6 hours ago, rockieboi said:

I have been using WhisperAI for months.  I've converted most English talks and even some Tagalog ones from jwstream to text.  It is incredible.

 

Do you use the google colaboratory method or do you run whisper AI on your windows desktop?

Link to comment
Share on other sites

I have been using Google collab so much that it came to a point that I can't use their free GPUs anymore.  I was using it to transcribe 10 meetings per day.  I have almost 1000 recordings from JW stream to transcribe and I have done around 200 already, and now I am only doing around 10-15 per week so that it won't limit my usage anymore.

Injustice is when a little girl in China is deprived of her childhood because she needs to glue the soles of your son’s latest Nike.

Link to comment
Share on other sites

I'm now running whisper locally using my GTX 1650 Ti 4GB graphics card.

I find the tiny.en model good enough for english transcriptions and its very fast.

The largest model I could run was the small.en which took 3.6GB of GPU memory.

 

The Details:

Spoiler

Install Python

Installed Python 3.10.9

 

Install CUDA

CUDA 12.0 - cuda_11.7.0_516.01_windows.exe

 

To install PyTorch:
pip3 install torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/cu117
pip show torch (to enquire version installed)
torch 1.13.1+cu117, torchvision 0.14.1+cu117, torchaudio 0.13.1+cu117

 

Install ffmpeg

ffmpeg 5.1.2

 

Install GIT

winget install --id Git.Git -e --source winget
Installed Git for Windows 2.39.2

 

TroubleChute's Youtube Link of how to do it:
https://youtu.be/XX-ET_-onYU 

The FREE & OFFLINE Audio to Text | Whisper: Install Guide | OpenAI Whisper | ASR
(by TroubleChute)

 

 

Installation Instructions in text can be found here:
https://hub.tcno.co/ai/whisper/install/

Link to comment
Share on other sites

I have not used the tiny model yet, but I used the small, base, medium, and large.  I find that medium is the most accurate for almost all of my meeting audios including the non english ones.  I tried the small and base models to make it faster, but there are a lot that I need to correct.

Thanks for the instructions by the way.  I might try it if I can run it overnight with my computer.😁

Injustice is when a little girl in China is deprived of her childhood because she needs to glue the soles of your son’s latest Nike.

Link to comment
Share on other sites

  • 3 months later...

whisper-desktop gives a simple interface to transcribe audio to text.

I've had good results with the medium model. The tiny model doesn't work.

 

The 2023 RC Fri AM 1 720P file with audio of 1:51:16 took 34 mins 48 sec using the medium model.

 

My PC has an AMD Ryzen 5-5600G CPU with integrated Radeon graphics. 

 

image.png.8d9bf9dfcab710e63201292a1eb1d0ec.png

 

 

Example of the debug console while transcribing.

 

image.thumb.png.6e886660346c20625d4485123bc80bea.png

 

 

Task manager showing how the Radeon graphics GPU is heavily loaded while the CPU is idling when transcribing:

 

image.thumb.png.6a16a44f0c5a84faceaf952fb86d650c.png

 

 

 

Link to comment
Share on other sites

  • 3 weeks later...

Subtitleedit now integrates speech to text transcribing.

It allows using either the whisper or vosk/kaldi transcribing methods.

 

This is the link to the subtitleedit help file on audio to text:
https://www.nikse.dk/subtitleedit/help#audio_to_text

 

The following Const-me option gives good results:
A GPU optimized version called Whisper Const-me (written in C++)

 

Sub1.jpg.97ca08820e9c9bffabbe4678871b18ec.jpg

 

 

 

sub2.thumb.jpg.8d15accf404fecacf7934462de1c1e9e.jpg

Link to comment
Share on other sites

  • 3 months later...

This makeuseof.com article talks about whisper desktop and also give instructions on how to install and use it.

 

How to Turn Your Voice to Text in Real Time With Whisper Desktop
BY ODYSSEAS KOURAFALOS
PUBLISHED MAY 19, 2023

 

Whisper Desktop is a third-party GUI for Whisper, made for everyone who'd prefer to click buttons instead of typing commands.

 

Whisper Desktop is a standalone solution that doesn't rely on an existing Whisper installation. As a bonus, it uses an alternative, optimized version of Whisper, so it should perform better than the standalone version.

 

 

 

 

Link to comment
Share on other sites

Join the conversation with your brothers and sisters!


You can post now, and then we will take you to the membership application. If you are already a member, sign in now to post with your existing account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

  • Recently Browsing   0 members

    • No registered users viewing this page.

About JWTalk.net - Jehovah's Witnesses Online Community

Since 2006, JWTalk has proved to be a well-moderated online community for real Jehovah's Witnesses on the web. However, our community is not an official website of Jehovah's Witnesses. It is not endorsed, sponsored, or maintained by any legal entity used by Jehovah's Witnesses. We are a pro-JW community maintained by brothers and sisters around the world. We expect all community members to be active publishers in their congregations, therefore, please do not apply for membership if you are not currently one of Jehovah's Witnesses.

×
×
  • Create New...

Important Information

We have placed cookies on your device to help make this website better. You can adjust your cookie settings, otherwise we'll assume you're okay to continue.

JWTalk 23.8.11 (changelog)