EasyWhisper

EasyWhisper is a simple Python package that converts speech to text locally using OpenAI's Whisper model.
It is a wrapper around the openai-whisper package.

Installation

Install with pip:

pip install easy-whisper-local

This installs the package and its dependencies.
Optionally, install a CUDA-enabled build of torch to speed up transcription on your GPU.
The model is downloaded automatically the first time you run the package and is saved in the models/ subdirectory.
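
To check whether a CUDA-enabled torch build is actually available, a minimal sketch like the one below may help. It relies only on `torch.cuda.is_available()`. The helper name `describe_device` is an illustrative assumption and not part of easy-whisper-local.

```python
import importlib.util


def describe_device() -> str:
    """Return "cuda" if torch is installed and sees a GPU, otherwise "cpu"."""
    # find_spec avoids an ImportError when torch is not installed at all.
    if importlib.util.find_spec("torch") is None:
        return "cpu"
    import torch

    return "cuda" if torch.cuda.is_available() else "cpu"


print(describe_device())
```

If this prints `cpu` on a machine with an NVIDIA GPU, the installed torch build is probably CPU-only.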

Usage

from easy_whisper_local.base import EasyWhisper
ew = EasyWhisper()
text = ew.file_to_text('audio.mp3')
times = ew.file_to_timestamps('audio.mp3')
dictionary = ew.file_to_dictionary('audio.mp3')
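
Each call above takes a single file path. For a folder of recordings, a small loop can be wrapped around `file_to_text`. The sketch below is only an illustration: `transcribe_folder` and its parameters are hypothetical names, and it assumes `file_to_text` accepts a path string and returns the transcript as a string.

```python
from pathlib import Path


def transcribe_folder(transcriber, folder, pattern="*.mp3"):
    """Transcribe every matching file in `folder` and save a .txt next to it.

    `transcriber` is any object with a file_to_text(path) method,
    for example an EasyWhisper instance (assumed to return a str).
    """
    results = {}
    for audio_path in sorted(Path(folder).glob(pattern)):
        text = transcriber.file_to_text(str(audio_path))
        # Write the transcript alongside the audio, e.g. talk.mp3 -> talk.txt
        audio_path.with_suffix(".txt").write_text(text, encoding="utf-8")
        results[audio_path.name] = text
    return results
```

With the package installed, `transcribe_folder(EasyWhisper(), "recordings")` would process every MP3 file in a (hypothetical) `recordings/` directory.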

The default model is 'medium'. To use another model, pass its name to the EasyWhisper constructor, for example:

ew = EasyWhisper(model="large")

You can also turn off verbose mode by passing verbose=False to the constructor:

ew = EasyWhisper(verbose=False)

Streaming (still in development)

You can also use the package to convert speech to text in real time.

from easy_whisper_local.streaming import EasyWhisperStreaming
ews = EasyWhisperStreaming()
# audio_listener is the audio source you supply; it is not defined in this snippet
for text in ews.stream_to_text(audio_listener):
    print(text)
