Skip to content

Latest commit

 

History

History
37 lines (35 loc) · 1.34 KB

README.md

File metadata and controls

37 lines (35 loc) · 1.34 KB

EasyWhisper

EasyWhisper is a simple Python package that allows you to convert speech to text using OpenAI's model Whisper locally.
It is a wrapper around the openai-whisper package.

Installation

Use

pip install easy-whisper-local

This is enough to install the package and its dependencies.
Besides, you can also install torch with CUDA support to speed up the process using your GPU.
The model will be downloaded automatically when you run the package for the first time, and it will be saved in the subdirectory models/.

Usage

from easy_whisper_local.base import EasyWhisper
ew = EasyWhisper()
text = ew.file_to_text('audio.mp3')
times = ew.file_to_timestamps('audio.mp3')
dictionary = ew.file_to_dictionary('audio.mp3')

The default model is 'medium', but if you want to use other model, you can do so by passing it to the EasyWhisper constructor, for example:

ew = EasyWhisper(model="large")

You can also turn off the verbose mode to the constructor by passing False to its parameter:

ew = EasyWhisper(verbose=False)

Streaming (still in development)

You can also use the package to convert speech to text in real time.

from easy_whisper_local.streaming import EasyWhisperStreaming
ews = EasyWhisperStreaming()
for text in ews.stream_to_text(audio_listener):
    print(text)