Skip to content

Latest commit

 

History

History
47 lines (29 loc) · 887 Bytes

README.rst

File metadata and controls

47 lines (29 loc) · 887 Bytes

reazonspeech.espnet.asr

This supplies the main interface for using ReazonSpeech ESPnet models.

More information can be found at https://research.reazon.jp/projects/ReazonSpeech

Install

$ git clone https://github.com/reazon-research/ReazonSpeech
$ pip install ReazonSpeech/pkg/espnet-asr

Usage

Python interface

from reazonspeech.espnet.asr import load_model, transcribe, audio_from_path

# Load ReazonSpeech model from Hugging Face
model = load_model()

# Read a local audio file
audio = audio_from_path("speech.wav")

# Recognize speech
ret = transcribe(model, audio)

Comnand-line interface

$ reazonspeech-espnet-asr speech.wav

Use -h to show a full help.

$ reazonspeech-espnet-asr -h