This project implements a Speech-to-Text (STT) Inference Server using FastAPI. It supports multiple backends, including Faster Whisper and OpenAI Whisper, and provides both batch and real-time transcription.

## Features

- Support for multiple STT backends (Faster Whisper, OpenAI Whisper)
- Batch audio file transcription
- Real-time audio stream transcription via WebSocket
- Configurable model parameters (device, quantization)
- Server metadata and statistics endpoints
## Requirements

- Python 3.8+
- FastAPI
- uvicorn
- faster-whisper
- openai-whisper
- PyAudio (for client-side audio capture)
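
The repository's `requirements.txt` is the source of truth; a minimal file consistent with the list above might look like this (PyAudio is only needed by a client that captures microphone audio, so it may be installed separately):

```text
fastapi
uvicorn
faster-whisper
openai-whisper
```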
## Installation

1. Clone the repository:

   ```bash
   git clone https://github.com/your-username/stt-inference-server.git
   cd stt-inference-server
   ```
2. Create a virtual environment and activate it:

   ```bash
   python -m venv venv
   source venv/bin/activate  # On Windows, use `venv\Scripts\activate`
   ```
3. Install the required packages:

   ```bash
   pip install -r requirements.txt
   ```
## Usage

1. Start the server:

   ```bash
   python main.py --backend faster_whisper --model_id base --device cpu --dtype int8
   ```
2. The server will start on `http://localhost:8000` by default.

3. Use the provided API endpoints to transcribe audio files, or connect via WebSocket for real-time transcription; illustrative client sketches for both follow below.
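
As an illustration of the batch workflow, the sketch below assumes a hypothetical `POST /transcribe` route that accepts a multipart file upload; the actual route and field names are listed in the API Documentation.

```python
import requests

# Hypothetical route and field name; check the API Documentation for the real ones.
URL = "http://localhost:8000/transcribe"

# Upload an audio file as multipart/form-data and print the returned transcript.
with open("sample.wav", "rb") as audio:
    response = requests.post(URL, files={"file": audio})

response.raise_for_status()
print(response.json())
```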
Refer to the API Documentation for detailed information on available endpoints and their usage.
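
For real-time use, here is a minimal client sketch built on the `websockets` package, assuming a hypothetical `/ws/transcribe` route that accepts raw audio chunks and sends back transcript messages:

```python
import asyncio
import websockets

# Hypothetical route; consult the API Documentation for the real one.
WS_URL = "ws://localhost:8000/ws/transcribe"

async def stream_file(path: str, chunk_size: int = 4096) -> None:
    async with websockets.connect(WS_URL) as ws:
        # Send the audio in small chunks, as a live source would.
        with open(path, "rb") as f:
            while chunk := f.read(chunk_size):
                await ws.send(chunk)
        # Print transcript messages until the server closes the connection.
        async for message in ws:
            print(message)

asyncio.run(stream_file("sample.wav"))
```

In a real live-capture client, the chunks would come from PyAudio rather than a file on disk.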
## Roadmap

- [ ] Add backend for QwenAudio models
- [ ] Optimize the server for batch inference
- [ ] Alternative implementation for `live_transcription`
## Contributing

Contributions are welcome! Please feel free to submit a Pull Request.
## License

This project is licensed under the MIT License - see the LICENSE file for details.