Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

KoljaB / RealtimeSTT Public

Notifications You must be signed in to change notification settings
Fork 200
Star 2.1k

Code
Issues 55
Pull requests 5
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Releases: KoljaB/RealtimeSTT

Releases · KoljaB/RealtimeSTT

v0.3.81

25 Nov 20:57

KoljaB

Compare

Choose a tag to compare

Loading

v0.3.81 Latest

Latest

RealtimeSTT 0.3.81

Enhanced CLI Interface

Introduced the -sed command for improved speech end detection
Added the -l command to set the language
Implemented the -L command to quickly display a list of all available audio input devices
Enabled setting the input device index .
Improved piping support for seamless with > or |

Assets 2

Loading

All reactions

v0.3.7

03 Nov 11:54

KoljaB

Compare

Choose a tag to compare

Loading

v0.3.7

RealtimeSTT 0.3.7

fixed a bug to make client terminate gracefully (logged websocket error in debug mode before)
reworked the CLI interfaces and added shorter commands (for example --writechunks is now -W or --write, for more information please look into the Client Server Readme)

Assets 2

Loading

alafsasa reacted with hooray emoji

t20n14 reacted with heart emoji

All reactions

🎉 1 reaction
❤️ 1 reaction

2 people reacted

v0.3.6

02 Nov 15:19

KoljaB

Compare

Choose a tag to compare

Loading

v0.3.6

RealtimeSTT 0.3.6

more logging for client/server:
Additional parameters for server:
- --use_extended_logging, writes extensive log messages for the recording worker, that processes the audio chunks
- --debug, enables debug logging for detailed server operations
- --logchunks, enables logging of incoming audio chunks (periods)
- --writechunks, saves received audio chunks to a WAV file
  Additional parameters for client:
- --debug, enables debug logging for detailed client operations
- --writechunks, saves recorded audio chunks to a WAV file
more logging for AudioToTextRecorder when called with use_extended_logging = True
new init_realtime_after_seconds parameter for AudioToTextRecorder to finetune the default of 0.2s

Assets 2

Loading

homelab-00, Bastigonzales, and t20n14 reacted with heart emoji

All reactions

❤️ 3 reactions

3 people reacted

v0.3.5

29 Oct 12:17

KoljaB

Compare

Choose a tag to compare

Loading

v0.3.5

RealtimeSTT 0.3.5

some upgrades and bugfixes for cli and server (linux support)

Assets 2

Loading

All reactions

v0.3.4

27 Oct 21:45

KoljaB

Compare

Choose a tag to compare

Loading

v0.3.4

RealtimeSTT 0.3.4

some upgrades and bugfixes for server
v0.3.2 yanked

Assets 2

Loading

All reactions

v0.3.2

27 Oct 13:32

KoljaB

Compare

Choose a tag to compare

Loading

v0.3.2

RealtimeSTT 0.3.2

New Features:

server/stt_server.py and AudioToTextRecorderClient class now support wake words (all parameters and callbacks of AudioToTextRecorder should now have been already implemented into AudioToTextRecorderClient class, please write an issue if you miss a functionality)
update microphone reconnect

Assets 2

Loading

All reactions

v0.3.1

21 Oct 13:41

KoljaB

Compare

Choose a tag to compare

Loading

v0.3.1

RealtimeSTT 0.3.1

New Features:

AudioToTextRecorderClient class: automatically starts a server if none is running and connects to it. The class shares the same interface as AudioToTextRecorder, making it easy to upgrade or switch between the two. (Work in progress, most parameters and callbacks of AudioToTextRecorder are already implemented into AudioToTextRecorderClient, but not all. Also the server can not handle concurrent (parallel) requests yet.)
New reworked CLI interface: "stt-server" to start the server, "stt" to start the client, look at "server" folder for more info
fixed #127
integrated PR #131

Assets 2

Loading

homelab-00, ngphuchoangnam, Bastigonzales, and t20n14 reacted with heart emoji

All reactions

❤️ 4 reactions

4 people reacted

v0.3.0

03 Oct 00:00

KoljaB

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v0.3.0

RealtimeSTT 0.3.0

New Features:

Soundcard Compatibility: Automatically adjusts from 48kHz downwards if 16kHz is unsupported, resampling to 16kHz.
Early Transcription: Added early_transcription_on_silence parameter to enable transcription during speech pauses, reducing overall latency.
Transcription Process Optimizations: Transcription process outsourced into separate class and optimized pipe communication for more stability and faster pipe communication, leading to fewer occurrances of audio chunks getting discarded due to queue size overflows.
Immediate Listen State: Fixed issue soi the system immediately returns to the listening state right after stopping the recording, preventing lost chunks.
Improved Logging: Always logs debug messages to a file, even if not explicitly configured. Option to disable logging with no_log_file parameter.
Transcription Time Display: New print_transcription_time parameter to show model processing time.

Bugfixes:

Chunk Handling: Enhanced chunk handling with the new allowed_latency_limit parameter, reducing dropped data during high-latency scenarios.

Assets 2

Loading

All reactions

v0.2.42

26 Sep 14:09

KoljaB

Compare

Choose a tag to compare

Loading

v0.2.42

clean_audio_buffer method added
preparations for functionality for automatically downsampling on soundcards that don't allow 16kHz recording

Assets 2

Loading

Bastigonzales and Chonlasitsk reacted with heart emoji

All reactions

❤️ 2 reactions

2 people reacted

v0.2.41

18 Aug 08:44

KoljaB

Compare

Choose a tag to compare

Loading

v0.2.41

fixed a typo that made v0.2.4 unable to use

Assets 2

Loading

All reactions

Previous 1 2 3 Next

Previous Next

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.