All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
- Unreleased
- 0.5.6-alpha - 2020-03-20
- 0.5.5-alpha - 2020-03-04
- 0.5.4-alpha - 2020-01-31
- 0.5.3-alpha - 2019-12-30
- 0.5.2-alpha - 2019-11-05
- 0.5.1-alpha - 2019-08-02
- 0.5.0-alpha - 2019-07-27
- 0.4.1-alpha - 2019-07-11
- 0.4.0-alpha - 2019-02-17
Unreleased
- Add support for Xun Fei Yun Speech-to-Text WebSocket API.
- Add support for Baidu Automatic Speech Recognition API. issue #68
- Add a character filter for the transcript result in XfyunWebSocketAPI.
- Change the audio_split_cmd replacement so that it applies only when the user has not modified the command.
- Change MAX_REGION_SIZE_LIMIT to 60 seconds.
- Fix the size count bug in list_to_googletrans when the last line is split.
- Remove Python 2.7 support.
0.5.6-alpha - 2020-03-20
- Add extra environment variables check when finding dependencies. issue #91
- Add argument parser input when the program is opened without arguments. issue #92
- Add subtitles processing when -SRC is not given.
- Change option -sml into -nsml.
- Change defaults for Auditok.
- Fix Google Speech-to-Text API empty result response bug. issue #89
- Deprecate Python 2.7 support.
0.5.5-alpha - 2020-03-04
- Add Google Cloud Speech-to-Text API Recognition Config json file input support.
- Add Speech-to-Text API recognition result json file output support. issue #76
- Add exception when receiving error from Google-Speech-v2.
- Add Nuitka compatibility code to stop a program compiled by Nuitka when using Google Cloud service account credentials. Nuitka pkg_resources.DistributionNotFound error
- Fix high memory/RAM usage partially by terminating child processes, lowering default multiprocessing counts and using gc.collect(0). issue #67, issue #74
- Fix dependency finding issue. issue #82
- Fix ass override tags translation issue. issue #79
0.5.4-alpha - 2020-01-31
- Add basic Google Cloud Speech-to-Text support. issue #10
- Add more bilingual subtitles formats output support. issue #72
- Remove gtransv2 support.
- Fix output format limits when input is a subtitles file.
0.5.3-alpha - 2019-12-30
- Fix excessive transcoding time issue. pull request #66
- Fix Auditok option issues. issue #70
- Fix output option issue. issue #73
0.5.2-alpha - 2019-11-05
- Add issue templates.
- Fix last row of empty translation text missing issue. issue #62
- Fix executable file detection problem in the current directory.
0.5.1-alpha - 2019-08-02
- Add translation source lang code auto match.
- Fix method list_to_googletrans index error bug. issue #48
- Fix unix subprocess.check_output compatibility. issue #47
- Fix googletrans full-width characters length too long issue. issue #49
0.5.0-alpha - 2019-07-27
- Add arguments for min and max region size. issue #3
- Add metadata.py. issue #5
- Add output file name detection to avoid any file overwriting.
- Add a new dev branch for pushing the latest development code.
- Add more output formats (ass, ssa, sub, mpl2, tmp). issue #20
- Add arguments for auditok.StreamTokenizer and energy_threshold. issue #30
- Add overwrite option -y for output overwrite and no input pause. issue #29
- Add specific .ass style when output format is .ass. issue #21
- Add timings generating function instead of using speech-to-text api. issue #14
- Add arguments for confidence control. issue #6
- Add arguments for dropping empty lines from speech-to-text results.
- Add a free API option by importing googletrans. issue #25
- Add bilingual subtitle output. issue #16
- Add simultaneous output of multiple subtitle file types (regions/source language/destination language/bilingual subtitles) when using the --output-files option.
- Add exception to stop the workflow in main(). issue #35
- Add bilingual subtitle styles input. issue #32
- Add subtitles translation. issue #38
- Add function to auto-replace ’ with ' in the translation result.
- Add py-googletrans control args. issue #36
- Add lang codes support (depends on the langcodes package). issue #34
- Add complex ass json output. issue #39
- Add audio preprocessing. issue #7
- Add options to control every ffmpeg command. issue #43
- Add temp file save function. issue #22
- Add only audio fragments output. issue #44
- Add subtitles (first line) language detection powered by googletrans. issue #40
- Add http and https proxy support (set via environment variables). issue #17
- Add i18n support. issue #9
- Add i18n language choice. issue #45
- issue #5.
  - Rewrite help messages.
  - Refactor argparse.
  - Refactor constants.
- Change dev branch into origin branch.
- Use alpha branch for alpha releases.
- Change docs.
- Change the audio conversion workflow to get better audio quality for processing. It currently creates two files from the original source file separately: a 48kHz/16bit/mono .wav for local speech region detection, and a 44.1kHz/24bit/mono .flac for upload to the Google Speech v2 API, in other words, for speech recognition. Note that Google-Speech-v2 is wrong about the supported number of .flac audio channels: according to my test, the API doesn't support 2-channel .flac files. agermanidis/autosub issue #155
- Refactor internal regions unit to millisecond. issue #23
- Refactor speech regions detection by using auditok. issue #27
- Refactor generate_subtitles into 3 parts. issue #24
- Refactor api_gen_text to 2 parts. One is speech_to_text. Another is text_translation. issue #33
- Refactor txt output. Now txt can output regions.
- Refactor list_to_sub_str. issue #37
- Fix vtt output replacing all commas with dots.
- issue #8
  - Fix python3 compatibility issues.
- Fix Nuitka build after updating Nuitka to 0.6.4 (environment: Anaconda2, python3.5).
0.4.1-alpha - 2019-07-11
- Add https speech-to-text api url and url choice argument. agermanidis/autosub pull request #135
- Add external speech-to-text regions control from external subtitle files. agermanidis/autosub pull request #159
- Add scripts for building, releasing, etc.
- Change docs.
- Fix wrong recognition results caused by vague language codes. agermanidis/autosub pull request #136
0.4.0-alpha - 2019-02-17
- Fix several issues. agermanidis/autosub pull request #128 by @iWangJiaxiang
- Fix "ffmpeg.exe" causing "Dependency not found: ffmpeg" on Windows.
- Fix "ValueError" when the response data of "SpeechRecognizer" couldn't be parsed as a JSON object.
- Fix temp folder permission denied error on Windows 10. agermanidis/autosub issue #15
- Fix crash caused by JSONDecodeError. agermanidis/autosub pull request #131 by @raryelcostasouza