Changelog

简体中文

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

Unreleased
- Added
- Changed
- Fixed
0.5.6-alpha - 2020-03-20
- Added
- Changed
- Deprecated
- Fixed
0.5.5-alpha - 2020-03-04
- Added
- Fixed
0.5.4-alpha - 2020-01-31
- Added
- Removed
- Fixed
0.5.3-alpha - 2019-12-30
- Fixed
0.5.2-alpha - 2019-11-05
- Added
- Fixed
0.5.1-alpha - 2019-08-02
- Added
- Fixed
0.5.0-alpha - 2019-07-27
- Added
- Changed
- Fixed
0.4.1-alpha - 2019-07-11
- Added
- Changed
- Fixed
0.4.0-alpha - 2019-02-17
- Fixed

Click up arrow to go back to TOC.

Unreleased

Added(Unreleased)

Add support for Xun Fei Yun Speech-to-Text WebSocket API.
Add support for Baidu Automatic Speech Recognition API. issue #68
Add chars filter for the transcript result in XfyunWebSocketAPI.

Changed(Unreleased)

Change the replacement condition of the audio_split_cmd only when the user doesn't modify it.
Change the MAX_REGION_SIZE_LIMIT into 60 seconds.

Fixed(Unreleased)

Fix the size count bug when the last line been split in list_to_googletrans.

Removed(Unreleased)

Remove Python 2.7 support.

0.5.6-alpha - 2020-03-20

Added(0.5.6-alpha)

Add extra environment variables check when finding dependencies. issue #91
Add arguments parser input when open it without arguments. issue #92
Add subtitles processing when not input -SRC.

Changed(0.5.6-alpha)

Change option -sml into -nsml.
Change defaults for Auditok.

Fixed(0.5.6-alpha)

Fix Google Speech-to-Text API empty result response bug. issue #89

Deprecated(0.5.6-alpha)

Deprecate Python 2.7 support.

0.5.5-alpha - 2020-03-04

Added(0.5.5-alpha)

Add Google Cloud Speech-to-Text API Recognition Config json file input support.
Add Speech-to-Text API recognition result json file output support. issue #76
Add exception when receiving error from Google-Speech-v2.
Add Nuitka compatibility codes to stop program compiled by Nuitka when using Google Cloud service account credentials. Nuitka pkg_resources.DistributionNotFound error

Fixed(0.5.5-alpha)

Fix high memory/RAM usage partially by terminating child processes, lowering default multiprocessing counts and using gc.collect(0). issue #67, issue #74
Fix dependency finding issue. issue #82
Fix ass override tags translation issue. issue #79

↑

0.5.4-alpha - 2020-01-31

Added(0.5.4-alpha)

Add basic Google Cloud Speech-to-Text support. issue #10
Add more bilingual subtitles formats output support. issue #72

Removed(0.5.4-alpha)

Remove gtransv2 support.

Fixed(0.5.4-alpha)

Fix output format limits when input is a subtitles file.

↑

0.5.3-alpha - 2019-12-30

Fixed(0.5.3-alpha)

Fix excessive transcoding time issue. pull request #66
Fix Auditok option issues. issue #70
Fix output option issue. issue #73

↑

0.5.2-alpha - 2019-11-05

Added(0.5.2-alpha)

Add issue templates.

Fixed(0.5.2-alpha)

Fix last row of empty translation text missing issue. issue #62
Fix executable file detection problem in the current directory.

↑

0.5.1-alpha - 2019-08-02

Added(0.5.1-alpha)

Add translation source lang code auto match.

Fixed(0.5.1-alpha)

Fix method list_to_googletrans index error bug. issue #48
Fix unix subprocess.check_output compatibility. issue #47
Fix googletrans full-wide chars length too long issue. issue #49

↑

0.5.0-alpha - 2019-07-27

Added(0.5.0-alpha)

Add arguments for min and max region size. issue #3
Add metadata.py. issue #5
Add output file name detection to avoid any file overwritting.
Add new dev branch for latest dev codes to push.
Add more output format(ass, ssa, sub, mpl2, tmp). issue #20
Add arguments for auditok.StreamTokenizer and energy_threshold. issue #30
Add overwrite option -y for output overwrite and no input pause. issue #29
Add specific .ass style when output format is .ass. issue #21
Add timings generating function instead of using speech-to-text api. issue #14
Add arguments for confidence control. issue #6
Add arguments for dropping empty lines from speech-to-text results.
Add free api to use by importing the googletrans. issue #25
Add bilingual subtitle output. issue #16
Add multi-types subtitles files output at the same time (regions/source language/destination language/bilingual subtitles) when using --output-files option.
Add exception to stop the workflow in main(). issue #35
Add bilingual subtitle styles input. issue #32
Add subtitles translate. issue #38
Add function to auto-replace ’ to ' in the translation result.
Add py-googletrans control args. issue #36
Add lang codes support.(Depend on langcodes package) issue #34
Add complex ass json output. issue #39
Add audio preprocessing. issue #7
Add options to control every ffmpeg command. issue #43
Add temp file save function. issue #22
Add only audio fragments output. issue #44
Add subtitles(first line) language detection powered by googletrans. issue #40
Add http, https proxy support.(Set environment variables) issue #17
Add i18n support. issue #9
Add i18n language choice. issue #45

↑

Changed(0.5.0-alpha)

issue #5.
- Rewrite help messages.
- Refactor argparse.
- Refactor constaints.
Change dev branch into origin branch.
Use alpha branch for alpha releases.
Change docs.
Change audio conversion workflow to get a better audio quality to process. Currently will create two files from the original source file separately. 48kHz/16bit/mono .wav for local speech regions finding. 44.1kHz/24bit/mono .flac for google speech v2 api upload or in other words, speech recognition. Need to point out that Google-Speech-v2 is wrong on the supported .flac audio channel number. According to my test the api doesn't support the 2-channel .flac file. agermanidis/autosub issue #155
Refactor internal regions unit to millisecond. issue #23
Refactor speech regions detection by using auditok. issue #27
Refactor generate_subtitles into 3 parts. issue #24
Refactor api_gen_text to 2 parts. One is speech_to_text. Another is text_translation. issue #33
Refactor txt output. Now txt can output regions.
Refactor list_to_sub_str. issue #37

Fixed(0.5.0-alpha)

Fix vtt output replacing all commas to dots issues.
issue #8
- Fix python3 compatibility issues.
- Fix Nuitka build after updating Nuitka to 0.6.4(Environment Anaconda2 python3.5).

↑

0.4.1-alpha - 2019-07-11

0.4.1-alpha release

Added(0.4.1-alpha)

Add https speech-to-text api url and url choice argument. agermanidis/autosub pull request #135
Add external speech-to-text regions control from external subtitle files. agermanidis/autosub pull request #159
Add scripts to build, release and etc.

Changed(0.4.1-alpha)

Change docs.

Fixed(0.4.1-alpha)

Fix vague language codes caused wrong recognition result. agermanidis/autosub pull request #136

↑

0.4.0-alpha - 2019-02-17

0.4.0-alpha release

Fixed(0.4.0-alpha)

Fix several issues. agermanidis/autosub pull request #128 by @iWangJiaxiang
- Fix "ffmpeg.exe" causes "Dependency not found: ffmpeg" on Windows.
- Fix "ValueError" when the response data of "SpeechRecognizer" couldn't be parsed to JSON Object.
- Fix Temp Folder Permissions Denied on Windows 10. agermanidis/autosub issue #15
Fix JSONDecodeError caused crash. agermanidis/autosub pull request #131 by @raryelcostasouza

↑

Files

CHANGELOG.md

Latest commit

History

CHANGELOG.md

File metadata and controls

Changelog

TOC

Unreleased

Added(Unreleased)

Changed(Unreleased)

Fixed(Unreleased)

Removed(Unreleased)

0.5.6-alpha - 2020-03-20

Added(0.5.6-alpha)

Changed(0.5.6-alpha)

Fixed(0.5.6-alpha)

Deprecated(0.5.6-alpha)

0.5.5-alpha - 2020-03-04

Added(0.5.5-alpha)

Fixed(0.5.5-alpha)

0.5.4-alpha - 2020-01-31

Added(0.5.4-alpha)

Removed(0.5.4-alpha)

Fixed(0.5.4-alpha)

0.5.3-alpha - 2019-12-30

Fixed(0.5.3-alpha)

0.5.2-alpha - 2019-11-05

Added(0.5.2-alpha)

Fixed(0.5.2-alpha)

0.5.1-alpha - 2019-08-02

Added(0.5.1-alpha)

Fixed(0.5.1-alpha)

0.5.0-alpha - 2019-07-27

Added(0.5.0-alpha)

Changed(0.5.0-alpha)

Fixed(0.5.0-alpha)

0.4.1-alpha - 2019-07-11

Added(0.4.1-alpha)

Changed(0.4.1-alpha)

Fixed(0.4.1-alpha)

0.4.0-alpha - 2019-02-17

Fixed(0.4.0-alpha)