- Add links to projects using sherpa-onnx. (#1345)
- Support lang/emotion/event results from SenseVoice in Swift API. (#1346)
- Support specifying max speech duration for VAD. (#1348)
- Add APIs for max speech duration in VAD for various programming languages (#1349)
- Allow tokens and hotwords to be loaded from buffered string directly (#1339)
- Fix computing features for CED audio tagging models. (#1341)
- Preserve previous result as context for next segment (#1335)
- Add Python binding for online punctuation models (#1312)
- Fix vad.Flush(). (#1329)
- Fix wasm app for streaming paraformer (#1328)
- Build websocket related binaries for embedded systems. (#1327)
- Fix the C API calls and create the TTS project file (#1324)
- Re-implement LM rescore for online transducer (#1231)
- Add VAD and keyword spotting for the Node package with WebAssembly (#1286)
- Fix releasing npm package and fix building Android VAD+ASR example (#1288)
- Add Tokens []string, Timestamps []float32, Lang string, Emotion string, Event string (#1277)
- Add VAD+SenseVoice example for C API (#1291)
- Add VAD+ASR example for Dart with CircularBuffer. (#1293)
- Fix VAD+ASR example for Dart API. (#1294)
- Avoid SherpaOnnxSpeakerEmbeddingManagerFreeBestMatches freeing null. (#1296)
- Fix releasing wasm app for vad+asr (#1300)
- Remove extra files from Linux/macOS/Windows JNI libs (#1301)
- Two-pass Android APK for SenseVoice (#1302)
- Downgrade Flutter SDK versions. (#1305)
- Reduce onnxruntime log output. (#1306)
- Provide prebuilt .jar files for different Java versions. (#1307)
- Flutter: Add lang, emotion, event to OfflineRecognizerResult (#1268)
- Use a separate thread to initialize models for Lazarus examples. (#1270)
- Object Pascal examples for recording and playing audio with PortAudio. (#1271)
- Text to speech API for Object Pascal. (#1273)
- Update Kotlin API to better release native objects and add user-friendly APIs. (#1275)
- Update wave-reader.cc to support 8/16/32-bit waves (#1278)
- Add WebAssembly for VAD (#1281)
- WebAssembly example for VAD + Non-streaming ASR (#1284)
- Add Pascal API for reading wave files (#1243)
- Pascal API for streaming ASR (#1246)
- Pascal API for non-streaming ASR (#1247)
- Pascal API for VAD (#1249)
- Add more C API examples (#1255)
- Add emotion and event of SenseVoice. (#1257)
- Support reading multi-channel wave files with 8/16/32-bit encoded samples (#1258)
- Enable IPO only for Release build. (#1261)
- Add Lazarus example for generating subtitles using Silero VAD with non-streaming ASR (#1251)
- Fix looking up OOVs in lexicon.txt for MeloTTS models. (#1266)
- Fix FFmpeg C API example (#1185)
- Fix splitting sentences for MeloTTS (#1186)
- Non-streaming WebSocket client for Java. (#1190)
- Fix copying asset files for Flutter examples. (#1191)
- Add Chinese+English TTS example for Flutter (#1192)
- Add speaker identification and verification example for Dart API (#1194)
- Fix reading non-standard wav files. (#1199)
- Add ReazonSpeech Japanese pre-trained model (#1203)
- Describe how to add new words for MeloTTS models (#1209)
- Remove libonnxruntime_providers_cuda.so as a dependency. (#1210)
- Fix setting SenseVoice language. (#1214)
- Support passing TTS callback in Swift API (#1218)
- Add MeloTTS example for iOS (#1223)
- Add online punctuation and casing prediction model for English language (#1224)
- Fix Python two-pass ASR examples (#1230)
- Add blank penalty for various language bindings
- Add Dart API for audio tagging
- Add Dart API for adding punctuations to text
- Prefix all C API functions with SherpaOnnx
- Fix the case when recognition results contain the symbol `"`. It caused issues when converting results to a JSON string.
- Support SenseVoice CTC models.
- Add Dart API for keyword spotter.
- Support zh-en TTS model from MeloTTS.
- Downgrade onnxruntime from v1.18.1 to v1.17.1
- Support Whisper large v3
- Update onnxruntime from v1.18.0 to v1.18.1
- Fix invalid UTF-8 sequence from Whisper for Dart API.
- Update onnxruntime from 1.17.1 to 1.18.0
- Add C# API for Keyword spotting
- Add Flush to VAD so that the last speech segment can be detected. See also k2-fsa#1077
- Support the iOS platform for Flutter.
- Build sherpa-onnx into a single shared library.
- Fix released packages. piper-phonemize was not included in v1.10.8.
- Fix released packages. There should be a lib directory.
- Support Android for Flutter.
- Fix passing C# string to C++
- Support stopping TTS generation
- Add inverse text normalization
- Add TTS
- Publish with CI
- Fix path separator on Windows.
- Support specifying lib path.
- Initial release.