Commit

fixed renaming error
w4hns1nn committed Jul 13, 2024
1 parent 2f49143 commit 43424f2
Showing 4 changed files with 10 additions and 9 deletions.
4 changes: 2 additions & 2 deletions MANIFEST.in
@@ -1,2 +1,2 @@
-include py_audio2face/assets/mark_arkit_solved_default.usd
-include py_audio2face/assets/mark_arkit_solved_streaming.usd
+recursive-include speechcraft/assets/prompts *.npz
+recursive-include speechcraft/assets/prompts/v2 *.npz
6 changes: 5 additions & 1 deletion README.md
@@ -102,6 +102,10 @@ audio = AudioFile().from_np_array(audio_numpy, sr=sample_rate)
audio.save("my_new_audio.wav")
```

+Note: The first time your are using speechcraft it will download the models.
+These files are quite big and can take a while to download.
+
+
## Web Service

![image of openapi server](docs/server_screenshot.png)
@@ -130,7 +134,7 @@ If this fails, you can download the files manually or with the model_downloader.
NOTE: The Webservice is built with FastTaskAPI. In this regard, for each request it will create a task and return a job id


-We highly recommend to use the media-toolkit package for file transmission. It will make your life much easier.
+We highly recommend to use [media-toolkit](https://github.com/SocAIty/media-toolkit) for file transmission. It will make your life much easier.
```python
from media_toolkit import AudioFile

4 changes: 2 additions & 2 deletions pyproject.toml
@@ -4,7 +4,7 @@ build-backend = "setuptools.build_meta"

[project]
name = "SpeechCraft"
-version = "0.0.1"
+version = "0.0.2"
description = "Create natural sounding audio from text, clone voices and use them. Convert voice to voice. Bark model."
readme = "README.md"
requires-python = ">=3.8"
@@ -40,7 +40,7 @@ full = [
]

[tool.setuptools.package-data]
-bark = ["assets/prompts/*.npz", "assets/prompts/v2/*.npz"]
+speakers = ["assets/prompts/*.npz", "assets/prompts/v2/*.npz"]


[tool.black]
5 changes: 1 addition & 4 deletions speechcraft/voice_cloning/hubert_manager.py
@@ -6,9 +6,6 @@


class HuBERTManager:



@staticmethod
def make_sure_hubert_installed(download_url: str = 'https://dl.fbaipublicfiles.com/hubert/hubert_base_ls960.pt',
model_path: str = 'hubert_base_ls960'):
@@ -28,7 +25,7 @@ def make_sure_hubert_installed(download_url: str = 'https://dl.fbaipublicfiles.c
def make_sure_tokenizer_installed(
local_tokenizer_path: str,
model: str = 'hubert_base_ls960_23.pth',
-repo: str = 'GitMylo/speechcraft-voice-cloning',
+repo: str = 'GitMylo/bark-voice-cloning',
):
"""
Downloads the tokenizer from the huggingface hub if not already downloaded