Replies: 5 comments 6 replies
-
Hi @Larrycmd There is an option to do that through the API, however, because of the way certain things work, you have to add an element of randomness to the names e.g. Lets say you are in text-gen-webui and you are chatting in there with a LLM bot. If you specify the name as name_001, name_002 etc then first of you have to read back the files list to always track where you are in the numbering. Over time this will slow things down as you have to read out the whole folder contents. Additionally if you dont track the names, you may close a session and re-open a session later on, which would result in the the files being over-written if the current numbering isn't tracked and even then, there's a further complication. Lets say you use that naming scheme mentioned above, you have generated 100 files name_001 to name_100, but you delete the first 50 files as you want to save name_050 to name_100, you then have a complication tracking the fact that its ok to use name_001 to name_49, but not 50 to 100 etc.. Im just pointing out it can get a bit complicated, hence there is always a need to add a unique ID number to the files in some way, otherwise you can always end up with the potential to over-write previous TTS generations. So changing the "name" portion of the file is easy, but naming the unique ID portion of the file can be complicated. Is there a specific section of AllTalk you want to target sequential numbering at? text-gen-webui, tts-generator, the API etc? Thanks |
Beta Was this translation helpful? Give feedback.
-
I would also like this. How about using a naming system similar to what 11labs does? |
Beta Was this translation helpful? Give feedback.
-
@Larrycmd @greek12man So tell me if I am getting this correct, what you are both asking for is an easier way to identify the visual of what you see on screen in the TTS_generator (line number or whatever) to match back to the file name somehow? Is that a fair summary? |
Beta Was this translation helpful? Give feedback.
-
What about keeping it mostly the way it is now, but including a function that appends a user provided prefix to the file name? That would allow people to use either a name or a date that they would manually enter. |
Beta Was this translation helpful? Give feedback.
-
Without going too in-depth on the inner working of things, I've put some code up for you to try, After you have performed a http://127.0.0.1:7851/static/tts_generator/tts_generator_test.html You will manually have to go to that link! (Note the Its been a bit of an ass to rework, but my testing seems to show it works, though youre all welcome to test. The format of the files is: {Custom-file-name} _ {ID number up to 99,999} _ {Unique_ID_number} There is simply no way to avoid having a unique ID number, both on generation and re-generation, without further web-browser caching issues. Im trying to spend most of my time coding AllTalk v2, so you are welcome to give this a go and feed back, however, as mentioned I may be moving over to Gradio for this, so wont be focussing too strongly on more code updates to this version. Thanks |
Beta Was this translation helpful? Give feedback.
-
It would be nice if there was an option to automatically name the output files like the names in the List e.g. ID1, or even better :
ID 001.wav.
It would be good to add a dictonary function so that yo can set one word to be spoken as if it was another.
Beta Was this translation helpful? Give feedback.
All reactions