@DigitalPhonetics

Speech and Language Technology (SaLT) at the University of Stuttgart

Research institute in the fields of speech, natural language processing, and machine learning

Pinned

  1. IMS-Toucan Public

    Controllable and fast Text-to-Speech for over 7000 languages!

    Python · 1.5k stars · 168 forks

  2. VoicePAT Public

    VoicePAT is a modular and efficient toolkit for voice privacy research, with a main focus on speaker anonymization.

    Shell · 47 stars · 4 forks

  3. bloomzmms Public

    Materials for the publication "Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training"

    Python · 2 stars

  4. conversational-tree-search Public

    Code and Data for Conversational Tree Search: A new task that bridges the gap between FAQ-style information retrieval and task-oriented dialog.

    Python · 6 stars

Repositories

Showing 10 of 17 repositories
  • Intrinsic-Subgraph-Generation-for-VQA Public

    Predicting a subgraph alongside the answer in a graph based VQA model

    Python · 8 stars · MIT · 1 fork · 0 open issues · 0 pull requests · Updated Dec 12, 2024
  • conversational-tree-search Public

    Code and Data for Conversational Tree Search: A new task that bridges the gap between FAQ-style information retrieval and task-oriented dialog.

    Python · 6 stars · 0 forks · 1 open issue · 0 pull requests · Updated Nov 19, 2024
  • IMS-Toucan Public

    Controllable and fast Text-to-Speech for over 7000 languages!

    Python · 1,497 stars · Apache-2.0 · 168 forks · 10 open issues · 0 pull requests · Updated Nov 7, 2024
  • speaker-anonymization Public

    Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it.

    Shell · 62 stars · GPL-3.0 · 4 forks · 2 open issues · 0 pull requests · Updated Sep 13, 2024
  • hard-negative-captions Public
    Python · 4 stars · 0 forks · 0 open issues · 0 pull requests · Updated Jul 3, 2024
  • bloomzmms Public

    Materials for the publication "Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training"

    Python · 2 stars · Apache-2.0 · 0 forks · 0 open issues · 0 pull requests · Updated Jun 16, 2024
  • multilingual-seq2seq-slu Public

    Materials for the publication "Leveraging Multilingual Self-Supervised Pretrained Models for Sequence-to-Sequence End-to-End Spoken Language Understanding"

    Python · 2 stars · Apache-2.0 · 0 forks · 0 open issues · 0 pull requests · Updated Jun 16, 2024
  • VoicePAT Public

    VoicePAT is a modular and efficient toolkit for voice privacy research, with a main focus on speaker anonymization.

    Shell · 47 stars · Apache-2.0 · 4 forks · 2 open issues · 0 pull requests · Updated May 14, 2024
  • diagraph Public

    DIAGRAPH: An open-source graphic interface for dialog flow design

    Python · 4 stars · GPL-3.0 · 0 forks · 0 open issues · 0 pull requests · Updated Oct 23, 2023
  • adviser Public

    ADvISER is a flexible framework to encourage task-oriented dialog system research & development

    Python · 58 stars · GPL-3.0 · 34 forks · 3 open issues · 8 pull requests · Updated Aug 14, 2023
