Skip to content
Change the repository type filter

All

    Repositories list

    • Predicting a subgraph alongside the answer in a graph based VQA model
      Python
      MIT License
      1800Updated Dec 12, 2024Dec 12, 2024
    • Code and Data for Conversational Tree Search: A new task that bridges the gap between FAQ-style information retrieval and task-oriented dialog.
      Python
      0610Updated Nov 19, 2024Nov 19, 2024
    • Controllable and fast Text-to-Speech for over 7000 languages!
      Python
      Apache License 2.0
      1681.5k100Updated Nov 7, 2024Nov 7, 2024
    • Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it.
      Shell
      GNU General Public License v3.0
      46220Updated Sep 13, 2024Sep 13, 2024
    • Python
      0400Updated Jul 3, 2024Jul 3, 2024
    • bloomzmms

      Public
      Materials for the publication "Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training"
      Python
      Apache License 2.0
      0200Updated Jun 16, 2024Jun 16, 2024
    • Materials for the publication "Leveraging Multilingual Self-Supervised Pretrained Models for Sequence-to-Sequence End-to-End Spoken Language Understanding"
      Python
      Apache License 2.0
      0200Updated Jun 16, 2024Jun 16, 2024
    • VoicePAT

      Public
      VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.
      Shell
      Apache License 2.0
      44720Updated May 14, 2024May 14, 2024
    • diagraph

      Public
      DIAGRAPH: An open-source graphic interface for dialog flow design
      Python
      GNU General Public License v3.0
      0400Updated Oct 23, 2023Oct 23, 2023
    • adviser

      Public
      ADvISER is a flexible framework to encourage task-oriented dialog system research & development
      Python
      GNU General Public License v3.0
      345838Updated Aug 14, 2023Aug 14, 2023
    • Code accompanying our paper on finetuning self-supervised general speech representations with a combination of contrastive and non-contrastive methods.
      Python
      Apache License 2.0
      0110Updated Oct 5, 2022Oct 5, 2022
    • IMS-Speech is a tool for German, English and Russian speech transcription aiming to facilitate research in various disciplines. We are willing to provide a speech transcription service with an intuitive web interface accessible with a wide range of computing devices and to people with various backgrounds. Our service is available here: https://7…
      Go
      MIT License
      2510Updated May 13, 2022May 13, 2022
    • Our_Fault

      Public
      A collaborative dialog game playable by a human and an AI system, designed to better understand how users view such an AI partner. The repository contains code for the game as well as dialog logs, survey responses, and annotations from a user study conducted with this scenario.
      Python
      GNU General Public License v3.0
      0000Updated Nov 10, 2021Nov 10, 2021
    • A project exploring ethical implications of chatbot design, in particular affective language style. The repository contains code, survey responses, and annotated data for the experiment conducted using this implementation.
      Python
      GNU General Public License v3.0
      0000Updated Nov 9, 2021Nov 9, 2021
    • CycleGAN-based Emotion Style Transfer as Data Augmentation for Speech Emotion Recognition
      Python
      GNU General Public License v3.0
      11210Updated Oct 7, 2019Oct 7, 2019
    • nlg-eval

      Public
      Code accompanying the INLG 2018 paper Sequence-to-Sequence Models for Data-to-Text Natural Language Generation: Word- vs. Character-based Processing and Output Diversity
      Python
      GNU General Public License v3.0
      0600Updated Aug 30, 2019Aug 30, 2019
    • Comparing attention-based convolutional and recurrent neural networks under adversarial attacks to investigate their success and limitations in machine reading comprehension
      Python
      GNU General Public License v3.0
      31000Updated Aug 24, 2018Aug 24, 2018