Organizations

@msra-alumni @MLNLP-World @NATSpeech

RayeRen/README.md

Hi there 👋

I am currently a research scientist at TikTok in Singapore.

I am working on audio-driven talking face generation, text-to-speech, and music generation research. If you are interested in any form of academic collaboration, please feel free to email me at [email protected]. We are hiring interns!

I received my bachelor's degree from Chu Kochen Honors College, Zhejiang University (浙江大学竺可桢学院) and my master's degree from the Department of Computer Science and Technology, Zhejiang University (浙江大学计算机科学与技术学院), where I was advised by Zhou Zhao (赵洲). I also collaborate closely with Xu Tan (谭旭), Tao Qin (秦涛) and Tie-yan Liu (刘铁岩) from Microsoft Research Asia.

I won the Baidu Scholarship (10 candidates worldwide each year) and the ByteDance Scholars Program (10 candidates worldwide each year) in 2020, and was selected as one of the Top 100 AI Chinese New Stars and as an AI Chinese New Star Outstanding Scholar (10 candidates worldwide each year).

My research interests include speech synthesis, neural machine translation, and automatic music generation. I have published 50+ papers at top international AI conferences such as NeurIPS, ICML, ICLR, and KDD.

To promote communication within the Chinese ML & NLP community, we (together with 11 other young scholars worldwide) founded the MLNLP community in 2021. I am honored to be one of the chairs of the MLNLP committee.

📎 Homepages

🔥 News

  • 2024.03: 🎉 Two papers were accepted to ICLR 2024
  • 2023.05: 🎉 Five papers were accepted to ACL 2023
  • 2023.01: DiffSinger was featured in a very popular video (2000k+ views) on Bilibili!
  • 2023.01: I joined TikTok as a speech research scientist in Singapore!
  • 2022.02: I released a modern and responsive academic personal homepage template. Stars and forks are welcome!

💻 Selected Research Papers

My full paper list is available on my personal homepage.

🎙 Audio and Speech Processing

👄 Talking Face Generation

📚 Machine Translation

🎼 Music Generation

🧑‍🎨 Generative Model

Pinned

  1. NATSpeech/NATSpeech Public

    A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including the official PyTorch implementations of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

    Python 970 100

  2. acad-homepage.github.io Public

    AcadHomepage: A Modern and Responsive Academic Personal Homepage

    SCSS 1.5k 3k

  3. MoonInTheRiver/DiffSinger Public

    DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

    Python 4.4k 717

  4. AIGC-Audio/AudioGPT Public

    AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

    Python 10.1k 868

  5. luping-liu/PNDM Public

    The official implementation of Pseudo Numerical Methods for Diffusion Models on Manifolds (PNDM, PLMS | ICLR 2022)

    Python 334 31

  6. MoonInTheRiver/NeuralSVB Public

    Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code

    Python 426 52