[WIP]Add speaker recognition example #5327

KPatr1ck · 2021-07-20T02:44:23Z

PR types

New feature

PR changes

dataset, model, example

Describe

Model:
- Add ECAPA-TDNN model
Datasets:
- Add VoxCeleb1 dataset
- Add OpenRIRNoise dataset
- Update api usage in other datasets
Data Augment
- Add on the fly data augment
Loss
- Add Additive Angular Margin Loss
Scripts
- Add training script of speaker identification
- Add speaker verification and EER evaluation

…ognition

ZeyuChen · 2021-07-21T04:13:10Z

@KPatr1ck any update？

KPatr1ck · 2021-07-21T12:27:58Z

@KPatr1ck any update？

数据集，模型，loss，训练和预测脚本已齐。
全量数据在训练对齐中。

KPatr1ck · 2021-07-21T12:31:53Z

@ZeyuChen @ranchlai
Doc string持续补齐中，model、dataset、训练预测脚本可以先reivew一波～

ZeyuChen · 2021-07-27T06:42:46Z

PaddleAudio/examples/speaker_recognition/model.py

+        for fc in self.blocks:
+            x = fc(x)
+
+        # KP: W和x的向量归一化，输出为余弦相似度，供Additive Angular Margin计算loss


使用标准的注释
# NOTE(xxxxxgithuid or username): blabla

ZeyuChen · 2021-07-27T06:43:43Z

PaddleAudio/examples/speaker_recognition/signal_processing.py

+    return 10**(SNR / 20)
+
+
+def convolve1d(


convolved1d和conv1d的区别是什么？

ZeyuChen · 2021-07-27T06:44:29Z

PaddleAudio/examples/speaker_recognition/signal_processing.py

+    return (hlpf + hhpf).reshape([1, -1, 1])
+
+
+def reverberate(waveforms,


参考SpeechBrain看这些基础function是否有注释

ZeyuChen · 2021-07-27T06:45:35Z

PaddleAudio/examples/speaker_recognition/speaker_verification.py

+
+                id2embedding_norm.update(dict(zip(ids, embeddings)))
+
+        # Score normalization based on trainning samples.


trainning -> training

ZeyuChen · 2021-07-27T06:48:49Z

PaddleAudio/paddleaudio/datasets/open_rir_noise.py

+__all__ = ['OpenRIRNoise']
+
+
+class OpenRIRNoise(Dataset):


Need more comments for dataset source

ZeyuChen · 2021-07-27T06:49:40Z

PaddleAudio/paddleaudio/datasets/voxceleb1.py

+__all__ = ['VoxCeleb1']
+
+
+class VoxCeleb1(Dataset):


More dataset source comments.

ZeyuChen · 2021-07-27T06:51:18Z

PaddleAudio/paddleaudio/models/ecapa_tdnn.py

+        return x + residual
+
+
+class ECAPA_TDNN(nn.Layer):


Move this model implementation to examples if without pretrained weight.

CLAassistant · 2024-09-18T15:05:41Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

KPatr1ck added 9 commits July 14, 2021 21:12

Add speaker recognition example

136b8e2

-

e0fd481

-

126a157

Add speaker verification and compute EER

bfd2c4f

Add audio augmentation

ac25ffb

Update usage of new APIs

cb1df8f

-

0d0a582

-

914088a

Merge remote-tracking branch 'update_stream/develop' into speaker_rec…

53411d8

…ognition

KPatr1ck mentioned this pull request Jul 20, 2021

[WIP]Add speaker recognition example #5322

Closed

KPatr1ck added 2 commits July 20, 2021 14:29

-

46f063a

-

e87edc8

Add score norm

7d0ead8

KPatr1ck marked this pull request as ready for review July 21, 2021 12:29

-

e5298de

ZeyuChen self-assigned this Jul 25, 2021

ZeyuChen requested a review from ranchlai July 25, 2021 16:37

ZeyuChen reviewed Jul 27, 2021

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP]Add speaker recognition example #5327

[WIP]Add speaker recognition example #5327

KPatr1ck commented Jul 20, 2021 •

edited

Loading

ZeyuChen commented Jul 21, 2021

KPatr1ck commented Jul 21, 2021

KPatr1ck commented Jul 21, 2021

ZeyuChen Jul 27, 2021

ZeyuChen Jul 27, 2021

ZeyuChen Jul 27, 2021

ZeyuChen Jul 27, 2021

ZeyuChen Jul 27, 2021

ZeyuChen Jul 27, 2021

ZeyuChen Jul 27, 2021

CLAassistant commented Sep 18, 2024

		return (hlpf + hhpf).reshape([1, -1, 1])


		def reverberate(waveforms,


		id2embedding_norm.update(dict(zip(ids, embeddings)))

		# Score normalization based on trainning samples.

		__all__ = ['OpenRIRNoise']


		class OpenRIRNoise(Dataset):

		__all__ = ['VoxCeleb1']


		class VoxCeleb1(Dataset):

[WIP]Add speaker recognition example #5327

Are you sure you want to change the base?

[WIP]Add speaker recognition example #5327

Conversation

KPatr1ck commented Jul 20, 2021 • edited Loading

PR types

PR changes

Describe

ZeyuChen commented Jul 21, 2021

KPatr1ck commented Jul 21, 2021

KPatr1ck commented Jul 21, 2021

ZeyuChen Jul 27, 2021

Choose a reason for hiding this comment

ZeyuChen Jul 27, 2021

Choose a reason for hiding this comment

ZeyuChen Jul 27, 2021

Choose a reason for hiding this comment

ZeyuChen Jul 27, 2021

Choose a reason for hiding this comment

ZeyuChen Jul 27, 2021

Choose a reason for hiding this comment

ZeyuChen Jul 27, 2021

Choose a reason for hiding this comment

ZeyuChen Jul 27, 2021

Choose a reason for hiding this comment

CLAassistant commented Sep 18, 2024

KPatr1ck commented Jul 20, 2021 •

edited

Loading