Day 94 – Multi-Speaker Speech Separation and Recognition Using SpeechBrain

We saw a post in the previous blog about SpeechBrain, Features, PreTrained models, and Speech Recognition On Different Languages By SpeechBrain.

Today, we are going to see in detail about Multi-Speaker Separation and Recognition.

What is Multi-Speaker Separation and Recognition?

When you were listening to audio and found that there were many people talking on the audio.  However, you want to hear audio from a particular person.  This feature requires high-end software or need to work with sound engineers or audio professionals to extract only the voice which you want. The emergence of Artificialy Intelligence brings this task very easy in just 13 lines of code and produce multi-speaker separation.

Let’s get into a code to check simple Multi-Speaker Separation and Recognition.

I have used SpeechBrain Pretrained models and audio files and downloaded mixed audio files (Audacity) from Azure Github.

To check my full code in Google Colab as well as here.

Copy to Clipboard
Copy to Clipboard
Copy to Clipboard
SpeechSeparation Download
Copy to Clipboard
Copy to Clipboard

I have just pasted only image of audio file image, please go to Google Colab and play there (not to upload huge MB here).

speaker1
Copy to Clipboard

I have just pasted only image of audio file image, please go to Google Colab and play there (not to upload huge MB here).

speaker2

Welcome to your SpeechBrain Quiz.  Practice, Practice, Practice...Practice makes perfect.

SpeechBrain_thumbnail

SpeechBrain is an open-source toolkit based on Pytorch developed exclusively for Speech technology.

I have downloaded files from Github repository, mixed two files into one wave file using Audacity.

You can see the file output after a below code.

Copy to Clipboard
Resampling the audio from 16000 Hz to 8000 Hz
Copy to Clipboard
Copy to Clipboard

I have just pasted only image of audio file image, please go to Google Colab and play there (not to upload huge MB here).

Copy to Clipboard

I have just pasted only image of audio file image, please go to Google Colab and play there (not to upload huge MB here).

Have anyone tried SpeechBrain?

Would you please comment below?

Welcome to your SpeechBrain Quiz.  Practice, Practice, Practice…Practice makes perfect.

By |2021-07-27T19:42:00+00:00July 21st, 2021|Artificial Intelligence|0 Comments

About the Author:

Leave A Comment