Day 95 – Speaker Voice Verification Using SpeechBrain
In previous posts, we covered SpeechBrain, its features and pretrained models, Multi-Speaker Speech Separation and Recognition Using SpeechBrain, and Speech Recognition On Different Languages By SpeechBrain.
Today, we are going to look at speaker voice verification in detail.
What is Speaker Voice Verification?
Sometimes we listen to two audio clips and feel we are hearing the same voice in both, even though the speakers are actually different. A speaker voice verification model checks whether the speakers in the two recordings are the same person and returns True or False.
Let's get into the code for a simple speaker voice verification example.
I have used SpeechBrain pretrained models and audio files, and downloaded mixed audio files (mixed with Audacity) from the Azure GitHub repository.
You can check my full code in Google Colab as well as here.
#Install Torchaudio, SpeechBrain and Transformers
!pip install torchaudio==0.8.1 #Temporary pin (until torchaudio 0.9 is supported in Colab)
!pip install speechbrain
!pip install transformers
#Import all libraries
import speechbrain as sb
from speechbrain.dataio.dataio import read_audio
from IPython.display import Audio
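Before verifying, you can load a clip with read_audio and play it in the notebook with Audio to hear what you are about to compare. This is a minimal sketch; the file name is a placeholder, so point it at one of the WAV files used below (the pretrained VoxCeleb models expect 16 kHz audio).
#Optional sanity check: load one clip and play it in the notebook (path is a placeholder)
signal = read_audio("sampledata_samevoice1.wav")
Audio(signal, rate=16000)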
#Download pretrained SpeakerRecognition from SpeechBrain
from speechbrain.pretrained import SpeakerRecognition
verification = SpeakerRecognition.from_hparams(source="speechbrain/spkrec-ecapa-voxceleb", savedir="pretrained_models/spkrec-ecapa-voxceleb")
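#Compare the two files: score is the similarity, prediction is True when the model decides both clips come from the same speaker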
score, prediction = verification.verify_files("speechbrain/spkrec-ecapa-voxceleb/sampledata_samevoice1.wav", "speechbrain/spkrec-ecapa-voxceleb/sampledata_samevoice2.wav")
print(prediction, score)
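Here, score is the cosine similarity between the speaker embeddings of the two clips, and prediction is True when that score clears the model's decision threshold. Roughly, this is what verify_files does under the hood. The sketch below (assuming the two WAV files are available locally; the paths are placeholders) makes that explicit by encoding each clip with the ECAPA-TDNN model and comparing the embeddings directly. It is a sketch of the idea, not the library source.
#Roughly what verify_files does: encode each clip into a fixed-size speaker embedding,
#then measure how similar the two embeddings are
import torch
signal1 = read_audio("sampledata_samevoice1.wav").unsqueeze(0) #shape [1, time]
signal2 = read_audio("sampledata_samevoice2.wav").unsqueeze(0)
emb1 = verification.encode_batch(signal1) #speaker embedding for clip 1
emb2 = verification.encode_batch(signal2) #speaker embedding for clip 2
similarity = torch.nn.CosineSimilarity(dim=-1)
print(similarity(emb1, emb2)) #higher values mean the voices are more likely the same speaker
This also shows why a pretrained model is needed: the network has learned, from many speakers, how to map a voice to an embedding where same-speaker clips land close together.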