A Comprehensive Hands-On Guide to Transfer Learning
Introduction
Machine learning and deep learning techniques are largely designed so that models have to be trained and tested from scratch for every new problem we want to solve, since each problem comes with its own dataset and class distribution. Transfer learning, in contrast, is the application of skills and knowledge learned and acquired in one task to another related learning task or situation.
The concept of transfer learning can be understood by analogy with the way humans learn from subject-matter experts. A teacher has deep expertise in the subject he or she teaches, and students benefit from receiving that information through lectures and classes. In this case, knowledge is being 'transferred' from teacher to student.
Another everyday example of transfer learning: someone who already knows how to ride a bicycle will find it much easier to learn to ride a motorcycle, because the balancing skills carry over.
In this blog, we are going to cover a detailed understanding of transfer learning, the types of transfer learning, and real-world applications of transfer learning.
Table of Contents:
- Understanding of Transfer Learning
- Transfer Learning for Deep Learning
- Types of Transfer Learning
- Applications of Transfer Learning
- Advantages of Transfer Learning
- Real-world Use-cases of Transfer Learning
- Conclusion
Understanding of Transfer learning:
To gain an understanding of transfer learning, we should first understand the difference between the traditional approach of building and training machine learning models and the concept of transfer learning.
Traditional machine learning is isolated: the knowledge gained from training on one task and its dataset stays with that task. No knowledge or skills are transferred or reused for related tasks on other datasets.
In transfer learning, you can leverage knowledge, in the form of parameters or weights, from previously trained models to train newer models, and even solve problems that have very little data available, which is the case most of the time in industry and business environments.
To apply transfer learning methods in business or industry use-cases, one should answer three questions:
- What to transfer: which part of the knowledge (weights, features, representations) from the source task is useful for the target task?
- When to transfer: in which situations will transferring knowledge actually help rather than hurt (avoiding negative transfer)?
- How to transfer: which technique (feature extraction, fine-tuning, and so on) should be used to carry the knowledge over?
Transfer learning for Deep Learning
Deep learning models are instances of inductive learning. The goal of inductive learning is to infer a mapping from a set of training instances; in classification, for example, the model learns how to map input features to class labels. For such a learner to generalize well to unseen data, it relies on a set of assumptions about the distribution of the training data. These sets of assumptions are known as the inductive bias. The inductive bias can be characterized by several factors, such as the hypothesis space the learner restricts itself to and the search process through that hypothesis space. These biases therefore affect what the model learns, and how, on the given task and domain.
Transfer learning is a concept that allows us to reuse a trained model instead of training one from scratch.
Now, let's look at two of the main strategies used in transfer learning for deep learning: feature extraction and fine-tuning.
Deep learning systems and models are layered architectures that learn different features at different layers (hierarchical representations of layered features). These layers are finally connected to a last layer (often a fully connected layer, in the case of supervised learning) to obtain the final output. In the feature extraction strategy, the pre-trained network, minus its final layer, serves as a fixed feature extractor for the new task.
"The key idea here is to just leverage the pre-trained model's weighted layers to extract features, but not to update the weights of the model's layers during training with new data for the new task."
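To make this concrete, here is a minimal sketch of the feature extraction strategy using keras. The dataset, layer sizes, and the binary classification head are placeholder assumptions for illustration:

```python
import tensorflow as tf
from tensorflow.keras import layers, models
from tensorflow.keras.applications import VGG16

# Load VGG16 pre-trained on ImageNet, dropping its original classification head.
base_model = VGG16(weights='imagenet', include_top=False, input_shape=(224, 224, 3))

# Freeze the pre-trained layers: their weights will NOT be updated during training.
base_model.trainable = False

# Add a new classifier head for the new task (a hypothetical 2-class problem).
model = models.Sequential([
    base_model,
    layers.Flatten(),
    layers.Dense(256, activation='relu'),
    layers.Dense(1, activation='sigmoid'),
])

model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])
# model.fit(train_images, train_labels, epochs=5)  # trains only the new head
```

Only the newly added dense layers are trained; the convolutional base acts purely as a feature extractor.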
Fine-tuning consists of unfreezing a few of the top layers of a frozen model base used for feature extraction, and jointly training both the newly added part of the model and these unfrozen top layers. This is called fine-tuning because it slightly adjusts the more abstract representations of the model being reused, in order to make them more relevant for the problem at hand.
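Continuing the sketch above, fine-tuning could look like the following. How many layers to unfreeze and the exact learning rate are judgment calls; the values here are illustrative:

```python
# Unfreeze the base model, then re-freeze all but its last few layers.
base_model.trainable = True
for layer in base_model.layers[:-4]:
    layer.trainable = False  # keep the earlier, more generic layers frozen

# Recompile with a much lower learning rate so the reused representations
# are only slightly adjusted rather than destroyed.
model.compile(
    optimizer=tf.keras.optimizers.Adam(learning_rate=1e-5),
    loss='binary_crossentropy',
    metrics=['accuracy'],
)
# model.fit(train_images, train_labels, epochs=5)  # jointly trains head + top layers
```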
Now, let's look at some of the pre-trained models that are commonly used in transfer learning for deep learning model development:
Pre-trained models are available to everyone through a variety of channels. The popular deep learning Python library keras provides an interface for downloading several well-known models, and since the majority of pre-trained models are open-sourced, they are also available online.
For computer vision: VGG-16, VGG-19, Inception V3, Xception, and ResNet-50.
For natural language processing: Word2Vec, GloVe, and FastText word embeddings, as well as transformer-based models such as BERT.
Types of Transfer Learning
The terms related to transfer learning have been used broadly, and frequently interchangeably, throughout the literature's many incarnations. Therefore, differentiating between transfer learning, domain adaptation, and multi-task learning can be difficult at times.
Domain adaptation is usually referred to in scenarios where the marginal probability distributions of the source and target domains differ, i.e., P(Xₛ) ≠ P(Xₜ). For example, a sentiment classifier trained on movie reviews (source domain) may need to be adapted before it works well on product reviews (target domain).
We talked earlier about the potential applications of feature-representation transfer, and it is worth emphasising once more that different layers in a deep learning network capture different feature sets. This fact can be used to learn domain-invariant features and improve their portability across domains. Instead of allowing the model to learn just any representation, we nudge the representations of both domains to be as similar as possible. This can be achieved by applying certain pre-processing steps directly to the representations, or by adding another objective to the source model that encourages similarity by confusing the model about which domain a sample comes from, hence the name domain confusion.
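One popular way to implement domain confusion is the gradient reversal layer from domain-adversarial training (Ganin et al., DANN). Below is a minimal sketch in TensorFlow/Keras; the input size, layer widths, and task heads are illustrative assumptions, not part of any fixed recipe:

```python
import tensorflow as tf

@tf.custom_gradient
def gradient_reversal(x):
    # Identity on the forward pass; flips the gradient sign on the backward pass,
    # so the shared feature extractor learns to CONFUSE the domain classifier.
    def grad(dy):
        return -dy
    return x, grad

# Shared features feed both a label head and a domain head; the domain head
# only ever sees gradient-reversed features.
inputs = tf.keras.Input(shape=(64,))
features = tf.keras.layers.Dense(32, activation='relu')(inputs)

label_output = tf.keras.layers.Dense(10, activation='softmax', name='label')(features)
reversed_features = tf.keras.layers.Lambda(gradient_reversal)(features)
domain_output = tf.keras.layers.Dense(1, activation='sigmoid', name='domain')(reversed_features)

model = tf.keras.Model(inputs, [label_output, domain_output])
model.compile(optimizer='adam',
              loss={'label': 'sparse_categorical_crossentropy',
                    'domain': 'binary_crossentropy'})
```

Minimizing the domain loss through reversed gradients pushes the shared features toward being domain-invariant, which is exactly the 'confusion' objective described above.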
In multi-task learning, several tasks are learned simultaneously without distinguishing between source and target. In contrast to transfer learning, where the learner initially has no knowledge of the target task, here the learner receives information about multiple tasks at once.
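As a minimal sketch, a multi-task model in Keras can share a common trunk between task-specific heads; the tasks, shapes, and loss weights below are hypothetical:

```python
import tensorflow as tf

# One shared trunk learns representations that serve two tasks at once.
inputs = tf.keras.Input(shape=(128,))
shared = tf.keras.layers.Dense(64, activation='relu')(inputs)
shared = tf.keras.layers.Dense(64, activation='relu')(shared)

# Task A: 5-way classification; Task B: scalar regression.
task_a = tf.keras.layers.Dense(5, activation='softmax', name='task_a')(shared)
task_b = tf.keras.layers.Dense(1, name='task_b')(shared)

model = tf.keras.Model(inputs, [task_a, task_b])
model.compile(optimizer='adam',
              loss={'task_a': 'sparse_categorical_crossentropy', 'task_b': 'mse'},
              loss_weights={'task_a': 1.0, 'task_b': 0.5})
```

Because gradients from both losses flow through the shared trunk, the representation is shaped by all tasks simultaneously, which is what distinguishes this setting from sequential transfer.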
Applications of Transfer learning
Textual data presents a variety of challenges for ML and deep learning. Text is typically vectorized or transformed using various techniques, and embeddings such as Word2vec and FastText, created from large training datasets, can be reused directly on new tasks.
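For instance, such pre-trained embeddings can be downloaded and reused without any retraining. A minimal sketch with the gensim library (the model name below is one of the entries in gensim's downloader catalogue):

```python
import gensim.downloader as api

# Download pre-trained Word2vec vectors (trained on the Google News corpus).
vectors = api.load('word2vec-google-news-300')

# Reuse the learned representations directly on a new task.
print(vectors.most_similar('king', topn=3))   # nearest neighbours in embedding space
print(vectors['computer'][:5])                # first 5 dimensions of a word vector
```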
Automatic Speech Recognition (ASR) models developed for English have been used successfully to improve speech recognition performance for other languages, such as German. Automatic speaker identification is another area where transfer learning has been very helpful.
Deep learning has been used quite successfully for a variety of computer vision tasks, such as object detection and recognition, using various CNN architectures.
Advantages of transfer learning
We have seen the types of transfer learning and its applications; now let's look at some of the advantages of transfer learning:
- Improved baseline performance: the transferred knowledge gives the model a better starting point than random initialization.
- Faster model development: reusing pre-trained weights reduces training time compared to training from scratch.
- Improved final performance: the target model can converge to a better result than one trained only on the target dataset.
- Less data required: as noted earlier, transfer learning makes it possible to solve problems with very little labelled data.
Real-world use-cases of Transfer learning
Conclusion:
In this article, we developed a detailed understanding of transfer learning techniques and the types of transfer learning available in deep learning. We also looked at some of the real-world applications and use-cases of transfer learning in deep learning.