PyTorch -V
Contents:
21. Model Optimization, Quantization, and Deployment with TorchScript & ONNX
22. Generative Adversarial Networks (GANs) with PyTorch
23. Variational Autoencoders (VAEs) with PyTorch
24. Transfer Learning with Pretrained Models in PyTorch
🧠 Section 21: Model Optimization, Quantization, and Deployment with TorchScript & ONNX
Building a model is only half the journey — deploying it efficiently in the real world is where PyTorch truly shines.
In this section, we’ll cover how to make your model faster, lighter, and deployable on different platforms (from cloud to edge devices).
You’ll learn:
- What model optimization means
- How to use TorchScript for deployment
- How to quantize and prune a model for efficiency
- Exporting to ONNX format for cross-platform inference
- Real-world deployment strategies
⚙️ 21.1 Why Model Optimization Matters
Deep learning models often contain millions of parameters and require large compute resources.
When deploying models on mobile, embedded systems, or web apps, optimization helps by:
- 🚀 Reducing latency (faster predictions)
- 💾 Reducing memory footprint
- 🔋 Improving battery efficiency (on mobile)
- ☁️ Lowering cloud compute costs
Optimization is the process of transforming your model without sacrificing much accuracy.
🧩 21.2 TorchScript — Making PyTorch Models Deployable
PyTorch models are usually written in Python, which can’t directly run in C++ environments or on mobile devices.
TorchScript allows you to serialize (.pt) models and run them independently of Python.
🔹 Converting a Model with TorchScript
There are two main ways:
- Tracing – Records operations as they run (good for static models)
- Scripting – Converts models by analyzing Python code (handles dynamic control flow)
Example:
import torch
# Suppose 'model' is your trained PyTorch model
model.eval()
# Option 1: Tracing
example_input = torch.rand(1, 3, 128, 128)
traced_model = torch.jit.trace(model, example_input)
# Option 2: Scripting
scripted_model = torch.jit.script(model)
# Save and load
traced_model.save("model_traced.pt")
loaded_model = torch.jit.load("model_traced.pt")
# Inference
output = loaded_model(example_input)
print(output)
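The practical difference between tracing and scripting shows up with data-dependent control flow. Here is a minimal sketch (the toy `Clamp` module below is purely illustrative, not from the text above): tracing records only the branch taken for the example input, while scripting preserves the `if`/`else`.

```python
import torch

class Clamp(torch.nn.Module):
    def forward(self, x):
        if x.sum() > 0:   # data-dependent control flow
            return x
        return -x

m = Clamp()
pos = torch.ones(3)
neg = -torch.ones(3)

traced = torch.jit.trace(m, pos)   # records only the branch taken for 'pos'
scripted = torch.jit.script(m)     # keeps both branches

print(traced(neg))    # still returns the input unchanged (wrong branch baked in)
print(scripted(neg))  # returns the negated tensor (control flow preserved)
```

PyTorch even emits a `TracerWarning` here, which is a good signal that scripting is the safer choice for such models.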
✅ Why TorchScript?
- Runs in C++, C#, Java, or Swift environments
- Enables PyTorch Mobile and TorchServe deployment
- Faster inference due to graph optimization
⚡ 21.3 Model Quantization
Quantization reduces the precision of numbers used in the model (e.g., from 32-bit floats to 8-bit integers).
This leads to smaller and faster models — often with minimal loss in accuracy.
There are three types:
| Type | Description | When to Use |
|---|---|---|
| Dynamic Quantization | Quantizes weights after training | Text / NLP models |
| Static Quantization | Quantizes both weights & activations (requires calibration) | Vision / CNN models |
| Quantization-Aware Training (QAT) | Simulates quantization during training | When accuracy is critical |
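To build intuition for what the precision reduction does, here is a plain-Python sketch of affine int8 quantization (illustrative only; PyTorch's real quantization kernels are far more involved than this):

```python
def quantize_int8(values):
    """Affine quantization: map each float to int8 via a scale and zero-point."""
    lo, hi = min(values), max(values)
    scale = (hi - lo) / 255.0 if hi > lo else 1.0
    zero_point = round(-128 - lo / scale)
    q = [max(-128, min(127, round(v / scale) + zero_point)) for v in values]
    return q, scale, zero_point

def dequantize(q, scale, zero_point):
    """Recover approximate floats; per-value error is bounded by the scale."""
    return [(qi - zero_point) * scale for qi in q]

weights = [0.1, -0.5, 0.9, 0.0]
q, scale, zp = quantize_int8(weights)
recovered = dequantize(q, scale, zp)
print(q, recovered)
```

Each 32-bit float becomes one byte, which is where the roughly 4x size reduction comes from; the reconstruction error stays below one quantization step.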
🔹 Example: Dynamic Quantization
import torch.quantization
quantized_model = torch.quantization.quantize_dynamic(
model, {torch.nn.Linear}, dtype=torch.qint8
)
# Parameter counts don't change under quantization; compare serialized sizes instead
import os
torch.save(model.state_dict(), "model_fp32.pth")
torch.save(quantized_model.state_dict(), "model_int8.pth")
print("Original size (MB):", os.path.getsize("model_fp32.pth") / 1e6)
print("Quantized size (MB):", os.path.getsize("model_int8.pth") / 1e6)
- ✅ 2x–4x smaller size
- ✅ 1.5x–3x faster inference
⚖️ 21.4 Model Pruning
Pruning removes redundant or less important parameters to shrink model size.
Example — unstructured pruning:
import torch.nn.utils.prune as prune

for name, module in model.named_modules():
    if isinstance(module, torch.nn.Linear):
        prune.l1_unstructured(module, name='weight', amount=0.3)
This removes 30% of weights based on the smallest L1 norm.
After pruning, make it permanent:
prune.remove(module, 'weight')
✅ Useful when deploying to mobile or embedded systems
✅ Works best after fine-tuning for accuracy recovery
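You can verify the achieved sparsity directly. A small self-contained sketch on a toy `nn.Linear` (the layer here is ours, for illustration only):

```python
import torch
import torch.nn.utils.prune as prune

layer = torch.nn.Linear(10, 10)  # toy layer with 100 weights
prune.l1_unstructured(layer, name='weight', amount=0.3)

# 30% of the weights are now exactly zero
sparsity = float((layer.weight == 0).sum()) / layer.weight.numel()
print(f"Sparsity: {sparsity:.2f}")

prune.remove(layer, 'weight')  # bake the pruning mask into the weight tensor
```

Note that unstructured pruning stores zeros rather than shrinking tensor shapes, so the real speed/size win usually requires sparse-aware runtimes or subsequent quantization.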
Sponsor: This content is sponsored by SBO Digital Marketing.
📦 21.5 Exporting Models to ONNX
ONNX (Open Neural Network Exchange) is an open standard for representing machine learning models — enabling interoperability between frameworks like PyTorch, TensorFlow, Caffe2, and scikit-learn.
🔹 Export PyTorch Model to ONNX
dummy_input = torch.randn(1, 3, 128, 128)
torch.onnx.export(
model,
dummy_input,
"model.onnx",
input_names=["input"],
output_names=["output"],
dynamic_axes={"input": {0: "batch_size"}, "output": {0: "batch_size"}},
)
🔹 Verify the ONNX Model
import onnx
onnx_model = onnx.load("model.onnx")
onnx.checker.check_model(onnx_model)
print("Model is valid!")
✅ ONNX models can be run using ONNX Runtime, TensorRT, or OpenVINO for even faster inference.
✅ Ideal for cross-framework deployment (e.g., PyTorch → TensorFlow → mobile).
🚀 21.6 Deployment Workflows
Let’s explore how to serve optimized PyTorch models in real-world scenarios.
🔹 1. TorchServe (Production API Deployment)
TorchServe (by AWS & PyTorch) allows you to deploy models as REST APIs.
Steps:
torch-model-archiver --model-name catdog --version 1.0 --serialized-file model_traced.pt --handler image_classifier
mkdir model_store
mv catdog.mar model_store/
torchserve --start --model-store model_store --models catdog=catdog.mar
Access inference via:
curl http://127.0.0.1:8080/predictions/catdog -T test_image.jpg
🔹 2. PyTorch Mobile
For Android/iOS deployment, use TorchScript models:
# Convert and optimize
model.eval()
scripted_model = torch.jit.script(model)
scripted_model._save_for_lite_interpreter("mobile_model.ptl")
Then integrate into an Android app via PyTorch Mobile libraries.
🔹 3. Web/Edge Deployment
-
Use ONNX Runtime Web or TensorFlow.js for browser inference.
-
Or deploy to NVIDIA Jetson, Raspberry Pi, or Edge TPU devices.
🔬 21.7 Real-World Use Cases
| Industry | Use Case | Deployment Platform |
|---|---|---|
| 🏥 Healthcare | Tumor classification | TorchServe API |
| 🛒 Retail | Product image tagging | ONNX Runtime |
| 🚗 Automotive | Driver monitoring | PyTorch Mobile |
| 📱 Social Media | Real-time filters | TorchScript (mobile) |
| 📡 IoT | Edge-based anomaly detection | Quantized model on Jetson Nano |
🧮 21.8 Benchmarking Performance
Measure improvements post-optimization:
import time

model.eval()
input_data = torch.randn(1, 3, 128, 128)
with torch.no_grad():  # disable autograd bookkeeping for a fair timing
    start = time.time()
    for _ in range(100):
        _ = model(input_data)
    end = time.time()
print("Average inference time:", (end - start) / 100)
Compare the original, TorchScript, and quantized models to observe:
- Latency reduction
- Memory efficiency
- Accuracy retention
💡 21.9 Best Practices for Deployment
✅ Use TorchScript for production inference
✅ Apply quantization and pruning after fine-tuning
✅ Export to ONNX for cross-platform compatibility
✅ Benchmark before and after optimization
✅ Monitor real-world performance (latency, throughput, accuracy drift)
🏁 21.10 Summary
- You learned how to optimize PyTorch models for speed and efficiency.
- Explored TorchScript, quantization, pruning, and ONNX exports.
- Covered deployment workflows via TorchServe, PyTorch Mobile, and ONNX Runtime.
- Gained practical insights into production-ready AI systems.
🧩 Section 22: Generative Adversarial Networks (GANs) with PyTorch
🎯 Objective
In this section, we’ll explore Generative Adversarial Networks (GANs) — one of the most fascinating deep learning architectures. You’ll understand how GANs work, the intuition behind them, and how to implement a simple GAN in PyTorch from scratch.
🧠 What Are GANs?
Generative Adversarial Networks (GANs) were introduced by Ian Goodfellow in 2014. They consist of two neural networks competing against each other — the Generator and the Discriminator.
- Generator (G): Learns to create fake data that looks real.
- Discriminator (D): Learns to distinguish between real data and fake data.
Over time, both networks improve through this adversarial training process — the generator gets better at fooling the discriminator, and the discriminator gets better at detecting fakes.
⚙️ How GANs Work — The Core Idea
1. The Generator takes random noise (z) as input and produces a fake image (or data sample).
2. The Discriminator takes both real and fake samples and outputs a probability (real vs fake).
3. Both networks are trained simultaneously:
   - Discriminator loss: correctly classify real and fake samples.
   - Generator loss: fool the discriminator into believing fake samples are real.
Mathematical Formulation
The overall objective can be expressed as a minimax game:
[
\min_G \max_D V(D, G) = \mathbb{E}_{x \sim p_{data}(x)}[\log D(x)] + \mathbb{E}_{z \sim p_z(z)}[\log (1 - D(G(z)))]
]
- ( D(x) ): Probability that x is real.
- ( G(z) ): Generated (fake) data from random noise z.
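At the theoretical optimum the discriminator can do no better than guessing, so D(x) = D(G(z)) = 0.5 and the value of the game is -log 4. A quick numeric check of that claim:

```python
import math

# Value of V(D, G) when D outputs 0.5 for both real and fake samples
v_optimum = math.log(0.5) + math.log(1 - 0.5)
print(v_optimum)  # -1.3862... = -log(4)
```

In practice the discriminator loss hovering near this regime is one rough sign that neither network is overpowering the other.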
🧩 Architecture Overview
| Component | Input | Output | Purpose |
|---|---|---|---|
| Generator | Random noise (z) | Fake image | Learn data distribution |
| Discriminator | Real/Fake image | Probability (0–1) | Classify authenticity |
💻 Implementing a Simple GAN with PyTorch
Here’s how you can implement a simple GAN for generating handwritten digits using the MNIST dataset.
Step 1: Import Dependencies
import torch
import torch.nn as nn
import torch.optim as optim
from torchvision import datasets, transforms
from torch.utils.data import DataLoader
import matplotlib.pyplot as plt
Step 2: Define Hyperparameters
latent_dim = 100
batch_size = 128
lr = 0.0002
epochs = 50
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
Step 3: Data Loader
transform = transforms.Compose([
transforms.ToTensor(),
transforms.Normalize([0.5], [0.5])
])
train_data = datasets.MNIST(root='./data', train=True, transform=transform, download=True)
train_loader = DataLoader(train_data, batch_size=batch_size, shuffle=True)
Step 4: Build the Generator
class Generator(nn.Module):
    def __init__(self):
        super(Generator, self).__init__()
        self.model = nn.Sequential(
            nn.Linear(latent_dim, 256),
            nn.ReLU(),
            nn.Linear(256, 512),
            nn.ReLU(),
            nn.Linear(512, 1024),
            nn.ReLU(),
            nn.Linear(1024, 28*28),
            nn.Tanh()
        )

    def forward(self, z):
        img = self.model(z)
        return img.view(z.size(0), 1, 28, 28)
Step 5: Build the Discriminator
class Discriminator(nn.Module):
    def __init__(self):
        super(Discriminator, self).__init__()
        self.model = nn.Sequential(
            nn.Linear(28*28, 512),
            nn.LeakyReLU(0.2),
            nn.Linear(512, 256),
            nn.LeakyReLU(0.2),
            nn.Linear(256, 1),
            nn.Sigmoid()
        )

    def forward(self, img):
        img_flat = img.view(img.size(0), -1)
        validity = self.model(img_flat)
        return validity
Step 6: Initialize Models and Optimizers
generator = Generator().to(device)
discriminator = Discriminator().to(device)
criterion = nn.BCELoss()
optimizer_G = optim.Adam(generator.parameters(), lr=lr)
optimizer_D = optim.Adam(discriminator.parameters(), lr=lr)
Step 7: Training Loop
for epoch in range(epochs):
    for imgs, _ in train_loader:
        imgs = imgs.to(device)
        valid = torch.ones(imgs.size(0), 1).to(device)
        fake = torch.zeros(imgs.size(0), 1).to(device)

        # Train Generator
        optimizer_G.zero_grad()
        z = torch.randn(imgs.size(0), latent_dim).to(device)
        generated_imgs = generator(z)
        g_loss = criterion(discriminator(generated_imgs), valid)
        g_loss.backward()
        optimizer_G.step()

        # Train Discriminator
        optimizer_D.zero_grad()
        real_loss = criterion(discriminator(imgs), valid)
        fake_loss = criterion(discriminator(generated_imgs.detach()), fake)
        d_loss = (real_loss + fake_loss) / 2
        d_loss.backward()
        optimizer_D.step()

    print(f"Epoch [{epoch+1}/{epochs}] | D loss: {d_loss.item():.4f} | G loss: {g_loss.item():.4f}")

    if epoch % 10 == 0:
        with torch.no_grad():
            test_z = torch.randn(16, latent_dim).to(device)
            gen_imgs = generator(test_z).cpu().view(-1, 1, 28, 28)
        plt.figure(figsize=(4, 4))
        for i in range(16):
            plt.subplot(4, 4, i + 1)
            plt.imshow(gen_imgs[i][0], cmap='gray')
            plt.axis('off')
        plt.show()
🔍 Results
After training for a few epochs, the generator learns to create realistic handwritten digits, improving as training progresses.
💡 Key Insights
- GANs are unsupervised — they learn to generate data without explicit labels.
- Training GANs is unstable — balancing the Generator and Discriminator is challenging.
- Common improvements: Wasserstein GAN (WGAN), DCGAN, CycleGAN, StyleGAN, etc.
🌍 Real-World Applications of GANs
| Domain | Application |
|---|---|
| 🎨 Art & Design | AI-generated paintings, deepfakes |
| 👗 Fashion | Generate new clothing designs |
| 🧬 Healthcare | Synthetic medical image generation |
| 🎮 Gaming | Realistic environment creation |
| 🧠 NLP | Text-to-image generation (paired with transformers) |
🔧 Best Practices for GAN Training
- Use LeakyReLU in the discriminator for better gradients.
- Normalize data to [-1, 1].
- Use Tanh activation in the generator output.
- Train D slightly more often than G to maintain stability.
- Monitor losses and visualize generated samples regularly.
✅ Summary
GANs revolutionized deep learning by introducing adversarial learning — an elegant idea where two neural networks compete to improve each other. With PyTorch’s flexibility, building and experimenting with GANs becomes intuitive and powerful.
🧩 Section 23: Variational Autoencoders (VAEs) with PyTorch
🎯 Objective
In this section, we’ll explore Variational Autoencoders (VAEs) — a generative model that learns meaningful latent representations of data and can generate new, realistic samples. You’ll learn theory, mathematical intuition, and a complete PyTorch implementation.
1. What is a Variational Autoencoder (VAE)?
A Variational Autoencoder (VAE) is a probabilistic generative model that learns to encode data into a latent space and then decode it to reconstruct the input.
Unlike standard autoencoders, VAEs don’t just compress data — they learn a probability distribution of the latent variables.
This allows VAEs to generate new data by sampling from this distribution.
💡 Key Idea
Traditional Autoencoder → Learns fixed encodings
Variational Autoencoder → Learns probabilistic encodings
Each input is represented by a distribution (mean μ and standard deviation σ) in the latent space rather than a single point.
When generating new data, we sample from these learned distributions.
2. VAE Architecture
| Component | Description |
|---|---|
| Encoder | Maps input data (x) to latent parameters (μ, σ). |
| Latent Space (z) | Sampled from the Gaussian distribution: ( z \sim \mathcal{N}(\mu, \sigma^2) ). |
| Decoder | Reconstructs data ( \hat{x} ) from z, approximating original input. |
🔢 Mathematical Formulation
A VAE maximizes the Evidence Lower Bound (ELBO):
[
\mathcal{L}(\theta, \phi; x) = \mathbb{E}_{q_\phi(z|x)}[\log p_\theta(x|z)] - D_{KL}(q_\phi(z|x) \,\|\, p(z))
]
Where:
- ( q_\phi(z|x) ): Encoder (approximate posterior)
- ( p_\theta(x|z) ): Decoder (likelihood)
- ( D_{KL} ): Kullback–Leibler divergence
- ( p(z) ): Prior distribution (usually ( \mathcal{N}(0, I) ))
The first term ensures reconstruction accuracy, and the second term ensures latent space regularization.
🧠 Reparameterization Trick
We can’t directly sample ( z \sim \mathcal{N}(\mu, \sigma^2) ) during backpropagation.
So we rewrite it as:
[
z = \mu + \sigma \odot \epsilon, \quad \epsilon \sim \mathcal{N}(0, I)
]
This allows gradients to flow through μ and σ during training.
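A tiny PyTorch check that the trick really lets gradients reach μ and σ (the tensors below are toy values, for illustration):

```python
import torch

mu = torch.zeros(3, requires_grad=True)
log_sigma = torch.zeros(3, requires_grad=True)

eps = torch.randn(3)                    # epsilon ~ N(0, I); no gradient needed
z = mu + torch.exp(log_sigma) * eps     # z = mu + sigma * eps

z.sum().backward()
print(mu.grad)         # all ones: dz/dmu = 1 elementwise
print(log_sigma.grad)  # exp(log_sigma) * eps, which equals eps here since log_sigma = 0
```

Had we sampled z directly from a distribution object, `backward()` would have nothing to differentiate through; the reparameterization moves the randomness into `eps`, leaving a deterministic path from μ and σ to z.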
3. Building a Variational Autoencoder in PyTorch
Let’s implement a simple VAE trained on the MNIST dataset (handwritten digits).
Step 1: Import Libraries
import torch
import torch.nn as nn
import torch.optim as optim
from torchvision import datasets, transforms
from torch.utils.data import DataLoader
import matplotlib.pyplot as plt
Step 2: Define Hyperparameters
batch_size = 128
learning_rate = 1e-3
epochs = 20
latent_dim = 20
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
Step 3: Load Data
transform = transforms.ToTensor()  # keep pixels in [0, 1] to match the sigmoid output and BCE loss
train_data = datasets.MNIST(root="./data", train=True, transform=transform, download=True)
train_loader = DataLoader(train_data, batch_size=batch_size, shuffle=True)
Step 4: Define the VAE Model
class VAE(nn.Module):
    def __init__(self):
        super(VAE, self).__init__()
        # Encoder
        self.fc1 = nn.Linear(28*28, 400)
        self.fc_mu = nn.Linear(400, latent_dim)
        self.fc_logvar = nn.Linear(400, latent_dim)
        # Decoder
        self.fc2 = nn.Linear(latent_dim, 400)
        self.fc3 = nn.Linear(400, 28*28)

    def encode(self, x):
        h = torch.relu(self.fc1(x))
        return self.fc_mu(h), self.fc_logvar(h)

    def reparameterize(self, mu, logvar):
        std = torch.exp(0.5 * logvar)
        eps = torch.randn_like(std)
        return mu + eps * std

    def decode(self, z):
        h = torch.relu(self.fc2(z))
        return torch.sigmoid(self.fc3(h))

    def forward(self, x):
        x = x.view(-1, 28*28)
        mu, logvar = self.encode(x)
        z = self.reparameterize(mu, logvar)
        recon_x = self.decode(z)
        return recon_x, mu, logvar
Step 5: Define Loss Function
The VAE loss is composed of:
- Reconstruction Loss (Binary Cross-Entropy)
- KL Divergence Loss
def loss_function(recon_x, x, mu, logvar):
    BCE = nn.functional.binary_cross_entropy(recon_x, x.view(-1, 28*28), reduction='sum')
    KLD = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
    return BCE + KLD
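As a sanity check, the closed-form KLD term above agrees with what `torch.distributions` computes for a Gaussian against the standard-normal prior (the μ and logvar values here are arbitrary toy inputs):

```python
import torch
from torch.distributions import Normal, kl_divergence

mu = torch.tensor([0.5, -1.0])
logvar = torch.tensor([0.2, -0.3])

# Closed-form KL term used in the VAE loss
kld = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())

# Same quantity via torch.distributions
std = torch.exp(0.5 * logvar)
kld_ref = kl_divergence(Normal(mu, std), Normal(0.0, 1.0)).sum()
print(torch.allclose(kld, kld_ref))
```

This kind of cross-check is cheap insurance against sign or factor-of-two mistakes in hand-derived loss terms.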
Step 6: Train the VAE
model = VAE().to(device)
optimizer = optim.Adam(model.parameters(), lr=learning_rate)
model.train()
for epoch in range(epochs):
    train_loss = 0
    for x, _ in train_loader:
        x = x.to(device)
        optimizer.zero_grad()
        recon_x, mu, logvar = model(x)
        loss = loss_function(recon_x, x, mu, logvar)
        loss.backward()
        optimizer.step()
        train_loss += loss.item()
    print(f"Epoch {epoch+1}, Loss: {train_loss / len(train_loader.dataset):.4f}")
Step 7: Visualize Results
model.eval()
with torch.no_grad():
    z = torch.randn(64, latent_dim).to(device)
    sample = model.decode(z).cpu()

sample = sample.view(-1, 1, 28, 28)
plt.figure(figsize=(6, 6))
for i in range(16):
    plt.subplot(4, 4, i + 1)
    plt.imshow(sample[i][0], cmap='gray')
    plt.axis('off')
plt.show()
4. Understanding VAE Results
After training:
- The model reconstructs digits with decent accuracy.
- Sampling from the latent space produces realistic new digits.
Unlike GANs, VAEs have a continuous and interpretable latent space, meaning nearby latent points produce similar outputs.
5. Advantages of VAEs
| Feature | Description |
|---|---|
| ✅ Stable Training | Unlike GANs, VAEs don’t suffer from mode collapse. |
| 🧩 Latent Space Interpretability | Latent dimensions capture features (e.g., rotation, thickness). |
| 🔄 Smooth Interpolation | Can generate smooth transitions between samples. |
| 🌍 Generative Power | Can generate unseen yet realistic data. |
6. Real-World Applications
| Domain | Use Case |
|---|---|
| 🖼️ Image Generation | Creating new faces, digits, or artwork. |
| 🧠 Anomaly Detection | Identify unusual data by measuring reconstruction error. |
| 💬 Natural Language Processing | Variational Autoencoders for sentence generation. |
| 🔊 Speech Synthesis | Generate realistic human-like speech. |
| 🧬 Bioinformatics | Latent representation of genomic data. |
7. Comparison: VAE vs GAN
| Feature | VAE | GAN |
|---|---|---|
| Training Stability | ✅ Stable | ⚠️ Unstable |
| Output Sharpness | Slightly blurry | Very sharp |
| Latent Space | Structured, continuous | Unstructured |
| Use Case | Data encoding, generation | Realistic data generation |
✅ Summary
- VAEs combine probabilistic modeling and deep learning for powerful unsupervised representation learning.
- They learn to generate data similar to the input distribution, making them ideal for anomaly detection, data synthesis, and latent representation learning.
- With PyTorch, VAEs are straightforward to implement, offering flexibility and interpretability for advanced AI applications.
🧩 Section 24: Transfer Learning with Pretrained Models in PyTorch
🎯 Objective
In this section, you’ll learn how to use Transfer Learning in PyTorch — a technique that allows you to reuse a pre-trained model on a new task with minimal training. This approach is widely used in modern AI applications like image classification, NLP, and medical imaging, saving both time and compute power.
1. What is Transfer Learning?
Transfer Learning means taking a model that’s already trained on a large, general dataset and adapting it to your specific task.
For example:
- A model trained on ImageNet (millions of images across 1000 categories) can be reused to classify medical images, wildlife species, or industrial defects.
- The model already “knows” how to detect general features (edges, shapes, textures), so you only need to fine-tune it to learn your specific classes.
💡 Why Use Transfer Learning?
| Benefit | Description |
|---|---|
| ⚡ Faster Training | Reduces training time drastically. |
| 💾 Less Data Required | Works well even with small labeled datasets. |
| 🧠 Improved Accuracy | Leverages powerful pre-learned features. |
| 💰 Cost Efficient | Reduces compute cost (no need to train from scratch). |
2. How Transfer Learning Works
There are two main strategies:
1️⃣ Feature Extraction
- Freeze all layers of the pre-trained model.
- Replace the final layer (classifier) with your custom output layer.
- Train only the final layer.
2️⃣ Fine-Tuning
- Unfreeze some top layers of the model.
- Retrain both the top layers and the classifier.
- Useful when your dataset is large and similar to the pretraining dataset.
3. Popular Pretrained Models in PyTorch
PyTorch provides pre-trained models via the torchvision.models module, such as:
- ResNet (Residual Network)
- VGG
- DenseNet
- MobileNet
- EfficientNet
- Vision Transformer (ViT)
All pre-trained on the ImageNet dataset.
4. Example: Transfer Learning with ResNet18
Let’s implement transfer learning to classify images of cats and dogs using a pre-trained ResNet18 model.
Step 1: Import Libraries
import torch
import torch.nn as nn
import torch.optim as optim
from torchvision import datasets, models, transforms
from torch.utils.data import DataLoader
import time
Step 2: Data Preparation
We’ll use data augmentation to make the model more robust.
data_dir = "./data/cats_dogs"
train_transforms = transforms.Compose([
transforms.Resize((224, 224)),
transforms.RandomHorizontalFlip(),
transforms.ToTensor(),
transforms.Normalize([0.485, 0.456, 0.406],
[0.229, 0.224, 0.225])
])
val_transforms = transforms.Compose([
transforms.Resize((224, 224)),
transforms.ToTensor(),
transforms.Normalize([0.485, 0.456, 0.406],
[0.229, 0.224, 0.225])
])
train_data = datasets.ImageFolder(root=data_dir + '/train', transform=train_transforms)
val_data = datasets.ImageFolder(root=data_dir + '/val', transform=val_transforms)
train_loader = DataLoader(train_data, batch_size=32, shuffle=True)
val_loader = DataLoader(val_data, batch_size=32, shuffle=False)
Step 3: Load Pretrained Model (ResNet18)
model = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)  # 'pretrained=True' in older torchvision

# Freeze all convolutional layers
for param in model.parameters():
    param.requires_grad = False
# Replace the classifier layer (fc) for 2 output classes
num_features = model.fc.in_features
model.fc = nn.Linear(num_features, 2)
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = model.to(device)
Step 4: Define Loss and Optimizer
criterion = nn.CrossEntropyLoss()
optimizer = optim.Adam(model.fc.parameters(), lr=0.001)
Step 5: Train the Model
num_epochs = 5
for epoch in range(num_epochs):
    model.train()
    running_loss, correct, total = 0.0, 0, 0
    for inputs, labels in train_loader:
        inputs, labels = inputs.to(device), labels.to(device)
        optimizer.zero_grad()
        outputs = model(inputs)
        loss = criterion(outputs, labels)
        loss.backward()
        optimizer.step()
        running_loss += loss.item()
        _, preds = torch.max(outputs, 1)
        correct += torch.sum(preds == labels.data)
        total += labels.size(0)
    epoch_loss = running_loss / len(train_loader)
    epoch_acc = correct.double() / total
    print(f"Epoch {epoch+1}/{num_epochs} | Loss: {epoch_loss:.4f} | Acc: {epoch_acc:.4f}")
Step 6: Evaluate on Validation Data
model.eval()
correct, total = 0, 0
with torch.no_grad():
    for inputs, labels in val_loader:
        inputs, labels = inputs.to(device), labels.to(device)
        outputs = model(inputs)
        _, preds = torch.max(outputs, 1)
        correct += torch.sum(preds == labels.data)
        total += labels.size(0)
val_acc = correct.double() / total
print(f"Validation Accuracy: {val_acc:.4f}")
Step 7: Save the Model
torch.save(model.state_dict(), "transfer_learning_resnet18.pth")
print("Model saved successfully!")
5. Results and Analysis
After just a few epochs, the model typically achieves >90% accuracy on cats vs dogs — even with a small dataset — thanks to the transfer of pre-learned features from ImageNet.
6. Fine-Tuning the Model
If you have more data or a slightly different domain, you can unfreeze the last few layers of the ResNet model:
for name, param in model.named_parameters():
    if "layer4" in name or "fc" in name:  # unfreeze top layers
        param.requires_grad = True
Then retrain with a smaller learning rate, e.g., lr = 1e-4.
This allows the network to slightly adjust the pre-trained filters to your new dataset.
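One common way to apply the smaller learning rate only to the fine-tuned layers is optimizer parameter groups; the two toy layers below are stand-ins for ResNet's `layer4` and `fc` (illustrative sketch):

```python
import torch.nn as nn
import torch.optim as optim

backbone_block = nn.Linear(8, 8)   # stand-in for the unfrozen 'layer4'
head = nn.Linear(8, 2)             # stand-in for the replaced 'fc'

optimizer = optim.Adam([
    {"params": backbone_block.parameters(), "lr": 1e-4},  # gentle updates to pretrained filters
    {"params": head.parameters(), "lr": 1e-3},            # faster updates for the fresh classifier
])
print([g["lr"] for g in optimizer.param_groups])  # [0.0001, 0.001]
```

Parameter groups keep a single optimizer while letting each part of the network move at its own pace.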
7. Visualizing Model Predictions
import matplotlib.pyplot as plt
import numpy as np
def imshow(inp, title=None):
    inp = inp.numpy().transpose((1, 2, 0))
    mean = np.array([0.485, 0.456, 0.406])
    std = np.array([0.229, 0.224, 0.225])
    inp = std * inp + mean
    plt.imshow(np.clip(inp, 0, 1))
    if title:
        plt.title(title)
    plt.pause(0.001)
# Display predictions
# Display predictions
model.eval()
inputs, classes = next(iter(val_loader))
with torch.no_grad():
    outputs = model(inputs.to(device))
_, preds = torch.max(outputs, 1)
imshow(inputs[0], title=f"Predicted: {val_data.classes[preds[0]]}")
8. Real-World Applications of Transfer Learning
| Domain | Example |
|---|---|
| 🧠 Healthcare | Disease detection from X-rays or MRI scans |
| 🐦 Ecology | Animal species classification |
| 🚘 Automotive | Road sign and pedestrian detection |
| 📷 Computer Vision | Object detection, facial recognition |
| 💬 NLP | Language models like BERT, GPT (via transfer learning on text) |
9. Best Practices
✅ Use pretrained weights for faster convergence.
✅ Apply data normalization matching the pretrained model.
✅ Start with feature extraction, then fine-tune gradually.
✅ Use a smaller learning rate for fine-tuning (1e-4 or less).
✅ Regularly monitor validation loss to prevent overfitting.
✅ Summary
- Transfer Learning allows leveraging existing deep models to solve new tasks efficiently.
- PyTorch provides easy access to state-of-the-art pretrained architectures like ResNet, VGG, and EfficientNet.
- Through feature extraction and fine-tuning, you can achieve high performance with minimal data and training effort.


