Computer vision Assignment 3: Deep Learning for Perception Tasks

Computer vision 2022 Assignment 3: Deep Learning for Perception Tasks

This assignment contains 2 questions. The first question gives you a basic understanding of the classifier. The second question requires you to write a simple proposal.

Question 1: A simple classifier (60%)

For this exercise, we will provide a demo code showing how to train a network on a small dataset called FashionMinst. Please go through the following tutorials first. You will get a basic understanding about how to train an image classification network in pytorch. You can change the training scheme and the network structure. Please answer the following questions then. You can orginaze your own text and code cell to show the answer of each questions.

Note: Please plot the loss curve for each experiment (2 point).

Requirement:

Q1.1 (1 point) Change the learning rate and train for 10 epochs. Fill this table:

Lr Accuracy

0.1

0.01

0.001

Q1.2 (2 point) Report the number of epochs when the network is converged. Hint: The network is called "converged" when the accuracy is not changed (or the change is smaller than a threshold).

Fill this table:

Lr Accuracy Epoch

0.1

0.01

0.001

Q1.3 (2 points) Compare the results in table 1 and table 2, what is your observation and your understanding of learning rate?

Q1.4 (3 point) Build a deeper/ wider network. Report the accuracy and the parameters for each structure. Parameters represent the number of trainable parameters in your model, e.g. a 3 x 3 conv has 9 parameters.

Structures Accuracy Parameters

Base

Deeper

Wider

Q1.5 (2 points) Choose to do one of the following two tasks:

a. Write a code to calculate the parameter and expian the code.

b. Write done the process of how to calculate the parameters by hand.

Q1.6 (1 points) What are your observations and conclusions for changing network structure?

Q1.7 (2 points) Calculate the mean of the gradients of the loss to all trainable parameters. Plot the gradients curve for the first 100 training steps. What are your observations? Note that this gradients will be saved with the training weight automatically after you call loss.backwards(). Hint: the mean of the gradients should be decreased.

For more exlanation of q1.7, you could refer to the following simple instructions: https://colab.research.google.com/drive/1XAsyNegGSvMf3_B6MrsXht7-fHqtJ7OW?usp=sharing

import numpy as np # This is for mathematical operations

# this is used in plotting

import matplotlib.pyplot as plt

import time

import pylab as pl

from IPython import display

%matplotlib inline

%load_ext autoreload

%autoreload 2

%reload_ext autoreload

#### Tutorial Code

####PyTorch has two primitives to work with data: torch.utils.data.DataLoader and torch.utils.data.Dataset.

#####Dataset stores samples and their corresponding labels, and DataLoader wraps an iterable around the Dataset.

import torch

from torch import nn

from torch.utils.data import DataLoader

from torchvision import datasets

from torchvision.transforms import ToTensor, Lambda, Compose

import matplotlib.pyplot as plt

# Download training data from open datasets.

##Every TorchVision Dataset includes two arguments:

##transform and target_transform to modify the samples and labels respectively.

training_data = datasets.FashionMNIST(

root="data",

train=True,

download=True,

transform=ToTensor(),

)

# Download test data from open datasets.

test_data = datasets.FashionMNIST(

root="data",

train=False,

download=True,

transform=ToTensor(),

)

We pass the Dataset as an argument to DataLoader. This wraps an iterable over our dataset and supports automatic batching, sampling, shuffling, and multiprocess data loading. Here we define a batch size of 64, i.e. each element in the dataloader iterable will return a batch of 64 features and labels.

batch_size = 64

# Create data loaders.

train_dataloader = DataLoader(training_data, batch_size=batch_size)

test_dataloader = DataLoader(test_data, batch_size=batch_size)

for X, y in test_dataloader:

print("Shape of X [N, C, H, W]: ", X.shape)

print("Shape of y: ", y.shape, y.dtype)

break

To define a neural network in PyTorch, we create a class that inherits from nn.Module. We define the layers of the network in the init function and specify how data will pass through the network in the forward function. To accelerate operations in the neural network, we move it to the GPU if available.

# Get cpu or gpu device for training.

device = "cuda" if torch.cuda.is_available() else "cpu"

print("Using {} device".format(device))

# Define model

class NeuralNetwork(nn.Module):

def __init__(self):

super(NeuralNetwork, self).__init__()

self.flatten = nn.Flatten()

self.linear_relu_stack = nn.Sequential(

nn.Linear(28*28, 512),

nn.ReLU(),

nn.Linear(512, 512),

nn.ReLU(),

nn.Linear(512, 10)

)

def forward(self, x):

x = self.flatten(x)

logits = self.linear_relu_stack(x)

return logits

model = NeuralNetwork().to(device)

print(model)

###Define the loss function and the optimizer

loss_fn = nn.CrossEntropyLoss()

optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)

In a single training loop, the model makes predictions on the training dataset (fed to it in batches), and backpropagates the prediction error to adjust the model’s parameters.

def train(dataloader, model, loss_fn, optimizer):

size = len(dataloader.dataset)

model.train()

for batch, (X, y) in enumerate(dataloader):

X, y = X.to(device), y.to(device)

# Compute prediction error

pred = model(X)

loss = loss_fn(pred, y)

# Backpropagation

optimizer.zero_grad()

loss.backward()

optimizer.step()

if batch % 100 == 0:

loss, current = loss.item(), batch * len(X)

print(f"loss: {loss:>7f} [{current:>5d}/{size:>5d}]")

##Define a test function

def test(dataloader, model, loss_fn):

size = len(dataloader.dataset)

num_batches = len(dataloader)

model.eval()

test_loss, correct = 0, 0

with torch.no_grad():

for X, y in dataloader:

X, y = X.to(device), y.to(device)

pred = model(X)

test_loss += loss_fn(pred, y).item()

correct += (pred.argmax(1) == y).type(torch.float).sum().item()

test_loss /= num_batches

correct /= size

print(f"Test Error: \n Accuracy: {(100*correct):>0.1f}%, Avg loss: {test_loss:>8f} \n")

#Train and test the model

epochs = 5

for t in range(epochs):

print(f"Epoch {t+1}\n-------------------------------")

train(train_dataloader, model, loss_fn, optimizer)

test(test_dataloader, model, loss_fn)

print("Done!")

Question 2: Proposal for Practical Applications (40%)

Look for a typical computer vision problem, such as: a. removing noise on the image

b. increasing the resolution of the image

c. identifying objects in the image

d. segmenting the area to which the image belongs

e. estimating the depth of an object

f. estimating the motion of two object in different frames

h. others

Discuss possible applications of this problem in life, e.g. image editing systems in your phone, improved quality of the old film, sweeping robot avoiding obstacles, unlocks the face of the mobile phone, identifies the cancer area according to the medical scan image, determines the identity according to the face, identifies the trash can on the road, and the detection system tracks the target object, etc.

In this question, you need to do

Clearly define the problem and describe its application scenarios

Briefly describe a feasible solution based on image processing and traditional machine learning algorithms.

Briefly describe a feasible deep learning-based solution.

Compare the advantages and disadvantages of the two options.

Hint1: Submit an individua report for question 2.

Hint2: Well orginaze your report.

Hint3: You can draw flow chart or inculde other figures for better understanding of your solution.

Please restrict your report within 800 words. In this question, you do not need to implement your solution. You only need to write down a proposal. Please submit this report in a seperate pdf.

QQ：99515681
WeChat：codinghelp
Email：99515681@qq.com
Work Time：8:00-23:00

Hots

Ghostwriter Fit2096 - Games Programmin... 2024-04-16
Help With Cs 4104 (Spring 2024) Homewo... 2024-04-16
Help With Financial Management Ii (Pad... 2024-04-16
Help With Mg5637 Understanding Busines... 2024-04-16
Help With Gds104 Global Business And S... 2024-04-16
Ghostwriter Final Data Analysis Projec... 2024-04-16
Help With International Macroeconomics... 2024-04-16
Ghostwriter Elec0048 – Antennas And Pr 2024-04-16
Ghostwriter Cs314 Assignment 7Ghostwri... 2024-04-16
Help With Masy1-Gc 1240-104 Informatio... 2024-04-16
Help With 110.309 Advanced Financial A... 2024-04-16
Help With Acct1101 Financial Accountin... 2024-04-16
Ghostwriter Mso3610 Financial Data Ana... 2024-04-16
Help With 07 33984 Management Educatio... 2024-04-16
Ghostwriter Christmas Concertghostwrit... 2024-04-16
Help With Macro-Econometrics Homework ... 2024-04-16
Ghostwriter Fit2093 Assignment 1 Tasks 2024-04-16
Help With Cs3334 Project: Find All Uni... 2024-04-16
Help With 125.320 International Financ... 2024-04-16
Help With Biol0006 End-Of-Term Assignm... 2024-04-16

Programming Assignment Help！