CS 486/686 Neural Networks

CS 486/686 Winter 2023 Assignment 4

2 Neural Networks (65 marks)

In this part of the assignment, you will implement a feedforward neural network from scratch. Additionally, you will implement multiple activation functions, loss functions, and performance metrics. Lastly, you will train a neural network model to perform both a classification and a regression task.

2.1 Bank Note Forgery - A Classification Problem

The classification problem we will examine is the prediction of whether or not a bank note is forged. The labelled dataset included in the assignment was downloaded from the UCI Machine Learning Repository. The target $y \in \{0, 1\}$ is a binary variable, where 0 and 1 refer to fake and real respectively. The features are all real-valued. They are listed below:

- Variance of the transformed image of the bank note
- Skewness of the transformed image of the bank note
- Curtosis of the transformed image of the bank note
- Entropy of the image

2.2 Red Wine Quality - A Regression Problem

The task is to predict the quality of red wine from northern Portugal, given some physical characteristics of the wine. The target $y \in [0, 10]$ is a continuous variable, where 10 is the best possible wine, according to human tasters. Again, this dataset was downloaded from the UCI Machine Learning Repository. The features are all real-valued. They are listed below:

- Fixed acidity
- Volatile acidity
- Citric acid
- Residual sugar
- Chlorides
- Free sulfur dioxide
- Total sulfur dioxide
- Density
- Sulphates
- Alcohol

2.3 Training a Neural Network

In Lecture 15, you learned how to train a neural network using the backpropagation algorithm. In this assignment, you will apply the forward and backward pass to the entire dataset simultaneously (i.e. batch gradient descent, where one batch is the entire dataset). As a result, your forward and backward passes will manipulate tensors, where the first dimension is the number of examples in the training set, $n$. When updating an individual weight $W^{(\ell)}_{i,j}$, you will need to find the sum of partial derivatives $\frac{\partial E}{\partial W^{(\ell)}_{i,j}}$ across all examples in the training set to apply the update. Algorithm 1 gives the training algorithm in terms of functions that you will implement in this assignment. Further details can be found in the documentation for each function in the provided source code.

Wenhu Chen 2022 v1.2 Page 4 of 8
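The idea of summing partial derivatives across all examples can be illustrated with a minimal NumPy sketch. It is not taken from the provided source code: it assumes a single linear layer with no activation and a squared-error objective, and the helper name `batch_gradient_step` and the `alpha / n` scaling are choices made for this illustration only.

```python
import numpy as np

def batch_gradient_step(W, X, y, alpha):
    """One batch gradient descent step for a single linear layer y_hat = X @ W.

    Hypothetical helper for illustration; not part of the assignment files.
    W: (f, 1) weights, X: (n, f) examples, y: (n, 1) targets, alpha: learning rate.
    """
    n = X.shape[0]          # first dimension is the number of examples
    y_hat = X @ W           # forward pass over the whole dataset, shape (n, 1)
    error = y_hat - y       # per-example error, shape (n, 1)
    # For 0.5 * error_i^2, the gradient w.r.t. W from example i is x_i * error_i.
    # X.T @ error adds these per-example gradients over all n examples.
    grad_sum = X.T @ error  # shape (f, 1)
    # Assumption of this sketch: scale the summed gradient by alpha / n.
    return W - (alpha / n) * grad_sum
```

The matrix product replaces an explicit loop over examples, which is why the forward and backward passes can work on the whole dataset at once. No NumPy random operations are used here; the weights are passed in by the caller.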

Algorithm 1 Gradient descent with backpropagation

Require: $\alpha > 0$ ▷ Learning rate
Require: $n_{epochs} \in \mathbb{N}^+$ ▷ Number of epochs
Require: $X \in \mathbb{R}^{n \times f}$ ▷ Training examples with $n$ examples and $f$ features
Require: $y \in \mathbb{R}^{n}$ ▷ Targets for training examples

Initiate weight matrices $W^{(\ell)}$ randomly for each layer. ▷ Initialize net
for $i \in \{1, 2, \ldots, n_{epochs}\}$ do ▷ Conduct $n_{epochs}$ epochs
    A_vals, Z_vals $\leftarrow$ net.forward_pass(X) ▷ Forward pass
    $\hat{y} \leftarrow$ Z_vals[-1] ▷ Predictions
    $L \leftarrow \mathcal{L}(\hat{y}, y)$
    Compute $\frac{\partial}{\partial \hat{y}} \mathcal{L}(\hat{y}, y)$ ▷ Derivative of error with respect to predictions
    deltas $\leftarrow$ backward_pass(A_vals, $\frac{\partial}{\partial \hat{y}} \mathcal{L}(\hat{y}, y)$) ▷ Backward pass
    update_gradients() ▷ $W^{(\ell)}_{i,j} \leftarrow W^{(\ell)}_{i,j} - \frac{\alpha}{n} \frac{\partial L}{\partial W^{(\ell)}_{i,j}}$ for each weight
end for
return trained weight matrices $W^{(\ell)}$

2.4 Activation and Loss Functions

You will implement the following activation functions and their derivatives:

Sigmoid

$$g(x) = \frac{1}{1 + e^{-kx}}$$

ReLU

$$g(x) = \max(0, x)$$
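As a rough illustration of the two activation functions above and their derivatives, here is a minimal NumPy sketch. The function names, the default `k=1.0`, and the choice to set the ReLU derivative to 0 at `x = 0` are assumptions made for this example; the class-based interface in operations.py may differ.

```python
import numpy as np

def sigmoid(x, k=1.0):
    # g(x) = 1 / (1 + e^{-kx}); k=1.0 is an assumed default for this sketch.
    return 1.0 / (1.0 + np.exp(-k * x))

def sigmoid_derivative(x, k=1.0):
    # d/dx g(x) = k * g(x) * (1 - g(x))
    s = sigmoid(x, k)
    return k * s * (1.0 - s)

def relu(x):
    # g(x) = max(0, x), applied elementwise
    return np.maximum(0.0, x)

def relu_derivative(x):
    # 1 where x > 0, else 0. The value at exactly x = 0 is a convention;
    # this sketch assumes 0 there.
    return (x > 0).astype(float)
```

All four functions operate elementwise, so they accept an array whose first dimension is the number of examples without any change.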

You will implement the following loss functions and their derivatives:

Cross entropy loss: for binary classification

Compute the average over all the examples. Note that log() refers to the natural logarithm.

$$L(\hat{y}, y) = \frac{1}{n} \sum_{i=1}^{n} -\left(y \log(\hat{y}) + (1 - y) \log(1 - \hat{y})\right)$$

Mean squared error loss: for regression

$$L(\hat{y}, y) = \frac{1}{n} \sum_{i=1}^{n} (\hat{y} - y)^2$$
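The two losses can be sketched in NumPy as follows. This is an illustration under stated assumptions rather than the expected solution: the function names are hypothetical, the mean squared error is taken in its standard form (mean of squared differences), and the derivative functions return the per-example derivative with respect to each prediction, leaving the 1/n averaging factor to the weight update.

```python
import numpy as np

def cross_entropy(y_hat, y):
    # Average binary cross entropy over all examples (natural logarithm).
    return np.mean(-(y * np.log(y_hat) + (1 - y) * np.log(1 - y_hat)))

def cross_entropy_derivative(y_hat, y):
    # Per-example derivative w.r.t. y_hat; the 1/n factor is omitted
    # (an assumption of this sketch).
    return -(y / y_hat) + (1 - y) / (1 - y_hat)

def mean_squared_error(y_hat, y):
    # Average squared difference over all examples.
    return np.mean((y_hat - y) ** 2)

def mean_squared_error_derivative(y_hat, y):
    # Per-example derivative w.r.t. y_hat; the 1/n factor is omitted
    # (an assumption of this sketch).
    return 2 * (y_hat - y)
```

Note that the cross entropy functions divide by and take the logarithm of `y_hat` and `1 - y_hat`, so predictions of exactly 0 or 1 produce infinities; this sketch does not guard against that case.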

2.5 Implementation

We have provided three Python files. Please read the detailed comments in the provided files carefully. Note that some functions have already been implemented for you.

1. neural_net.py: Contains an implementation of a NeuralNetwork class. You must implement the forward_pass(), backward_pass(), and update_weights() methods in the NeuralNetwork class. Do not change the function signatures. Do not change anything else in this file!

2. operations.py: Contains multiple classes for multiple activation functions, loss functions, and functions for performance metrics. The activation functions extend a base Activation class and the loss functions extend a base Loss class. You must implement all the blank functions as indicated in this file. Do not change the function signatures. Do not change anything else in this file!

3. train_experiment.py: Provides a demonstration of how to define a NeuralNetwork object and train it on one of the provided datasets. Feel free to change this file as you desire.

Please complete the following tasks.

(a) Implement the empty functions in neural_net.py and operations.py. Zip and submit these two files on Marmoset.

Please do not invoke any numpy random operations in neural_net.py and operations.py. This may tamper with the automatic grading.

Marking Scheme: (52 marks)

Unit tests for neural_net.py:

NeuralNetwork.forward_pass()

(1 public test + 1 secret test) * 6 marks = 12 marks

NeuralNetwork.backward_pass()

(1 public test + 1 secret test) * 6 marks = 12 marks

NeuralNetwork.update_weights()

(1 public test + 1 secret test) * 6 marks = 12 marks

Unit tests for operations.py:

Sigmoid.value()