吳恩達深度學習2.3練習_Improving Deep Neural Networks_Tensorflow

阿新 • • 發佈：2018-12-16

轉載自吳恩達老師深度學習練習notebook

TensorFlow Tutorial

Welcome to this week’s programming assignment. Until now, you’ve always used numpy to build neural networks. Now we will step you through a deep learning framework that will allow you to build neural networks more easily. Machine learning frameworks like TensorFlow, PaddlePaddle, Torch, Caffe, Keras, and many others can speed up your machine learning development significantly. All of these frameworks also have a lot of documentation, which you should feel free to read. In this assignment, you will learn to do the following in TensorFlow:

Initialize variables
Start your own session
Train algorithms
Implement a Neural Network

Programing frameworks can not only shorten your coding time, but sometimes also perform optimizations that speed up your code.

1 - Exploring the Tensorflow Library

To start, you will import the library:

import math
import numpy as np
import h5py
import matplotlib.pyplot as plt
import tensorflow as tf
from tensorflow.python.framework import ops
from tf_utils import load_dataset, random_mini_batches, convert_to_one_hot, predict

%matplotlib inline
np.random.seed(1)

Now that you have imported the library, we will walk you through its different applications. You will start with an example, where we compute for you the loss of one training example.
$\begin{matrix} (1) \end{matrix}$

l o s s = L ( y ^ , y ) = ( y ^ ( i ) − y ( i ) ) 2 loss = \mathcal{L}(\hat{y}, y) = (\hat y^{(i)} - y^{(i)})^2 \tag{1}

l o s s = L (\overset{y}{^}, y) = (\overset{y}{^}^{(i)} - y^{(i)})^{2} (1)

y_hat = tf.constant(36, name='y_hat')            # Define y_hat constant. Set to 36.
y = tf.constant(39, name='y')                    # Define y. Set to 39

loss = tf.Variable((y - y_hat)**2, name='loss')  # Create a variable for the loss

init = tf.global_variables_initializer()         # When init is run later (session.run(init)),
                                                 # the loss variable will be initialized and ready to be computed
with tf.Session() as session:                    # Create a session and print the output
    session.run(init)                            # Initializes the variables
    print(session.run(loss))                     # Prints the loss

Writing and running programs in TensorFlow has the following steps:

Create Tensors (variables) that are not yet executed/evaluated.
Write operations between those Tensors.
Initialize your Tensors.
Create a Session.
Run the Session. This will run the operations you’d written above.

Therefore, when we created a variable for the loss, we simply defined the loss as a function of other quantities, but did not evaluate its value. To evaluate it, we had to run init=tf.global_variables_initializer(). That initialized the loss variable, and in the last line we were finally able to evaluate the value of loss and print its value.

Now let us look at an easy example. Run the cell below:

a = tf.constant(2)
b = tf.constant(10)
c = tf.multiply(a,b)
print(c)

Tensor("Mul_2:0", shape=(), dtype=int32)

As expected, you will not see 20! You got a tensor saying that the result is a tensor that does not have the shape attribute, and is of type “int32”. All you did was put in the ‘computation graph’, but you have not run this computation yet. In order to actually multiply the two numbers, you will have to create a session and run it.

sess = tf.Session()
print(sess.run(c))

Great! To summarize, remember to initialize your variables, create a session and run the operations inside the session.

Next, you’ll also have to know about placeholders. A placeholder is an object whose value you can specify only later.
To specify values for a placeholder, you can pass in values by using a “feed dictionary” (feed_dict variable). Below, we created a placeholder for x. This allows us to pass in a number later when we run the session.

# Change the value of x in the feed_dict

x = tf.placeholder(tf.int64, name = 'x')
print(sess.run(2 * x, feed_dict = {x: 3}))
sess.close()

When you first defined x you did not have to specify a value for it. A placeholder is simply a variable that you will assign data to only later, when running the session. We say that you feed data to these placeholders when running the session.

Here’s what’s happening: When you specify the operations needed for a computation, you are telling TensorFlow how to construct a computation graph. The computation graph can have some placeholders whose values you will specify only later. Finally, when you run the session, you are telling TensorFlow to execute the computation graph.

1.1 - Linear function

Lets start this programming exercise by computing the following equation: $Y = WX + b$ , where $W$ and $X$ are random matrices and b is a random vector.

Exercise: Compute $WX + b$ where $W, X$ , and $b$ are drawn from a random normal distribution. W is of shape (4, 3), X is (3,1) and b is (4,1). As an example, here is how you would define a constant X that has shape (3,1):

X = tf.constant(np.random.randn(3,1), name = "X")

You might find the following functions helpful:

tf.matmul(…, …) to do a matrix multiplication
tf.add(…, …) to do an addition
np.random.randn(…) to initialize randomly

# GRADED FUNCTION: linear_function

def linear_function():
    """
    Implements a linear function: 
            Initializes W to be a random tensor of shape (4,3)
            Initializes X to be a random tensor of shape (3,1)
            Initializes b to be a random tensor of shape (4,1)
    Returns: 
    result -- runs the session for Y = WX + b 
    """
    
    np.random.seed(1)
    
    ### START CODE HERE ### (4 lines of code)
    X = tf.constant(np.random.randn(3,1), name = "X")
    W = tf.constant(np.random.randn(4,3), name = "W")
    b = tf.constant(np.random.randn(4,1), name = "b")
    Y = tf.add(tf.matmul(W,X),b)
    ### END CODE HERE ### 
    
    # Create the session using tf.Session() and run it with sess.run(...) on the variable you want to calculate
    
    ### START CODE HERE ###
    sess = tf.Session()
    result = sess.run(Y)
    ### END CODE HERE ### 
    
    # close the session 
    sess.close()

    return result

print( "result = " + str(linear_function()))

result = [[-2.15657382]
 [ 2.95891446]
 [-1.08926781]
 [-0.84538042]]

Expected Output :

result

[[-2.15657382] [ 2.95891446] [-1.08926781] [-0.84538042]]

1.2 - Computing the sigmoid

Great! You just implemented a linear function. Tensorflow offers a variety of commonly used neural network functions like tf.sigmoid and tf.softmax. For this exercise lets compute the sigmoid function of an input.

You will do this exercise using a placeholder variable x. When running the session, you should use the feed dictionary to pass in the input z. In this exercise, you will have to (i) create a placeholder x, (ii) define the operations needed to compute the sigmoid using tf.sigmoid, and then (iii) run the session.

** Exercise **: Implement the sigmoid function below. You should use the following:

tf.placeholder(tf.float32, name = "...")
tf.sigmoid(...)
sess.run(..., feed_dict = {x: z})

Note that there are two typical ways to create and use sessions in tensorflow:

Method 1:

sess = tf.Session()
# Run the variables initialization (if needed), run the operations
result = sess.run(..., feed_dict = {...})
sess.close() # Close the session

Method 2:

with tf.Session() as sess: 
    # run the variables initialization (if needed), run the operations
    result = sess.run(..., feed_dict = {...})
    # This takes care of closing the session for you :)

# GRADED FUNCTION: sigmoid

def sigmoid(z):
    """
    Computes the sigmoid of z
    
    Arguments:
    z -- input value, scalar or vector
    
    Returns: 
    results -- the sigmoid of z
    """
    
    ### START CODE HERE ### ( approx. 4 lines of code)
    # Create a placeholder for x. Name it 'x'.
    x = tf.placeholder(tf.float32,name='x')

    # compute sigmoid(x)
    sigmoid = tf.sigmoid(x)

    # Create a session, and run it. Please use the method 2 explained above. 
    # You should use a feed_dict to pass z's value to x. 
    sess = tf.Session()
    # Run session and call the output "result"
    result = sess.run(sigmoid,feed_dict={x:z})
    sess.close()
    
    ### END CODE HERE ###
    
    return result

print ("sigmoid(0) = " + str(sigmoid(0)))
print ("sigmoid(12) = " + str(sigmoid(12)))

sigmoid(0) = 0.5
sigmoid(12) = 0.9999938

Expected Output :

sigmoid(0)	0.5
sigmoid(12)	0.999994

**To summarize, you how know how to**: 1. Create placeholders 2. Specify the computation graph corresponding to operations you want to compute 3. Create the session 4. Run the session, using a feed dictionary if necessary to specify placeholder variables' values.

1.3 - Computing the Cost

You can also use a built-in function to compute the cost of your neural network. So instead of needing to write code to compute this as a function of $a^{[2](i)}$ and $y^{(i)}$ for i=1…m:
$J = - \frac{1}{m} \sum_{i = 1}^m \large ( \small y^{(i)} \log a^{ [2] (i)} + (1-y^{(i)})\log (1-a^{ [2] (i)} )\large )\small\tag{2}$

you can do it in one line of code in tensorflow!

Exercise: Implement the cross entropy loss. The function you will use is:

tf.nn.sigmoid_cross_entropy_with_logits(logits = ..., labels = ...)

Your code should input z, compute the sigmoid (to get a) and then compute the cross entropy cost $J$ . All this can be done using one call to tf.nn.sigmoid_cross_entropy_with_logits, which computes

$- \frac{1}{m} \sum_{i = 1}^m \large ( \small y^{(i)} \log \sigma(z^{[2](i)}) + (1-y^{(i)})\log (1-\sigma(z^{[2](i)})\large )\small\tag{2}$

# GRADED FUNCTION: cost

def cost(logits, labels):
    """
    Computes the cost using the sigmoid cross entropy
    
    Arguments:
    logits -- vector containing z, output of the last linear unit (before the final sigmoid activation)
    labels -- vector of labels y (1 or 0) 
    
    Note: What we've been calling "z" and "y" in this class are respectively called "logits" and "labels" 
    in the TensorFlow documentation. So logits will feed into z, and labels into y. 
    
    Returns:
    cost -- runs the session of the cost (formula (2))
    """
    
    ### START CODE HERE ### 
    
    # Create the placeholders for "logits" (z) and "labels" (y) (approx. 2 lines)
    z = tf.placeholder(tf.float32,name='z')
    y = tf.placeholder(tf.float32,name='y')
    
    # Use the loss function (approx. 1 line)
    cost = tf.nn.sigmoid_cross_entropy_with_logits(logits=z,labels=y)
    
    # Create a session (approx. 1 line). See method 1 above.
    sess = tf.Session()
    
    # Run the session (approx. 1 line).
    cost = sess.run(cost,feed_dict={z:logits,y:labels})
    # Close the session (approx. 1 line). See method 1 above.
    sess.close()
    
    ### END CODE HERE ###
    
    return cost

logits = sigmoid(np.array([0.2,0.4,0.7,0.9]))
cost = cost(logits, np.array([0,0,1,1

 
 
              
           
              
              
            
            相關推薦
			   
            
            
            
 

    

    
    吳恩達深度學習2.3練習_Improving Deep Neural Networks_Tensorflow
       
  
  
 轉載自吳恩達老師深度學習練習notebook 
 TensorFlow Tutorial 
 Welcome to this week’s programming assignment. Until now, you’ve always used numpy to build neural  

  
 

    

    
    吳恩達深度學習2.1練習_Improving Deep Neural Networks(Initialization_Regularization_Gradientchecking)
       
  
  
 版權宣告：本文為博主原創文章，未經博主允許不得轉載。 https://blog.csdn.net/weixin_42432468 
 學習心得： 1、每週的視訊課程看一到兩遍 2、做筆記 
 3、做每週的作業練習，這個裡面的含金量非常高。先根據notebook過一遍，掌握後一定要自己敲一遍， 

  
 

    

    
    吳恩達深度學習2.1練習_Improving Deep Neural Networks_initialization
       
  
  
 轉載自吳恩達老師深度學習練習notebook 
 Initialization 
 Welcome to the first assignment of “Improving Deep Neural Networks”. 
 Training your neural network requ 

  
 

    

    
    吳恩達深度學習2.3筆記_Improving Deep Neural Networks_超引數除錯 和 Batch Norm
       
  
  
 版權宣告：本文為博主原創文章，未經博主允許不得轉載。 https://blog.csdn.net/weixin_42432468 
 學習心得： 1、每週的視訊課程看一到兩遍 2、做筆記 
 3、做每週的作業練習，這個裡面的含金量非常高。先根據notebook過一遍，掌握後一定要自己敲一遍， 

  
 

    

    
    吳恩達深度學習2.1筆記_Improving Deep Neural Networks_深度學習的實踐層面
       
  
  
 版權宣告：本文為博主原創文章，未經博主允許不得轉載。 https://blog.csdn.net/weixin_42432468 
 學習心得： 1、每週的視訊課程看一到兩遍 2、做筆記 
 3、做每週的作業練習，這個裡面的含金量非常高。先根據notebook過一遍，掌握後一定要自己敲一遍， 

  
 

    

    
    吳恩達深度學習4.3練習_Convolutional Neural Networks_Car detection
       
  
  
 轉載自吳恩達老師深度學習課程作業notebook 
 Autonomous driving - Car detection 
 Welcome to your week 3 programming assignment. You will learn about object detecti 

  
 

    

    
    吳恩達深度學習2-Week2課後作業3-優化演算法
       
 
 
 一、deeplearning-assignment 
 到目前為止，在之前的練習中我們一直使用梯度下降來更新引數並最小化成本函式。在本次作業中，將學習更先進的優化方法，它在加快學習速度的同時，甚至可以獲得更好的最終值。一個好的優化演算法可以讓你幾個小時內就獲得一個結果，而不是等待幾天。 
 1. 

  
 

    

    
    吳恩達深度學習2-Week1課後作業3-梯度檢測
       
 
 
 一、deeplearning-assignment 
 神經網路的反向傳播很複雜，在某些時候需要對反向傳播演算法進行驗證，以證明確實有效，這時我們引入了“梯度檢測”。 
 反向傳播需要計算梯度 , 其中θ表示模型的引數。J是使用前向傳播和損失函式計算的。因為前向傳播實現相對簡單, 所以 

  
 

    

    
    吳恩達深度學習2.2練習_Improving Deep Neural Networks_Optimization
       
  
  
 版權宣告：本文為博主原創文章，未經博主允許不得轉載。 https://blog.csdn.net/weixin_42432468 
 學習心得： 1、每週的視訊課程看一到兩遍 2、做筆記 
 3、做每週的作業練習，這個裡面的含金量非常高。先根據notebook過一遍，掌握後一定要自己敲一遍， 

  
 

    

    
    吳恩達深度學習2-Week3課後作業-Tensorflow
       
 
 
 一、deeplearning-assignment 
 到目前為止，我們一直使用numpy來建立神經網路。這次作業將深入學習框架，可以更容易地建立神經網路。 
 TensorFlow，PaddlePaddle，Torch，Caffe，Keras等機器學習框架可以顯著地加速機器學習開發。這些框架有 

  
 

    

    
    吳恩達深度學習2-Week1課後作業2-正則化
      
                一、deeplearning-assignment

這一節作業的重點是理解各個正則化方法的原理，以及它們的優缺點，而不是去注重演算法實現的具體末節。

問題陳述：希望你通過一個數據集訓練一個合適的模型，從而幫助推薦法國守門員應該踢球的位置，這樣法國隊的球員可以用頭打。法國過 

  
 

    

    
    吳恩達深度學習4.1練習_Convolutional Neural Networks_Convolution_model_Application_2
       
  
  
 版權宣告：本文為博主原創文章，未經博主允許不得轉載。 https://blog.csdn.net/weixin_42432468 
 學習心得： 1、每週的視訊課程看一到兩遍 2、做筆記 
 3、做每週的作業練習，這個裡面的含金量非常高。先根據notebook過一遍，掌握後一定要自己敲一遍， 

  
 

    

    
    吳恩達深度學習4.1練習_Convolutional Neural Networks_Convolution_model_StepByStep_1
       
  
  
 轉載自吳恩達老師深度學習練習notebook 
 Convolutional Neural Networks: Step by Step 
 Welcome to Course 4’s first assignment! In this assignment, you will implem 

  
 

    

    
    吳恩達深度學習2.2筆記_Improving Deep Neural Networks_優化演算法
       
  
  
 版權宣告：本文為博主原創文章，未經博主允許不得轉載。 https://blog.csdn.net/weixin_42432468 
 學習心得： 1、每週的視訊課程看一到兩遍 2、做筆記 
 3、做每週的作業練習，這個裡面的含金量非常高。先根據notebook過一遍，掌握後一定要自己敲一遍， 

  
 

    

    
    吳恩達深度學習4.4練習_Convolutional Neural Networks_Face Recognition for the Happy House
       
  
  
 轉載自吳恩達老師深度學習課程作業notebook 
 Face Recognition for the Happy House 
 Welcome to the first assignment of week 4! Here you will build a face recognitio 

  
 

    

    
    吳恩達深度學習流程3部分筆記--強烈推薦(這裡是重點Review的)
       
 
 
 第一個是github的https://github.com/marsggbo/deeplearning.ai_JupyterNotebooks，以下就是：
 第一章 神經網路與深度學習(Neural Network & Deeplearning)
 
  DeepLearning.ai學 

  
 

    

    
    吳恩達深度學習筆記3-Course1-Week3【淺層神經網路】
      
							
							
							淺層神經網路:





一、淺層神經網路的表示

本文中的淺層神經網路指的是 two layer nn 即 one input layer + one hidden layer + one output layer。通常計算神經網路的層數不包括 input l 

  
 

    

    
    吳恩達-深度學習-課程筆記-3: Python和向量化( Week 2 )
      有時   指數   檢查   都是   效果   很快   -1   tro   str   1 向量化( Vectorization )
在邏輯回歸中，以計算z為例，z = w的轉置和x進行內積運算再加上b，你可以用for循環來實現。
但是在python中z可以調用numpy的方法，直接一句z = np.d 

  
 

    

    
    吳恩達深度學習4.2練習_Convolutional Neural Networks_Happy House & Residual Networks
       
  
  
 1、Happy House 
  
  1.1、 Load Dataset 
  
  
  1.2、構建流圖：def HappyModel 
  
  
  1.3、PlaceHolder --> happyModel = HappyModel((64,64,3)) 
  
  
  

  
 

    

    
    吳恩達深度學習4.2練習_Convolutional Neural Networks_Residual Networks
       
  
  
 轉載自吳恩達老師深度學習課程作業notebook 
 Residual Networks 
 Welcome to the second assignment of this week! You will learn how to build very deep convolutional