tf.contrib.rnn.static_bidirectional_rnn和MultiRNNCell構建多層靜態雙向LSTM
阿新 • 發佈:2018-12-10
# Demo: build a 3-layer static bidirectional LSTM with
# tf.contrib.rnn.static_bidirectional_rnn and MultiRNNCell (TensorFlow 1.x API)
# and print the shapes of the outputs and final states.
import tensorflow as tf
import numpy as np

# Training hyper-parameters
learning_rate = 0.01
max_examples = 40    # batch dimension of the placeholders below
batch_size = 128
display_step = 10    # report training progress every 10 steps
n_input = 100        # word-embedding dimension
n_steps = 300        # number of time steps
fw_n_hidden = 256    # hidden units in each forward-direction LSTM layer
bw_n_hidden = 128    # hidden units in each backward-direction LSTM layer
n_classes = 10

x = tf.placeholder("float", [max_examples, n_steps, n_input])
y = tf.placeholder('float', [max_examples, n_classes])
# Output projection: concatenated fw+bw hidden state -> class logits.
weights = tf.Variable(tf.random_normal([(fw_n_hidden + bw_n_hidden), n_classes]))
biases = tf.Variable(tf.random_normal([n_classes]))

# The static RNN API wants a length-n_steps list of [batch, n_input] tensors,
# so reshape [batch, steps, input] -> [steps, batch, input] -> list of steps.
x = tf.transpose(x, [1, 0, 2])
print(x.shape)
x = tf.reshape(x, [-1, n_input])
print(x.shape)
x = tf.split(x, n_steps)
print(len(x), x[0].shape)

# Three stacked LSTM layers per direction. Each layer needs its own cell
# object — reusing one cell instance across layers would share variables.
lstm_fw_cell = []
lstm_bw_cell = []
for i in range(3):
    lstm_fw_cell.append(tf.contrib.rnn.BasicLSTMCell(fw_n_hidden, forget_bias=1.0))
    lstm_bw_cell.append(tf.contrib.rnn.BasicLSTMCell(bw_n_hidden, forget_bias=1.0))
mul_lstm_fw_cell = tf.contrib.rnn.MultiRNNCell(lstm_fw_cell)
mul_lstm_bw_cell = tf.contrib.rnn.MultiRNNCell(lstm_bw_cell)

outputs, fw_state, bw_state = tf.contrib.rnn.static_bidirectional_rnn(
    mul_lstm_fw_cell, mul_lstm_bw_cell, x, dtype=tf.float32)

print(len(outputs))      # 300, equals n_steps; outputs[-1] (last step) is typically used
print(outputs[0].shape)  # (40, 384) = (batch, fw_n_hidden + bw_n_hidden)
print(outputs[-1].shape) # (40, 384)
print(len(fw_state))     # 3 — one LSTMStateTuple per stacked layer

# c states of the forward RNN's three layers
print(fw_state[0].c.shape)  # (40, 256)
print(fw_state[1].c.shape)  # (40, 256)
print(fw_state[2].c.shape)  # (40, 256)
# h states of the forward RNN's three layers
print(fw_state[0].h.shape)  # (40, 256)
print(fw_state[1].h.shape)  # (40, 256)
print(fw_state[2].h.shape)  # (40, 256)
# c states of the backward RNN's three layers
# (bw_n_hidden = 128, so these are (40, 128), not (40, 256))
print(bw_state[0].c.shape)  # (40, 128)
print(bw_state[1].c.shape)  # (40, 128)
print(bw_state[2].c.shape)  # (40, 128)
# h states of the backward RNN's three layers
print(bw_state[0].h.shape)  # (40, 128)
print(bw_state[1].h.shape)  # (40, 128)
print(bw_state[2].h.shape)  # (40, 128)