CBHG 模組來自TACOTRON: TOWARDS END-TO-END SPEECH SYNTHESIS

阿新 • • 發佈：2018-11-20

作者的靈感來源於在文章Fully Character-Level Neural Machine Translation without Explicit Segmentation中的模型。原型如下圖所示：

CBHG模組如下圖所示。首次提出在Goggle的一篇文章：TACOTRON: TOWARDS END-TO-END SPEECH SYNTHESIS

回到CBHG模組，該模組善於提取序列特徵。下面為模組步驟：

1.輸入序列，先經過K個1-D卷積，第K個卷積核（filter）通道為k。這些卷積核可以對當前以及上下文資訊有效建模。

2.卷積輸出被堆疊（stack）一起，沿著時間軸最大池化（maxpooling）以增加當前資訊不變性。stride取為1維持時間解析度。

3.然後輸入到幾個固定寬度的1-D卷積，將輸出增加到起始的輸入序列（參考ResNet連線方式）。所有的卷積都採用Batch Normalization.

4.輸入多層的highway 網路。用以提取更高級別的特徵。highway網路可以參考https://blog.csdn.net/l494926429/article/details/51737883

5.最後在頂部加入雙向GRU，用於提取序列的上下文特徵。

與原文中的不同是加入了batch normlization,殘差連線以及stride=1的最大池化，表現比原文更好。

CBHG 模組來自TACOTRON: TOWARDS END-TO-END SPEECH SYNTHESIS

作者的靈感來源於在文章Fully Character-Level Neural Machine Translation without Explicit Segmentation中的模型。原型如下圖所示： CBHG模組如下圖所示。首次提出在Goggle的一篇文章：TACO

DeepVO: Towards End-to-End Visual Odometry with Deep Recurrent Convolutional Neural Networks

step with 圖片 eight enter sub img layer each 1、Introduction DL解決VO問題：End-to-End VO with RCNN 2、Network structure a.CNN based Feature Ext

Towards End-to-end Text Spotting with Convolutional Recurrent Neural Networks閱讀筆記

1.摘要論文提出一種統一的網路結構模型，這種模型可以直接通過一次前向計算就可以同時實現對影象中文字定位和識別的任務。這種網路結構可以直接以end-to-end的方式訓練，訓練的時候只需要輸入影象，影象中文字的bbox，以及文字對應的標籤資訊。這種end-to-end訓練的

論文筆記|Towards End-to-End Lane Detection: an Instance Segmentation

用盡量少的語言描述一篇paper 本文看點：結合embedding和Segmentation mask提供一種做Lane Instance Segmentation的思路 Lane的Instance Segmentation可以比單純的Segmentati

Overview：end-to-end深度學習網絡在超分辨領域的應用（待續）

向量不同的這就是 src dimens sep max pos pca 目錄 1. SRCNN Contribution Inspiration Network O. Pre-processing I. Patch extraction and representat

【USE】《An End-to-End System for Automatic Urinary Particle Recognition with CNN》

Urine Sediment Examination（USE） JMOS-2018 目錄目錄 1 Background and Motivation 2 Innovation

端到端的學習end-to-end learning （理解）

傳統的機器學習的流程是由多個獨立的模組組成，每一個獨立的任務其結果的好壞都會影響到下一個步驟，從而影響到整個訓練的結果，這個是非端到端的而深度學習模型在訓練過程中，從輸入端（輸入資料）到輸出端會得到一個預測結果，與真實結果相比較會得到一個誤差，這個誤差會在模型中的每一層傳遞（反向傳播），每一層

目標檢測中對端對端（End to end）的理解

End to end：指的是輸入原始資料，輸出的是最後結果，應用在特徵學習融入演算法，無需單獨處理。 end-to-end（端對端）的方法，一端輸入我的原始資料，一端輸出我想得到的結果。只關心輸入和輸出，中間的步驟全部都不管。　　端到端指的是輸入是原始資料，輸出是最後的結果，原來輸入端不是

Direct Shape Regression Networks for End-to-End Face Alignment

端到端人臉對齊的直接形狀迴歸網路1 主要的挑戰在於人臉影象和相關的面部形狀之間的高度非線性關係，這種非線性關係是基於標記的相關性耦合。現有的方法主要依賴於級聯迴歸，存在固有的缺點，例如對初始化的強依賴性和未能利用相關的標記。本文提出了一種**直接形狀迴歸網路（direct shap

FlowTrack－End-to-end Flow Correlation Tracking with Spatial-temporal Attention(CVPR2018)

動機：大多數DCF方法僅考慮當前幀的特徵，而很少受益於運動和幀間資訊。發生遮擋和形變時，時間資訊缺失導致效能減低。本文提出FlowTrack，利用連續幀中豐富的光流資訊來改善特徵表示和跟蹤精度。具體是將光流估計，特徵提取，聚合和相關濾波器跟蹤制定為網路中的特殊層，從而實現端到端學習。這種在深度

《An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its...》論文閱讀之CRNN

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition paper: CRNN 翻譯：CRNN

《End-to-End Learning of Motion Representation for Video Understanding》論文閱讀

CVPR 2018 | 騰訊AI Lab、MIT等機構提出TVNet：可端到端學習視訊的運動表徵動機儘管端到端的特徵學習已經取得了重要的進展，但是人工設計的光流特徵仍然被廣泛用於各類視訊分析任務中。為了彌補這個不足而提出；以前的方法：

深度學習論文翻譯解析（二）：An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

論文標題：An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition 論文作者： Baoguang Shi, Xiang B

CBHG 模組來自TACOTRON: TOWARDS END-TO-END SPEECH SYNTHESIS

CBHG 模組來自TACOTRON: TOWARDS END-TO-END SPEECH SYNTHESIS

DeepVO: Towards End-to-End Visual Odometry with Deep Recurrent Convolutional Neural Networks

Towards End-to-end Text Spotting with Convolutional Recurrent Neural Networks閱讀筆記

論文筆記|Towards End-to-End Lane Detection: an Instance Segmentation

Overview：end-to-end深度學習網絡在超分辨領域的應用（待續）

【USE】《An End-to-End System for Automatic Urinary Particle Recognition with CNN》

端到端的學習end-to-end learning （理解）

目標檢測中對端對端（End to end）的理解

Direct Shape Regression Networks for End-to-End Face Alignment

FlowTrack－End-to-end Flow Correlation Tracking with Spatial-temporal Attention(CVPR2018)

《An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its...》論文閱讀之CRNN

《End-to-End Learning of Motion Representation for Video Understanding》論文閱讀

深度學習論文翻譯解析（二）：An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

機器學習專案開發過程（End-to-End Machine Learning Project）

End-to-end recovery of human shape and pose閱讀筆記

end-to-end-for-chinese-plate-recognition車牌識別

End-To-End Memory Network 學習整理

A Hierarchical End-to-End Model for Jointly Improving Text Summarization and Sentiment Classificatio

「Medical Image Analysis」Note on End-to-end DP with CNN (EDPCNN)

MFCNET: END-TO-END APPROACH FOR CHANGE DETECTION IN IMAGES

CBHG 模組 來自TACOTRON: TOWARDS END-TO-END SPEECH SYNTHESIS

相關推薦

CBHG 模組來自TACOTRON: TOWARDS END-TO-END SPEECH SYNTHESIS