AliMe Chat: A Sequence to Sequence and Rerank based Chatbot Engine論文筆記

阿新 • • 發佈：2018-12-21

摘要

阿里小蜜是開放域的問答系統，是檢索式問答系統和生成式問答系統的結合體。

框架

直接上流程圖，比較清晰

使用者輸入一個問題q，先採用IR(Information Retrieval)模型檢索出一些資料庫中的QA對作為候選，然後採用attentive Seq2Seq模型對上述檢索出的候選答案進行重新排序，如果排名第一的候選答案的得分高於某個閾值，將此答案作為標準答案輸出，否則輸出基於attentive Seq2Seq模型生產的答案。

此框架包含三個模型：1）IR模型；2）生成式模型； 3)重排模型（對候選答案進行重排）

模組講解

IR模型

採用的演算法為

BM25，主要是計算使用者問題和語料庫中問題的相似度，將最相似k個（論文中k=10）QA對作為候選集。在採用BM25之前，對語料庫中的所有問題進行分詞（不做word embedding），然後通過將每個次對映到包含該詞的方式對所有問題建立倒序索引（原文：we build an inverted index for the set of all 9,164,834 questions by mapping each word to a set of questions that contain that word，PS:具體做法和這麼做得目的我還沒想清楚，希望知道的留下自己的idea）；對於使用者的問題，進行分詞、去停用詞、利用近義詞擴充套件相關性，然後採用BM25演算法找回k個最相似的QA對。

生成式模型

採用的是 attentive Seq2Seq框架。假設，在位置i產生詞yi的概率為

其中f為計算概率的非線性函式，si-1為輸出位置i-1處的隱含層狀態，ci為取決於的上下文向量，為輸入序列的隱含層狀態，，，下圖所示為i=3,m=4時的情況

迴圈網路單元選用的是GRU，輸入資料處理採用Bucketing和pading，定義五個（5,5）（5,10）（10,15）（20,30）（45,60）五個buckets，假如問題為4個詞，答案為8個詞，要採用(5,10)，即通過新增“_PAD”符號，將問題擴充套件為5個詞，將答案擴充套件為10個詞放到Attentive Seq2Seq中處理

Attentive Seq2Seq的輸出採用Beam search，每time step包含top-k(k=10)

重排模型

還是採用Attentive Seq2Seq模型，採用平均概率作為得分如下，將每個候選答案當作詞序列

實驗

實驗就不細講了，比較簡單，無非就是以某些標準和現有的chatbot進行pk然後贏了的故事，直接上一張圖

AliMe Chat: A Sequence to Sequence and Rerank based Chatbot Engine論文筆記

摘要阿里小蜜是開放域的問答系統，是檢索式問答系統和生成式問答系統的結合體。框架直接上流程圖，比較清晰使用者輸入一個問題q，先採用IR(Information Retrieval)模型檢索出一些資料庫中的QA對作為候選，然後採用attentive Seq2Seq模型對上述

論文筆記-Sequence to Sequence Learning with Neural Networks

map tran between work down all 9.png ever onf 大體思想和RNN encoder-decoder是一樣的，只是用來LSTM來實現。 paper提到三個important point： 1）encoder和decoder的LSTM

ZooKeeper Administrator's Guide A Guide to Deployment and Administration（吃別人嚼過的饃沒意思，直接看官網資料）

section pla dconf trace log content dir exc everyone efi Deployment System Requirements Supported Platforms Required Software Clus

Seq2Seq sequence-to-sequence模型簡介

enc art 翻譯文本序列聊天機器人問題 .net 自動問答 Sequence-to-sequence (seq2seq) 模型。突破了傳統的固定大小輸入問題框架開創了將DNN運用於翻譯、聊天(問答)這類序列型任務的先河並且在各主流語言之間的相互翻譯，和語

TensorFlow中Sequence-to-Sequence樣例程式碼詳解

　　在NLP領域，sequence to sequence模型有很多應用，比如機器翻譯、自動應答機器人等。在看懂了相關的論文後，我開始研讀TensorFlow提供的原始碼，剛開始看時感覺非常晦澀，現在基本都弄懂了，我在這裡主要介紹Sequence-to-Sequence Models用到

機器翻譯模型之Fairseq：《Convolutional Sequence to Sequence Learning》

近年來，NLP領域發展迅速，而機器翻譯是其中比較成功的一個應用，自從2016年穀歌宣佈新一代谷歌翻譯系統上線，神經機器翻譯（NMT，neural machine translation）就取代了統計機器翻譯（SMT，statistical machine translation），在翻譯

【論文閱讀】Sequence to Sequence Learning with Neural Networks

看論文時查的知識點前饋神經網路就是一層的節點只有前面一層作為輸入，並輸出到後面一層，自身之間、與其它層之間都沒有聯絡，由於資料是一層層向前傳播的，因此稱為前饋網路。 BP網路是最常見的一種前饋網路，BP體現在運作機制上，資料輸入後，一層層向前傳播，然後計算損失函式，得到損失函式的殘差

Sequence to Sequence Learning with Neural Networks

用神經網路進行序列到序列的學習摘要 1.介紹 2.模型 3.實驗 3.1 Dataset details 3.2 Decoding and Rescoring 3.3 Reversing the Source Sent

Facebook的Fairseq模型詳解(Convolutional Sequence to Sequence Learning)

1. 前言近年來，NLP領域發展迅速，而機器翻譯是其中比較成功的一個應用，自從2016年穀歌宣佈新一代谷歌翻譯系統上線，神經機器翻譯（NMT，neural machine translation）就取代了統計機器翻譯（SMT，statistical machine translation），在翻譯質量上面

a place to read and write big ideas and important stories

Welcome to Medium, where words matter.We’ll deliver the best stories and ideas on the topics you care about most straight to your homepage, app, or inbox.

A model to predict and quantify racism, sexism, and other unequal treatment: Researchers show direct connection between stereoty

But from a scientific perspective, making a direct connection between people's biases and the degree to which they treat others differently is tricky. The

AliMe Chat: A Sequence to Sequence and Rerank based Chatbot Engine論文筆記

摘要

框架

模組講解

IR模型

生成式模型

重排模型

實驗

AliMe Chat: A Sequence to Sequence and Rerank based Chatbot Engine論文筆記

論文筆記-Sequence to Sequence Learning with Neural Networks

ZooKeeper Administrator's Guide A Guide to Deployment and Administration（吃別人嚼過的饃沒意思，直接看官網資料）

Seq2Seq sequence-to-sequence模型簡介

TensorFlow中Sequence-to-Sequence樣例程式碼詳解

機器翻譯模型之Fairseq：《Convolutional Sequence to Sequence Learning》

【論文閱讀】Sequence to Sequence Learning with Neural Networks

Sequence to Sequence Learning with Neural Networks

Facebook的Fairseq模型詳解(Convolutional Sequence to Sequence Learning)

a place to read and write big ideas and important stories

A model to predict and quantify racism, sexism, and other unequal treatment: Researchers show direct connection between stereoty

Convolutional Sequence to Sequence Learning筆記

The design and implementation of a system to detect and filter large sessions automatically

Sequence to Sequence 實現機器翻譯（keras demo）

keras實現attention based sequence to sequence model(首稿)

（翻譯）Sequence to Sequence Learning with Neural Networks

基於CNN的Seq2Seq模型-Convolutional Sequence to Sequence Learning

NLP中Sequence-to-Sequence model程式碼詳解

深度學習方法（八）：自然語言處理中的Encoder-Decoder模型，基本Sequence to Sequence模型

論文復現Sequence to sequence learning with neural networks

AliMe Chat: A Sequence to Sequence and Rerank based Chatbot Engine論文筆記

摘要

框架

模組講解

IR模型

生成式模型

重排模型

實驗

相關推薦