
Wide & Deep Learning for Recommender Systems: Model Implementation

All of the code for this post has been uploaded to GitHub; follows and stars are welcome!

1. Dataset

The dataset is shown below. The last column is the label: a binary classification task predicting whether income exceeds $50K.

2. Wide Linear Model

There are two cases when handling categorical features:

  • All distinct values are known, and there are few of them: tf.feature_column.categorical_column_with_vocabulary_list
  • Not all values are known, or there are very many of them: tf.feature_column.categorical_column_with_hash_bucket
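Conceptually, a hash-bucket column just maps any string to a fixed range of integer ids. A stand-in sketch of that idea (TensorFlow actually uses its own fingerprint hash, not CRC32, and `hash_bucket` is a hypothetical helper name):

```python
import zlib

def hash_bucket(value, hash_bucket_size=1000):
    """Deterministically map a string to a bucket id in [0, hash_bucket_size)."""
    # Python's built-in hash() is randomized per process, so use a
    # stable stdlib hash instead.
    return zlib.crc32(value.encode('utf-8')) % hash_bucket_size

# Unseen values never fail; they simply land in some bucket. Distinct
# values may collide, which the model tolerates.
ids = {v: hash_bucket(v) for v in ['Tech-support', 'Craft-repair', 'Sales']}
```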

## 3.1 Base Categorical Feature Columns
# If we know all the values and there are not many of them
relationship = tf.feature_column.categorical_column_with_vocabulary_list(
    'relationship', [
        'Husband', 'Not-in-family', 'Wife', 'Own-child', 'Unmarried', 'Other-relative'
    ]
)

# If we do not know how many distinct values there are
occupation = tf.feature_column.categorical_column_with_hash_bucket(
    'occupation', hash_bucket_size=1000)

Raw continuous features: tf.feature_column.numeric_column

# 3.2 Base Continuous Feature Columns
age = tf.feature_column.numeric_column('age')
education_num = tf.feature_column.numeric_column('education_num')
capital_gain = tf.feature_column.numeric_column('capital_gain')
capital_loss = tf.feature_column.numeric_column('capital_loss')
hours_per_week = tf.feature_column.numeric_column('hours_per_week')

Discretizing continuous features: tf.feature_column.bucketized_column

# 3.2.1 Discretizing continuous features
# We do this because the relationship between a continuous feature and the label
# is not always linear. It may start out positive and later turn negative; such a
# piecewise relationship is no longer linear overall.
# bucketization
# 10 boundaries -> 11 buckets

age_buckets = tf.feature_column.bucketized_column(
    age, boundaries=[18, 25, 30, 35, 40, 45, 50, 55, 60, 65])
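The bucket lookup itself can be sketched in pure Python. With TF's left-inclusive boundaries, `bisect_right` reproduces the same bucket ids (a conceptual sketch, not TF's implementation):

```python
import bisect

boundaries = [18, 25, 30, 35, 40, 45, 50, 55, 60, 65]

def bucketize(value, boundaries):
    """Return the bucket index for a value: values below the first boundary
    map to bucket 0, values at or above the last boundary to bucket 10."""
    return bisect.bisect_right(boundaries, value)
```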

Combined / crossed features: tf.feature_column.crossed_column

# 3.3 Combined / crossed features
education_x_occupation = tf.feature_column.crossed_column(
    ['education', 'occupation'], hash_bucket_size=1000)

age_buckets_x_education_x_occupation = tf.feature_column.crossed_column(
    [age_buckets, 'education', 'occupation'], hash_bucket_size=1000
)
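A crossed column conceptually hashes the combination of values into a single bucket id, so each (education, occupation) pair gets its own id distinct from either value alone. A stand-in sketch (TF uses its own fingerprint hash; the joining scheme below is made up for illustration):

```python
import zlib

def crossed_bucket(values, hash_bucket_size=1000):
    """Hash a tuple of feature values into one bucket id."""
    key = '_X_'.join(values)  # hypothetical joining scheme
    return zlib.crc32(key.encode('utf-8')) % hash_bucket_size

a = crossed_bucket(('Bachelors', 'Exec-managerial'))
b = crossed_bucket(('Bachelors', 'Craft-repair'))
```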

Assembling the model: here we mainly use categorical features + crossed features.

# 4. The Model
"""
The feature columns defined so far:
1. CategoricalColumn
2. NumericColumn
3. BucketizedColumn
4. CrossedColumn
All of these are subclasses of FeatureColumn, so they can be used together.
"""
base_columns = [
    education, marital_status, relationship, workclass, occupation,
    age_buckets,
]

crossed_columns = [
    tf.feature_column.crossed_column(
        ['education', 'occupation'], hash_bucket_size=1000
    ),
    tf.feature_column.crossed_column(
        [age_buckets, 'education', 'occupation'], hash_bucket_size=1000
    )
]

model_dir = "./model/wide_component"
model = tf.estimator.LinearClassifier(
    model_dir=model_dir, feature_columns=base_columns + crossed_columns
)

Training & Evaluation

# 5. Train & Evaluate & Predict
model.train(input_fn=lambda: input_fn(data_file=train_file, num_epochs=1, shuffle=True, batch_size=512))
results = model.evaluate(input_fn=lambda: input_fn(val_file, 1, False, 512))
for key in sorted(results):
    print("{0:20}: {1:.4f}".format(key, results[key]))

Results

Parsing ./data/adult.data
2018-12-21 15:39:37.182512: I T:\src\github\tensorflow\tensorflow\core\platform\cpu_feature_guard.cc:140] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX2
Parsing ./data/adult.data
accuracy            : 0.8436
accuracy_baseline   : 0.7592
auc                 : 0.8944
auc_precision_recall: 0.7239
average_loss        : 0.3395
global_step         : 256.0000
label/mean          : 0.2408
loss                : 172.7150
prediction/mean     : 0.2416
Parsing ./data/adult.test

3. Wide & Deep Model

Features used by the Deep part: raw continuous features + embeddings of the categorical features.

Building on the Wide model, we add the Deep part:
the categorical features are embedded and then concatenated with the continuous features.
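That concatenation step can be sketched in plain Python (the bucket id and feature values below are hypothetical; in the real model the estimator performs this lookup and the embedding table is trained, not fixed):

```python
import random

random.seed(0)
num_buckets, dim = 1000, 8

# Randomly initialized embedding table: one dim-length vector per bucket
embedding_table = [[random.uniform(-0.1, 0.1) for _ in range(dim)]
                   for _ in range(num_buckets)]

occupation_id = 417                           # hypothetical hashed bucket id
continuous = [39.0, 13.0, 2174.0, 0.0, 40.0]  # age, education_num, gains, ...

# The deep input is the embedding vector concatenated with the raw
# continuous features: length 8 + 5 = 13.
deep_input = embedding_table[occupation_id] + continuous
```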

# 3. The Deep Model: Neural Network with Embeddings
"""
1. Sparse features -> embedding vectors -> concatenate(embedding vectors, continuous features) -> feed into hidden layers
2. Embedding values are randomly initialized
3. Another way to handle categorical features is a one-hot or multi-hot
   representation, but that only suits low-dimensional features; embeddings
   are the more general approach.
4. embedding_column (embedding); indicator_column (multi-hot)
"""

deep_columns = [
    age,
    education_num,
    capital_gain,
    capital_loss,
    hours_per_week,

    # One-hot encode the categorical columns that have few categories
    tf.feature_column.indicator_column(workclass),
    tf.feature_column.indicator_column(education),
    tf.feature_column.indicator_column(marital_status),
    tf.feature_column.indicator_column(relationship),

    # Shown here as an embedding example; in practice the embedding
    # dimension is often set by the rule of thumb: categories ** 0.25
    tf.feature_column.embedding_column(occupation, dimension=8)
]
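The rule of thumb mentioned in the comment above (embedding dimension roughly categories ** 0.25, i.e. the fourth root of the number of distinct categories) can be computed like this; rounding up is one common convention, not a fixed rule:

```python
import math

def embedding_dim(num_categories):
    """Rule-of-thumb embedding size: fourth root of the category count,
    rounded up."""
    return math.ceil(num_categories ** 0.25)

# e.g. a hash bucket of 1000 occupations suggests a dimension of about 6,
# close to the dimension=8 used above.
dim_for_occupation = embedding_dim(1000)
```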

Combining Wide & Deep: DNNLinearCombinedClassifier

# 4. Combine Wide & Deep
model = tf.estimator.DNNLinearCombinedClassifier(
    model_dir=model_dir,
    linear_feature_columns=base_columns + crossed_columns,
    dnn_feature_columns=deep_columns,
    dnn_hidden_units=[100, 50]
)

Training & Evaluation

for n in range(train_epochs // epochs_per_eval):
    model.train(input_fn=lambda: input_fn(train_file, epochs_per_eval, True, batch_size))
    results = model.evaluate(input_fn=lambda: input_fn(
        test_file, 1, False, batch_size
    ))

    # Display Eval results
    print("Results at epoch {0}".format((n+1) * epochs_per_eval))
    print('-'*30)

    for key in sorted(results):
        print("{0:20}: {1:.4f}".format(key, results[key]))

Results

Parsing ./data/adult.data
2018-12-21 15:35:49.183730: I T:\src\github\tensorflow\tensorflow\core\platform\cpu_feature_guard.cc:140] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX2
Parsing ./data/adult.test
Results at epoch 2
------------------------------
accuracy            : 0.8439
accuracy_baseline   : 0.7638
auc                 : 0.8916
auc_precision_recall: 0.7433
average_loss        : 0.3431
global_step         : 6516.0000
label/mean          : 0.2362
loss                : 13.6899
prediction/mean     : 0.2274
Parsing ./data/adult.data
Parsing ./data/adult.test
Results at epoch 4
------------------------------
accuracy            : 0.8529
accuracy_baseline   : 0.7638
auc                 : 0.8970
auc_precision_recall: 0.7583
average_loss        : 0.3335
global_step         : 8145.0000
label/mean          : 0.2362
loss                : 13.3099
prediction/mean     : 0.2345
Parsing ./data/adult.data
Parsing ./data/adult.test
Results at epoch 6
------------------------------
accuracy            : 0.8540
accuracy_baseline   : 0.7638
auc                 : 0.8994
auc_precision_recall: 0.7623
average_loss        : 0.3297
global_step         : 9774.0000
label/mean          : 0.2362
loss                : 13.1567
prediction/mean     : 0.2398

Process finished with exit code 0

References

  1. Wide & Deep Learning for Recommender Systems (the original paper)
  2. Google AI Blog, Wide & Deep Learning: Better Together with TensorFlow: https://ai.googleblog.com/2016/06/wide-deep-learning-better-together-with.html
  3. TensorFlow Linear Model Tutorial: https://www.tensorflow.org/tutorials/wide
  4. TensorFlow Wide & Deep Learning Tutorial: https://www.tensorflow.org/tutorials/wide_and_deep
  5. Introduction to TensorFlow Datasets and Estimators: http://developers.googleblog.cn/2017/09/tensorflow.html
  6. absl: https://github.com/abseil/abseil-py/blob/master/smoke_tests/sample_app.py
  7. Wide & Deep: Theory and Practice
  8. Wide & Deep Learning for Recommender Systems: Paper Reading Notes