
User Data Synchronization



The file name carries today's date, while the data it contains is yesterday's. The script therefore uses dayid (yesterday) as the data day and filedayid = dayid + 1 day (today) to match the file name. Full synchronization job:
dayid=`date -d "1 days ago" +%Y%m%d`
##############################################
# Function: return the date string obtained by adding an offset
#           (in days) to a start date
# Args:     start date, yyyymmdd
#           offset in days
# Returns:  start date plus the offset, yyyymmdd
##############################################
function toDate()
{
   startdate=$1
   days=$2
   timestamp_startdate=`date -d ${startdate} +%s`
   timestamp_resultdate=`expr ${timestamp_startdate} '+' ${days} '*' 86400`
   resultdate=`date -d @${timestamp_resultdate} +%Y%m%d`
   echo $resultdate
}
filedayid=`toDate $dayid 1`
spark-submit --class="com.zyuc.stat.iot.etl.UserInfoETL" \
--master yarn-client \
--name UserInfoETL \
--conf "spark.app.appName=UserInfoETL" \
--conf "spark.app.dataDayid=${dayid}" \
--conf "spark.app.userTable=iot_customer_userinfo" \
--conf "spark.app.syncType=full" \
--conf "spark.app.inputPath=/hadoop/IOT/ANALY_PLATFORM/BasicData/UserInfo/" \
--conf "spark.app.outputPath=/hadoop/IOT/ANALY_PLATFORM/BasicData/output/UserInfo/" \
--conf "spark.app.fileWildcard=all_userinfo_qureyes_${filedayid}*" \
--conf "spark.app.vpdnInput=/hadoop/IOT/ANALY_PLATFORM/BasicData/VPDNProvince/" \
--conf "spark.app.vpdnWildcard=vpdninfo.txt" \
--conf spark.yarn.executor.memoryOverhead=700 \
--executor-memory 2G \
--executor-cores 1 \
--num-executors 6 \
/slview/test/zcw/jars/userETL.jar
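A quick sanity check of the date arithmetic (the values shown assume the job runs on 2017-07-15, so the data day is 20170714):

dayid=`date -d "1 days ago" +%Y%m%d`   # e.g. 20170714 (data day)
filedayid=`toDate $dayid 1`            # e.g. 20170715 (file-name day)

# With GNU date (already required by the date -d calls above),
# the same offset can be computed in a single call, without expr:
filedayid=`date -d "${dayid} 1 days" +%Y%m%d`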




The incremental synchronization job is identical except for two parameters: spark.app.syncType=incr and the file wildcard incr_userinfo_qureyes_${filedayid}*. The complete incremental script, as used for scheduling, is shown at the end of this section.

Example values for spark.app.fileWildcard:

all_userinfo_qureyes_20170714*     (full sync)
incr_userinfo_qureyes_20170715*    (incremental sync)
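Before launching a job, the wildcard can be expanded directly against HDFS to confirm which input files it matches (standard HDFS client commands; the paths are the input directories from the jobs above):

hdfs dfs -ls /hadoop/IOT/ANALY_PLATFORM/BasicData/UserInfo/all_userinfo_qureyes_20170714*
hdfs dfs -ls /hadoop/IOT/ANALY_PLATFORM/BasicData/UserInfo/incr_userinfo_qureyes_20170715*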



Hive table for the synchronized user data:

create table iot_customer_userinfo(
    vpdncompanycode string,
    mdn string,
    imsicdma string,
    imsilte string,
    iccid string,
    imei string,
    company string,
    nettype string,
    vpdndomain string,
    isvpdn string,
    subscribetimeaaa string,
    subscribetimehlr string,
    subscribetimehss string,
    subscribetimepcrf string,
    firstactivetime string,
    userstatus string,
    atrbprovince string,
    userprovince string,
    crttime string,
    custProvince string
)
partitioned by (d int)
stored as orc
location '/hadoop/IOT/ANALY_PLATFORM/BasicData/output/UserInfo/data/';
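To confirm the table definition and see which day partitions have been registered so far (standard Hive statements, run here via the CLI):

hive -e "describe formatted iot_customer_userinfo"
hive -e "show partitions iot_customer_userinfo"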

alter table iot_customer_userinfo add IF NOT EXISTS partition(d='20170714');
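Partition registration can be folded into the daily scheduling script so that each run registers the day it just wrote; a minimal sketch, assuming the hive CLI is available on the scheduling host:

# Register the data day's partition after the Spark job has finished
hive -e "alter table iot_customer_userinfo add IF NOT EXISTS partition(d='${dayid}')"

# Spot-check the freshly loaded partition (the column list is illustrative)
hive -e "select mdn, vpdncompanycode, userstatus from iot_customer_userinfo where d=${dayid} limit 10"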

Shell scheduling script for the incremental data:
$ cat userETL.sh 
dayid=$1
if [ -z "$dayid" ] ; then
    dayid=`date -d "1 days ago" "+%Y%m%d"`
fi
##############################################
# Function: return the date string obtained by adding an offset
#           (in days) to a start date
# Args:     start date, yyyymmdd
#           offset in days
# Returns:  start date plus the offset, yyyymmdd
###############################################
function toDate()
{
   startdate=$1;
   days=$2;
   timestamp_startdate=`date -d ${startdate} +%s`
   timestamp_resultdate=`expr ${timestamp_startdate} '+' ${days} '*' 86400`
   resultdate=`date -d @${timestamp_resultdate} +%Y%m%d`
   echo $resultdate
}
filedayid=`toDate $dayid 1` 
spark-submit --class="com.zyuc.stat.iot.etl.UserInfoETL" \
--master yarn-client \
--name UserInfoETL \
--conf "spark.app.appName=UserInfoETL" \
--conf "spark.app.dataDayid=${dayid}" \
--conf "spark.app.userTable=iot_customer_userinfo" \
--conf "spark.app.syncType=incr" \
--conf "spark.app.inputPath=/hadoop/IOT/ANALY_PLATFORM/BasicData/UserInfo/" \
--conf "spark.app.outputPath=/hadoop/IOT/ANALY_PLATFORM/BasicData/output/UserInfo/" \
--conf "spark.app.fileWildcard=incr_userinfo_qureyes_${filedayid}*" \
--conf "spark.app.vpdnInput=/hadoop/IOT/ANALY_PLATFORM/BasicData/VPDNProvince/" \
--conf "spark.app.vpdnWildcard=vpdninfo.txt" \
--conf spark.yarn.executor.memoryOverhead=700 \
--executor-memory 2G \
--executor-cores 1 \
--num-executors 6 \
/slview/test/zcw/shell/userinfo/jars/userETL.jar > /slview/test/zcw/shell/userinfo/logs/${dayid}.log 2>&1
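The script takes an optional data day as its first argument and defaults to yesterday, so the same file serves both daily scheduling and manual backfill. A possible crontab entry (the 02:00 run time is an assumption for illustration, not from the original setup):

# Daily incremental sync at 02:00
0 2 * * * /slview/test/zcw/shell/userinfo/userETL.sh

# Manual backfill for a specific data day
sh /slview/test/zcw/shell/userinfo/userETL.sh 20170713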