Trident in Storm 1.0.6

阿新 • Published: 2018-12-23

http://storm.apache.org/releases/1.0.6/Trident-tutorial.html
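Trident is a high-level abstraction for real-time computation on top of Storm. It lets you seamlessly combine high-throughput, stateful stream processing with low-latency distributed querying. Trident provides operations such as joins, aggregations, grouping, functions, and filters.
Take counting the words in a stream of sentences as an example.
To demonstrate stream processing, first create a spout that emits an endless stream of test data: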

FixedBatchSpout spout = new FixedBatchSpout(new Fields("sentence"), 3,
               new Values("the cow jumped over the moon"),
               new Values("the man went to the store and bought some candy"),
               new Values("four score and seven years ago"),
               new Values("how many apples can you eat"));
spout.setCycle(true); // replay the sentences endlessly so the stream never runs dry
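
To make the batching idea concrete, here is a minimal plain-Java sketch of a source that hands out fixed-size batches and starts over when its data runs out. It is only a simplified model: `CyclingBatchSource` and `nextBatch` are hypothetical names, the class does not use Storm at all, and it assumes that the `3` passed to the spout is the maximum batch size and that wrapping around happens only at a batch boundary. The real FixedBatchSpout may behave differently in its details.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Simplified plain-Java model of a cycling, fixed-size batch source.
// Hypothetical class; it does not use or reproduce Storm's FixedBatchSpout.
class CyclingBatchSource {
    private final List<String> outputs;
    private final int maxBatchSize;
    private int index = 0; // position of the next item to emit

    CyclingBatchSource(int maxBatchSize, String... outputs) {
        this.maxBatchSize = maxBatchSize;
        this.outputs = Arrays.asList(outputs);
    }

    // Returns the next batch: at most maxBatchSize items, never crossing the end of the list.
    List<String> nextBatch() {
        if (index >= outputs.size()) {
            index = 0; // cycle: start over once the data is exhausted
        }
        List<String> batch = new ArrayList<>();
        for (int i = 0; i < maxBatchSize && index < outputs.size(); i++, index++) {
            batch.add(outputs.get(index));
        }
        return batch;
    }
}
```

Under these assumptions, four sentences and a batch size of 3 give batches of three sentences, then one sentence, then three again, and so on for as long as `nextBatch()` is called.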

Next, create a TridentTopology to process the stream produced by the spout:

TridentTopology topology = new TridentTopology();
TridentState wordCounts =
     topology.newStream("spout1", spout)
       .each(new Fields("sentence"), new Split(), new Fields("word"))
       .groupBy(new Fields("word"))
       .persistentAggregate(new MemoryMapState.Factory(), new Count(), new Fields("count"))
       .parallelismHint(6);

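Each step above is a stream operation. newStream creates a stream that reads from an input source; the source could just as well be a queue broker such as Kestrel or Kafka. Trident stores metadata about which data has already been processed in Zookeeper, and the string "spout1" names the place where that metadata is kept. The stream is processed as small batches of tuples, and Trident provides a rich API for working with those batches.
In each(), the Split function is applied to every tuple in the stream. It reads the "sentence" field and breaks it into words, so a single sentence produces multiple tuples, and the emitted values go into a new field named "word".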

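import org.apache.storm.trident.operation.BaseFunction;
import org.apache.storm.trident.operation.TridentCollector;
import org.apache.storm.trident.tuple.TridentTuple;
import org.apache.storm.tuple.Values;

public class Split extends BaseFunction {
   @Override
   public void execute(TridentTuple tuple, TridentCollector collector) {
       // Field 0 is "sentence", the only input field selected in each().
       String sentence = tuple.getString(0);
       for (String word : sentence.split(" ")) {
           collector.emit(new Values(word)); // one output tuple per word
       }
   }
}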
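
The splitting step can be tried outside Storm with a small plain-Java sketch. `SplitSketch` and `splitSentence` are hypothetical names used only for illustration. Unlike the Split function above, which splits on a single space, this variant splits on runs of whitespace and drops empty strings; that is an assumption about how messy input might be handled, not something the Split function does.

```java
import java.util.ArrayList;
import java.util.List;

// Plain-Java illustration of sentence splitting; hypothetical helper, no Storm dependency.
class SplitSketch {
    // Returns the words of a sentence, one list element per word that would be emitted.
    static List<String> splitSentence(String sentence) {
        List<String> words = new ArrayList<>();
        // Assumption: split on any run of whitespace instead of a single space.
        for (String word : sentence.trim().split("\\s+")) {
            if (!word.isEmpty()) { // an empty input yields one empty string; skip it
                words.add(word);
            }
        }
        return words;
    }
}
```

Splitting on a single space, as Split does, is enough for the fixed test sentences, but consecutive spaces in real input would then produce empty "words" that end up being counted.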

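Next, the words are counted and the counts are persisted. The stream is first grouped by the "word" field, and each group is then aggregated with Count(); persistentAggregate stores the result in the state created by MemoryMapState.Factory(), which here keeps the counts in memory. Trident is fault tolerant and provides exactly-once processing semantics.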
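
The grouping and counting logic can be sketched in plain Java as follows. This is an illustration of the idea only: `WordCountSketch`, `processBatch`, and `countOf` are hypothetical names, a `HashMap` stands in for the persisted state, and nothing here implements Trident's fault tolerance or exactly-once guarantees.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Plain-Java model of "group by word, then count"; hypothetical class, no Storm dependency.
class WordCountSketch {
    // Stands in for the persisted state: word -> running count.
    private final Map<String, Long> counts = new HashMap<>();

    // Processes one batch of sentences: split, group by word, add to the stored counts.
    void processBatch(List<String> sentences) {
        Map<String, Long> batchCounts = new HashMap<>();
        for (String sentence : sentences) {
            for (String word : sentence.split(" ")) {
                batchCounts.merge(word, 1L, Long::sum); // count within this batch
            }
        }
        // Fold the per-batch partial counts into the stored totals.
        for (Map.Entry<String, Long> entry : batchCounts.entrySet()) {
            counts.merge(entry.getKey(), entry.getValue(), Long::sum);
        }
    }

    // Returns the stored count for a word, or 0 if the word has not been seen.
    long countOf(String word) {
        return counts.getOrDefault(word, 0L);
    }
}
```

Counting per batch first and then updating the stored totals once per word mirrors the batch-oriented processing described earlier: the state is touched once per distinct word in a batch rather than once per tuple.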
