SparkStreaming參數配置

阿新 • • 發佈：2017-06-07

property .org intern collect data cell level order ret

Property Name	Default	Meaning
spark.streaming.backpressure.enabled	false	Enables or disables Spark Streaming‘s internal backpressure mechanism (since 1.5). This enables the Spark Streaming to control the receiving rate based on the current batch scheduling delays and processing times so that the system receives only as fast as the system can process. Internally, this dynamically sets the maximum receiving rate of receivers. This rate is upper bounded by the values spark.streaming.receiver.maxRate andspark.streaming.kafka.maxRatePerPartition if they are set (see below).
spark.streaming.backpressure.initialRate	not set	This is the initial maximum receiving rate at which each receiver will receive data for the first batch when the backpressure mechanism is enabled.
spark.streaming.blockInterval	200ms	Interval at which data received by Spark Streaming receivers is chunked into blocks of data before storing them in Spark. Minimum recommended - 50 ms. See the performance tuningsection in the Spark Streaming programing guide for more details.
spark.streaming.receiver.maxRate	not set	Maximum rate (number of records per second) at which each receiver will receive data. Effectively, each stream will consume at most this number of records per second. Setting this configuration to 0 or a negative number will put no limit on the rate. See the deployment guide in the Spark Streaming programing guide for mode details.
spark.streaming.receiver.writeAheadLog.enable	false	Enable write ahead logs for receivers. All the input data received through receivers will be saved to write ahead logs that will allow it to be recovered after driver failures. See the deployment guidein the Spark Streaming programing guide for more details.
spark.streaming.unpersist	true	Force RDDs generated and persisted by Spark Streaming to be automatically unpersisted from Spark‘s memory. The raw input data received by Spark Streaming is also automatically cleared. Setting this to false will allow the raw data and persisted RDDs to be accessible outside the streaming application as they will not be cleared automatically. But it comes at the cost of higher memory usage in Spark.
spark.streaming.stopGracefullyOnShutdown	false	If true, Spark shuts down the StreamingContext gracefully on JVM shutdown rather than immediately.
spark.streaming.kafka.maxRatePerPartition	not set	Maximum rate (number of records per second) at which data will be read from each Kafka partition when using the new Kafka direct stream API. See the Kafka Integration guide for more details.
spark.streaming.kafka.maxRetries	1	Maximum number of consecutive retries the driver will make in order to find the latest offsets on the leader of each partition (a default value of 1 means that the driver will make a maximum of 2 attempts). Only applies to the new Kafka direct stream API.
spark.streaming.ui.retainedBatches	1000	How many batches the Spark Streaming UI and status APIs remember before garbage collecting.
spark.streaming.driver.writeAheadLog.closeFileAfterWrite	false	Whether to close the file after writing a write ahead log record on the driver. Set this to ‘true‘ when you want to use S3 (or any file system that does not support flushing) for the metadata WAL on the driver.
spark.streaming.receiver.writeAheadLog.closeFileAfterWrite	false	Whether to close the file after writing a write ahead log record on the receivers. Set this to ‘true‘ when you want to use S3 (or any file system that does not support flushing) for the data WAL on the receivers.

SparkStreaming參數配置

property .org intern collect data cell level order ret Property Name Default Meaning spark.streaming.backpressure.enabled f

Jvm參數配置

line 同時存在 sport 先後不想 bsp port src res 一、非穩態選項使用說明 -XX:+<option> 啟用option -XX:-<option> 不啟用option -XX:<option>=<num

kafka參數配置詳解

kafka 參數 broker.idbroker的唯一標識符，如果不配置則自動生成，建議配置且一定要保證集群中必須唯一，默認-1log.dir日誌數據存放的目錄，默認/tmp/kafka-logslog.dirs日誌數據存放的目錄，如果沒有配置則使用log.dir，建議此項配置。zookeeper.c

redis參數配置

strong max 大小 gre limit 說明 prot moni 調度 redis.conf配置文件配置項值說明 slave-read-only yes slave是否只讀 slave-serve-stale-data

MySQL性能優化-內存參數配置

性能問題體系 .net 協議配置 sort odbc image 分配內存　　Mysql對於內存的使用，可以分為兩類，一類是我們無法通過配置參數來配置的，如Mysql服務器運行、解析、查詢以及內部管理所消耗的內存；另一類如緩沖池所用的內存等。　　Mysql內存參數的

Spark 性能相關參數配置具體解釋－任務調度篇

div 設置宋體速度意義期望簡單的取數據全局作者：劉旭暉 Raymond 轉載請註明出處Email：colorant at 163.comBLOG：http://blog.csdn.net/colorant/隨著Spark的逐漸成熟完好, 越來越多的可配置

logback 常用參數配置詳解

dna 則表達式 manual 歸檔文件 caller 執行 fig lua stdout logback 常用配置詳解（二） <appender> <appender>： <appender>是<configuratio

nginx響應高並發參數配置

i/o ntp openbsd mac 本地 body fas 參考 time-wait 一、一般來說nginx 配置文件中對優化比較有作用的為以下幾項： 1. worker_processes 8; nginx 進程數，建議按照cpu 數目來指定，一般為它的倍數 (如,

IntellIJ IDEA 啟動參數配置

啟動參數 pat ble upa div bsp 型號 openjdk -s 系統環境：型號名稱： MacBook Pro型號標識符： MacBookPro11,4處理器名稱： Intel Core i7處理器速度： 2.8 GHz處理器數目： 1核總數： 4L2 緩存

【轉載】Spark學習——spark中的幾個概念的理解及參數配置

program submit man 聯眾 tail 進行 orb 數據源 work 首先是一張Spark的部署圖：節點類型有： 1. master 節點：常駐master進程，負責管理全部worker節點。2. worker 節點：常駐worker進程，負責管理

Nginx筆記02-nginx常用參數配置說明

win 事件驅動出現有客 display 只需要 byte 系統資源 spa nginx的主配置文件是nginx.conf，這裏主要針對這個文件進行說明 1.主配置文件nginx.conf 2.nginx配置文件的結構從上面的配置文件中我們可以總結出nginx

Tomcat學習總結（13）—— Tomcat常用參數配置說明

標簽 cat -xms windows ssi 端口配置 cto 出現 tomcat 1、修改端口號 Tomcat端口配置在server.xml文件的Connector標簽中，默認為8080，可根據實際情況修改。修改端口號 2、解決URL中文參數亂碼在server.x

JVM常用參數配置---摘自《深入理解java虛擬機》《Java性能權威指南》

blog jvm log msi onsize regions rms 使用常用 //常見配置匯總 //堆設置 -Xms:初始堆大小 -Xmx:最大堆大小 -XX:NewSize=n:設置新生代大小 -XX:NewRatio=n:設置新生代和老年代的比值.

bootstrap-table的一些參數配置

選擇條紋 loader num 匹配 left ins side repo bootstrap-table的一些配置參數[html] view plain copy$(‘#reportTable‘).bootstrapTable({ method: ‘post‘,

一個性能較好的JVM參數配置

大小 xms mx2 一段 ava 使用依然 java se end 一個性能較好的web服務器jvm參數配置： -server//服務器模式-Xmx2g //JVM最大允許分配的堆內存，按需分配-Xms2g //JVM初始分配的堆內存，一般和Xmx配置成一樣以避免每次g

數據庫常用授權和授權回收參數配置

mariadb標題索引官方幫助常用案例官方幫助在使用數據庫時必不可少的即是查看help幫助，通過help幫助再次尋找常用命令及參數，如下為help grant信息：MariaDB [(none)]> help grant; Name: 'GRANT' Description: S

轉 Spark參數配置

nds 字符 aps view lock .py rac 解碼器 req 原文地址：http://blog.csdn.net/qq_32252917/article/details/78529916 下面是整理的Spark中的一些配置參數，官方文檔請參考Spark Conf

swing開發一個修改項目數據庫連接參數配置文件

swa amr store creat next() iterator msg move queue 我們在開發web項目中，經常有properties配置文件配置數據庫連接參數，每次修改的時候還要去找到配置文件，感覺有點麻煩，就用swing做了個小工具修改參數，運行界面如

jquery datatables 的常見參數配置

time ucc call 支持 with mob XML cti pan 1:導入包： URL：http://www.datatables.net/ 分別導入css和js文件 Html代碼 <</span>style typ

Yarn 內存分配管理機制及相關參數配置

系統如果 ast nod java類其中指定 XML sam 上一篇hive on tez 任務報錯中提到了containter內存不足，現對yarn 內存分配管理進行介紹一、相關配置情況關於Yarn內存分配與管理，主要涉及到了ResourceManage、Ap

SparkStreaming參數配置

相關推薦