Bootstrap Action or Step in an EMR Cluster

Bootstrap actions

As described in Understanding the Cluster Lifecycle, bootstrap actions are the first thing to run after an Amazon EMR cluster transitions from the STARTING state to the BOOTSTRAPPING state. Bootstrap actions are scripts that run on all cluster nodes. They run as the Hadoop user by default, but they can also run with root privileges by using the sudo command. You can specify up to 16 bootstrap actions per cluster by providing multiple bootstrap-action parameters from the console, the AWS Command Line Interface (AWS CLI), or the API.
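As a rough illustration of supplying several bootstrap actions through the API, the sketch below builds the list in Python and enforces the 16-action limit mentioned above. The dictionary layout follows the BootstrapActions parameter of the EMR RunJobFlow request; the bucket name, script names, and the helper function itself are hypothetical, and nothing here actually calls AWS.

```python
MAX_BOOTSTRAP_ACTIONS = 16  # per-cluster limit described in the text above


def build_bootstrap_actions(scripts):
    """Turn (name, s3_path, args) tuples into a BootstrapActions list.

    The layout mirrors the BootstrapActions parameter of RunJobFlow.
    The helper name and the script locations are illustrative only.
    """
    if len(scripts) > MAX_BOOTSTRAP_ACTIONS:
        raise ValueError(
            f"{len(scripts)} bootstrap actions given, "
            f"limit is {MAX_BOOTSTRAP_ACTIONS}"
        )
    return [
        {
            "Name": name,
            "ScriptBootstrapAction": {"Path": path, "Args": list(args)},
        }
        for name, path, args in scripts
    ]


# Hypothetical scripts stored in a hypothetical bucket.
actions = build_bootstrap_actions([
    ("Install extra packages",
     "s3://example-bucket/install-packages.sh", ["--with-numpy"]),
    ("Tune OS settings", "s3://example-bucket/tune-os.sh", []),
])
```

The resulting list could then be passed as the BootstrapActions argument when the cluster is created, in the order the scripts should run.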

Bootstrap actions can be used to install additional software on your cluster and can be configured to run commands conditionally based on instance-specific values in the instance.json or job-flow.json file. Because bootstrap actions execute before core services such as Hadoop or Spark are installed, the cluster won't start if a bootstrap action fails.
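The conditional behavior described above could look roughly like the following inside a bootstrap script written in Python. It assumes that instance.json lives at /mnt/var/lib/info/instance.json on the node and exposes an isMaster flag; check both for your release. The command names are placeholders, not real tools.

```python
import json

# Assumed location of instance.json on an EMR node; verify for your release.
INSTANCE_INFO_PATH = "/mnt/var/lib/info/instance.json"


def is_master_node(path=INSTANCE_INFO_PATH):
    """Return True if the instance metadata marks this node as the master."""
    with open(path) as f:
        info = json.load(f)
    return bool(info.get("isMaster", False))


def commands_for_node(is_master):
    """Choose which (hypothetical) setup commands this node should run."""
    common = ["install-monitoring-agent"]    # placeholder, every node
    master_only = ["install-notebook-server"]  # placeholder, master only
    return (common + master_only) if is_master else common
```

On a node, the script would call commands_for_node(is_master_node()) and execute the result. Any command that exits with a non-zero status should be allowed to fail the script, since a failed bootstrap action is what prevents a misconfigured cluster from starting.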

Note: On AMI versions 2.x and 3.x of Amazon EMR, bootstrap actions execute after core services such as Hadoop or Spark are installed. Most predefined bootstrap actions for Amazon EMR AMI versions 2.x and 3.x aren't supported in Amazon EMR releases 4.x. For more information, see Create Bootstrap Actions to Install Additional Software.


Steps

A step is a distinct unit of work, comprising one or more Hadoop jobs that run only on the master node of an Amazon EMR cluster. Because a cluster does not start if a bootstrap action fails, steps always start after bootstrap actions have completed. Steps are usually used to transfer or process data. One step might submit work to a cluster, and others might process the submitted data and then send the processed data to a particular location. Steps complete their work sequentially, as depicted in the diagram at Running Steps to Process Data. When configuring a step, you can choose what happens after a step fails, which provides a measure of fault tolerance. For more information about creating steps, see Work with Steps Using the AWS CLI and Console.
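To make the sequential behavior and the failure options more concrete, here is a small sketch. build_step produces a dictionary shaped like one entry of the Steps parameter in the EMR API, using command-runner.jar (available in release 4.x and later) as the launcher. simulate is a purely local, simplified model of how the ActionOnFailure setting affects later steps; it is not how EMR is implemented, and the step names and commands are made up.

```python
VALID_ACTIONS = {"TERMINATE_CLUSTER", "CANCEL_AND_WAIT", "CONTINUE"}


def build_step(name, args, action_on_failure="CONTINUE"):
    """Build one step definition in the shape used by the EMR Steps parameter."""
    if action_on_failure not in VALID_ACTIONS:
        raise ValueError(f"unknown ActionOnFailure: {action_on_failure}")
    return {
        "Name": name,
        "ActionOnFailure": action_on_failure,
        "HadoopJarStep": {"Jar": "command-runner.jar", "Args": list(args)},
    }


def simulate(steps, failing):
    """Walk the steps in order and report a final state for each.

    `failing` is the set of step names assumed to fail. A failure with
    CONTINUE lets later steps run; any other setting stops the queue,
    so the remaining steps are reported as CANCELLED.
    """
    results = []
    stopped = False
    for step in steps:
        name = step["Name"]
        if stopped:
            results.append((name, "CANCELLED"))
        elif name in failing:
            results.append((name, "FAILED"))
            if step["ActionOnFailure"] != "CONTINUE":
                stopped = True
        else:
            results.append((name, "COMPLETED"))
    return results


# Hypothetical three-step pipeline: copy in, transform, export.
steps = [
    build_step("copy-input", ["s3-dist-cp", "--src", "s3://example-bucket/in",
                              "--dest", "hdfs:///in"]),
    build_step("transform", ["spark-submit", "s3://example-bucket/job.py"],
               action_on_failure="CANCEL_AND_WAIT"),
    build_step("export", ["s3-dist-cp", "--src", "hdfs:///out",
                          "--dest", "s3://example-bucket/out"]),
]
outcome = simulate(steps, failing={"transform"})
```

In this model, a failure in the transform step leaves the export step cancelled, whereas a failure in a step marked CONTINUE would let the following steps run.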

