Build Hadoop 2.7 from source on Centos step by step

阿新 • • 發佈：2018-12-29

Hadoop is one of the best open source for store and processing big data. It has a lot of supports from community and many big companies have used it for their products. In my company, Hadoop ecosystem have used to store message chat and information log, it is very effective but it required many resources server as ram, cpu and disk. If your product is small system you should consider using it.

Ok let start find answer for question “How to build hadoop from source ?”

Step 1 : The fist you should disable Firewall local sed -i 's/SELINUX=enforcing/SELINUX=disabled/' /etc/selinux/config

Step 2 : Download JDK and setup environment tar -xzf jdk-8u45-linux-x64.tar.gz -C /opt/

Step 3 : Create user and group “hadoop” as user run service groupadd hadgroup useradd haduser -G hadgroup passwd haduser

Step 4 : Create ssh-key for authentication between servers ssh-keygen

Step 5 : Install tool development and library yum groupinstall "Development Tools" "Development Libraries" yum install openssl-devel cmake

Step 6 : Install maven to build Hadoop (source) tar -zxf apache-maven-3.3.9-bin.tar.gz -C /opt/

Step 7 : Setup maven environment export JAVA_HOME=/opt/jdk1.8.0_45 export M3_HOME=/opt/apache-maven-3.3.9 export PATH=/opt/apache-maven-3.3.9/bin:$PATH

Step 8 : Build Protobuf (source) tar -xzf protobuf-2.5.0.tar.gz -C /root ./configure make make install sudo ldconfig

Step 9 : Download source and build Hadoop (source) tar -xvf hadoop-2.7.1-src.tar.gz cd hadoop-2.7.1-src mvn package -Pdist,native -DskipTests -Dtar -Dmaven.javadoc.skip=true -Dmaven.javadoc.failOnError=false

Step 10 : Move build to new folder mv hadoop-2.7.0-src/hadoop-dist/target/hadoop-2.7.0 /opt/

Done, and now you have Hadoop was built at path /opt/hadoop-2.7.0 In the next post, i will write how to setup hadoop as cluster. Thank you!

Like this:

Like Loading...

Build Hadoop 2.7 from source on Centos step by step

Like this:

Related

Build Hadoop 2.7 from source on Centos step by step

How to install Hadoop 2.7.3 cluster on CentOS 7.3

原始碼安裝l CUDA 10.0, cuDNN 7.3 and build TensorFlow (GPU) from source on Ubuntu 18.04

Idea+Centos+hadoop-2.7.3源碼環境搭建

CentOS 6.5 hadoop 2.7.3 叢集環境搭建

在 CentOS 7.2 下安裝 Hadoop 2.7.5 並搭建偽分散式環境的方法

【轉載】Hadoop 2.7.3 和Hbase 1.2.4安裝教程

hadoop 2.7.3基本操作

Ububtu 14.04 安裝 Hadoop 2.7.3

Hadoop 2.7.3 完全分布式部署

Linux鞏固記錄（3） hadoop 2.7.4 環境搭建

Linux鞏固記錄（5） hadoop 2.7.4下自己編譯代碼並運行MapReduce程序

Hadoop 2.7.4 + HBase 1.2.6 + ZooKeeper 3.4.10

Hadoop 2.7.3 分布式集群安裝

超詳細 Hadoop 2.7.4 Installation Procedure

Hadoop-2.7.5完全分布式搭建

hadoop 2.7.7 安裝（測試環境部署） hadoop2.x部署

hadoop-2.7.6 完全分散式的安裝

2、docker安裝 on centos 6.8(64bit)

Hadoop-2.7.3 HA高可用搭建

Build Hadoop 2.7 from source on Centos step by step

Like this:

Related

相關推薦