当前位置:首页 > 服务端 > Hadoop-CDH5.7.0 for CentOS7

Hadoop-CDH5.7.0 for CentOS7

一、需求

系统

CentOS 7 最小化安装

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

JDK环境

JDK版本:1.8.0_91 jdk-8u91-linux-x64.rpm

下载地址:http://www.oracle.com/technetwork/java/javase/downloads/jdk8-downloads-2133151.html

CM包

CM版本:5.7.0 cloudera-manager-centos7-cm5.7.0_x86_64.tar.gz

下载地址:http://archive.cloudera.com/cm5/cm/5/cloudera-manager-centos7-cm5.7.0_x86_64.tar.gz

CDH包

CDH版本:5.7.0

CDH-5.7.0-1.cdh5.7.0.p0.45-el7.parcel CDH-5.7.0-1.cdh5.7.0.p0.45-el7.parcel.sha1 manifest.json

下载地址

http://archive.cloudera.com/cdh5/parcels/5.7/CDH-5.7.1-1.cdh5.7.1.p0.11-el7.parcel

http://archive.cloudera.com/cdh5/parcels/5.7/CDH-5.7.1-1.cdh5.7.1.p0.11-el7.parcel.sha1

http://archive.cloudera.com/cdh5/parcels/5.7/manifest.json

java-mysql连接jar包

下载地址:http://central.maven.org/maven2/mysql/mysql-connector-java/6.0.2/mysql-connector-java-6.0.2.jar

数据库

类型:MariaDB

版本:5.5.47-MariaDB MariaDB Server

节点

主机名

IP地址

说明

bigdata01.tfpay.com

172.17.28.1

master datanode namenode

bigdata02.tfpay.com

172.17.28.2

datanode

bigdata03.tfpay.com

172.17.28.3

datanode

bigdata04.tfpay.com

172.17.28.4

datanode

节点的数量可以根据自己的需求变更,也可以在CDH安装完成后,在管理界面自己拓展添加

二、底层环境准备

注:准备安装前需要把相关的包文件下载下来,并上传到服务器上

1、安装JDK

所有节点

1)安装系统原有java

# yum -y install java
# java -version
openjdk version "1.8.0_91"
OpenJDK Runtime Environment (build 1.8.0_91-b14)
OpenJDK 64-Bit Server VM (build 25.91-b14, mixed mode)
# rpm -qa|grep java
javapackages-tools-3.4.1-11.el7.noarch
python-javapackages-3.4.1-11.el7.noarch
java-1.8.0-openjdk-headless-1.8.0.91-0.b14.el7_2.x86_64
java-1.8.0-openjdk-1.8.0.91-0.b14.el7_2.x86_64
tzdata-java-2016d-1.el7.noarch

2)卸载OpenJDK

# rpm -e --nodeps java-1.8.0-openjdk-headless-1.8.0.91-0.b14.el7_2.x86_64
# rpm -e --nodeps java-1.8.0-openjdk-1.8.0.91-0.b14.el7_2.x86_64
3)安装下载的java

# rpm -ivh jdk-8u91-linux-x64.rpm 
Preparing...                          ################################# [100%]
Updating / installing...
   1:jdk1.8.0_91-2000:1.8.0_91-fcs    ################################# [100%]
Unpacking JAR files...
    tools.jar...
    plugin.jar...
    javaws.jar...
    deploy.jar...
    rt.jar...
    jsse.jar...
    charsets.jar...
    localedata.jar...
    jfxrt.jar...
# java -version
java version "1.8.0_91"
Java(TM) SE Runtime Environment (build 1.8.0_91-b14)
Java HotSpot(TM) 64-Bit Server VM (build 25.91-b14, mixed mode)
# javac -version
javac 1.8.0_91

2、更改主机名和hosts

所有节点

注意每个主机名对应的IP,按实际情况更改。

# echo "bigdata0*.tfpay.com" > /etc/hostname
# echo "172.17.28.1    bigdata01.tfpay.com    bigdata01" >> /etc/hosts
# echo "172.17.28.2    bigdata02.tfpay.com    bigdata02" >> /etc/hosts
# echo "172.17.28.3    bigdata03.tfpay.com    bigdata03" >> /etc/hosts
# echo "172.17.28.4    bigdata04.tfpay.com    bigdata04" >> /etc/hosts

3、关闭防火墙

所有节点

# systemctl stop firewalld
# systemctl disable firewalld
Removed symlink /etc/systemd/system/dbus-org.fedoraproject.FirewallD1.service.
Removed symlink /etc/systemd/system/basic.target.wants/firewalld.service.
# systemctl status firewalld
 firewalld.service - firewalld - dynamic firewall daemon
   Loaded: loaded (/usr/lib/systemd/system/firewalld.service; disabled; vendor preset: enabled)
   Active: inactive (dead)

Jun 20 16:05:43 bigdata05.tfpay.com systemd[1]: Starting firewalld - dynamic firewall daemon...
Jun 20 16:05:45 bigdata05.tfpay.com systemd[1]: Started firewalld - dynamic firewall daemon.
Jun 20 16:17:51 bigdata05.tfpay.com systemd[1]: Stopping firewalld - dynamic firewall daemon...
Jun 20 16:17:52 bigdata05.tfpay.com systemd[1]: Stopped firewalld - dynamic firewall daemon.
# sed -i "s/SELINUX=enforcing/SELINUX=disabled/" /etc/selinux/config
# reboot

重启服务器使hostname和selinux的配置生效

# getenforce
Disabled
# sestatus -v
SELinux status:                 disabled

4、安装NTP服务

所有节点

# yum -y install ntp

更改配置文件/etc/ntp.conf

在master节点namenode节点

注释掉原有的server指向把时间服务器的域名地址指向一个靠谱可连的时间服务器,我选择的是ntp.api.bz.

# vi /etc/ntp.conf
    ……
     #server 0.centos.pool.ntp.org iburst
    #server 1.centos.pool.ntp.org iburst
    #server 2.centos.pool.ntp.org iburst
    #server 3.centos.pool.ntp.org iburst
    server ntp.api.gz iburst
    ......

在其余的节点上,把时间服务器的server指向master namenode节点

# vi /etc/ntp.conf
     ……
     #server 0.centos.pool.ntp.org iburst
    #server 1.centos.pool.ntp.org iburst
    #server 2.centos.pool.ntp.org iburst
    #server 3.centos.pool.ntp.org iburst
    server 172.17.28.1 iburst
    ......

所有节点

# systemctl start ntpd
# systemctl enable ntpd
Created symlink from /etc/systemd/system/multi-user.target.wants/ntpd.service to /usr/lib/systemd/system/ntpd.service.
5、SSH无密码登陆

配置master namenode节点无密码登陆到其他节点

在master namenode节点

# ssh-keygen -t rsa
Generating public/private rsa key pair.
Enter file in which to save the key (/root/.ssh/id_rsa): 回车
Enter passphrase (empty for no passphrase): 回车
Enter same passphrase again: 回车
Your identification has been saved in /root/.ssh/id_rsa.
Your public key has been saved in /root/.ssh/id_rsa.pub.
The key fingerprint is:
bc:14:e2:59:42:1f:21:80:eb:49:d8:3b:7c:8d:b7:f9 root@bigdata05.tfpay.com
The key's randomart image is:
+--[ RSA 2048]----+
|   ...o o.       |
|  .  . o .       |
| o .  o +        |
|. +  . * .       |
| + o oo S        |
|  * o o. .       |
|   o . o.        |
|      o          |
|       .E        |
+-----------------+
# ssh-copy-id 172.17.28.2
# ssh-copy-id 172.17.28.3
# ssh-copy-id 172.17.28.4

6、安装数据库

master namenode节点

# yum -y install mariadb-server
# vi /etc/my.cnf
    [mysqld]
    datadir=/var/lib/mysql
    socket=/var/lib/mysql/mysql.sock
    symbolic-links=0
    character-set-server=utf8     \\添加的行
    [mysqld_safe]
    log-error=/var/log/mariadb/mariadb.log
    pid-file=/var/run/mariadb/mariadb.pid
    !includedir /etc/my.cnf.d
# systemctl start mariadb 
# systemctl enable mariadb 
ln -s '/usr/lib/systemd/system/mariadb.service' '/etc/systemd/system/multi-user.target.wants/mariadb.service'
# mysql_secure_installation 
/usr/bin/mysql_secure_installation: line 379: find_mysql_client: command not found
NOTE: RUNNING ALL PARTS OF THIS SCRIPT IS RECOMMENDED FOR ALL MariaDB
      SERVERS IN PRODUCTION USE!  PLEASE READ EACH STEP CAREFULLY!
In order to log into MariaDB to secure it, we'll need the current
password for the root user.  If you've just installed MariaDB, and
you haven't set the root password yet, the password will be blank,
so you should just press enter here.
Enter current password for root (enter for none): 
OK, successfully used password, moving on...
Setting the root password ensures that nobody can log into the MariaDB
root user without the proper authorisation.
Set root password? [Y/n] y          \\设置数据库root密码
New password:                       \\设置密码
Re-enter new password:              \\再次输入密码
Password updated successfully!
Reloading privilege tables..
 ... Success!
By default, a MariaDB installation has an anonymous user, allowing anyone
to log into MariaDB without having to have a user account created for
them.  This is intended only for testing, and to make the installation
go a bit smoother.  You should remove them before moving into a
production environment.
Remove anonymous users? [Y/n] y  \\移除匿名用户
 ... Success!
Normally, root should only be allowed to connect from 'localhost'.  This
ensures that someone cannot guess at the root password from the network.
Disallow root login remotely? [Y/n] y  \\禁止root用户远程登录
 ... Success!
By default, MariaDB comes with a database named 'test' that anyone can
access.  This is also intended only for testing, and should be removed
before moving into a production environment.
Remove test database and access to it? [Y/n] y \\删除测试数据库
 - Dropping test database...
 ... Success!
 - Removing privileges on test database...
 ... Success!
Reloading the privilege tables will ensure that all changes made so far
will take effect immediately.
Reload privilege tables now? [Y/n]    //
 ... Success!
Cleaning up...
All done!  If you've completed all of the above steps, your MariaDB
installation should now be secure.
Thanks for using MariaDB!

数据库只是安装了一下,并没有做一些详细的配置。如果有需求可以另行配置。

7、其他安装与配置

所有节点

# yum  -y  install psmisc MySQL-python at bc bind-libs bind-utils cups-client cups-libs cyrus-sasl-gssapi cyrus-sasl-plain ed fuse fuse-libs httpd httpd-tools keyutils-libs-devel krb5-devel libcom_err-devel libselinux-devel libsepol-devel libverto-devel mailcap noarch mailx mod_ssl openssl-devel pcre-devel postgresql-libs python-psycopg2 redhat-lsb-core redhat-lsb-submod-security  x86_64 spax time zlib-devel
# chmod +x /etc/rc.d/rc.local
# echo "echo 0 > /proc/sys/vm/swappiness" >>/etc/rc.d/rc.local
# echo "echo never > /sys/kernel/mm/transparent_hugepage/defrag" >>/etc/rc.d/rc.local
# echo 0 > /proc/sys/vm/swappiness
# echo never > /sys/kernel/mm/transparent_hugepage/defrag
# yum -y install rpcbind
# systemctl start rpcbind
# echo "systemctl start rpcbind" >> /etc/rc.d/rc.local

三、开始安装CDH

1、传宝、解包

把下载好的一下相应包文件上传到master namenode节点内

mysql-java连接包在所有节点上都要上传

# ls
CDH-5.7.0-1.cdh5.7.0.p0.45-el7.parcel       cloudera-manager-centos7-cm5.7.0_x86_64.tar.gz  manifest.json
CDH-5.7.0-1.cdh5.7.0.p0.45-el7.parcel.sha1     jdk-8u91-linux-x64.rpm        mysql-connector-java-6.0.2.jar
# tar zxvf cloudera-manager-centos7-cm5.7.0_x86_64.tar.gz -C /opt/
# ls /opt/
cloudera  cm-5.7.0

2、创建数据库

# mysql -uroot -p
Enter password:          \\输入数据库密码
Welcome to the MariaDB monitor.  Commands end with ; or \g.
Your MariaDB connection id is 10
Server version: 5.5.47-MariaDB MariaDB Server

Copyright (c) 2000, 2015, Oracle, MariaDB Corporation Ab and others.

Type 'help;' or '\h' for help. Type '\c' to clear the current input statement.
MariaDB [(none)]> create database hive DEFAULT CHARSET utf8 COLLATE utf8_general_ci;
Query OK, 1 row affected (0.00 sec)

MariaDB [(none)]> create database amon DEFAULT CHARSET utf8 COLLATE utf8_general_ci;
Query OK, 1 row affected (0.00 sec)

MariaDB [(none)]> create database hue DEFAULT CHARSET utf8 COLLATE utf8_general_ci;
Query OK, 1 row affected (0.00 sec)

MariaDB [(none)]> create database monitor DEFAULT CHARSET utf8 COLLATE utf8_general_ci;
Query OK, 1 row affected (0.00 sec)

MariaDB [(none)]> create database oozie DEFAULT CHARSET utf8 COLLATE utf8_general_ci;
Query OK, 1 row affected (0.00 sec)

MariaDB [(none)]> grant all on *.* to root@"%" Identified by "1234.com";
Query OK, 0 rows affected (0.00 sec)

MariaDB [(none)]> exit
# /opt/cm-5.7.0/share/cmf/schema/scm_prepare_database.sh mysql cm -hlocalhost -uroot -p1234.com --scm-host localhost scm scm scm
JAVA_HOME=/usr/java/jdk1.8.0_91
Verifying that we can write to /opt/cm-5.7.0/etc/cloudera-scm-server
Creating SCM configuration file in /opt/cm-5.7.0/etc/cloudera-scm-server
Executing:  /usr/java/jdk1.8.0_91/bin/java -cp /usr/share/java/mysql-connector-java.jar:/usr/share/java/oracle-connector-java.jar:/opt/cm-5.7.0/share/cmf/schema/../lib/* com.cloudera.enterprise.dbutil.DbCommandExecutor /opt/cm-5.7.0/etc/cloudera-scm-server/db.properties com.cloudera.cmf.db.
2016-06-17 13:32:20,562 [main] INFO  com.cloudera.enterprise.dbutil.DbCommandExecutor  - Successfully connected to database.
All done, your SCM database is configured correctly!

3、创建用户

所有节点

#useradd --system --home=/opt/cm-5.7.0/run/cloudera-scm-server  --shell=/bin/false --comment "Cloudera SCM User" cloudera-scm

4、制作本地源

master datanote节点执行

# ls
CDH-5.7.0-1.cdh5.7.0.p0.45-el7.parcel       cloudera-manager-centos7-cm5.7.0_x86_64.tar.gz  manifest.json
CDH-5.7.0-1.cdh5.7.0.p0.45-el7.parcel.sha1     jdk-8u91-linux-x64.rpm        mysql-connector-java-6.0.2.jar
# cp CDH-5.7.0-1.cdh5.7.0.p0.45-el7.parcel CDH-5.7.0-1.cdh5.7.0.p0.45-el7.parcel.sha1 manifest.json /opt/cloudera/parcel-repo/
# mv CDH-5.7.0-1.cdh5.7.0.p0.45-el7.parcel.sha1 CDH-5.7.0-1.cdh5.7.0.p0.45-el7.parcel.sha

5、拷贝jar包

所有节点

# ls
CDH-5.7.0-1.cdh5.7.0.p0.45-el7.parcel       cloudera-manager-centos7-cm5.7.0_x86_64.tar.gz  manifest.json
CDH-5.7.0-1.cdh5.7.0.p0.45-el7.parcel.sha1     jdk-8u91-linux-x64.rpm        mysql-connector-java-6.0.2.jar
# cp mysql-connector-java-6.0.2.jar /usr/share/java/mysql-connector-java.jar

master namenode节点

# cp mysql-connector-java-6.0.2.jar /opt/cloudera/parcels/CDH-5.7.0-1.cdh5.7.0.p0.45/lib/hive/lib/
# cp mysql-connector-java-6.0.2.jar /opt/cm-5.7.0/share/cmf/lib/

6、更改配置

# sed -i "s/server_host=localhost/server_host=172.17.28.1/" /opt/cm-5.7.0/etc/cloudera-scm-agent/config.ini
# mkdir /opt/cm-5.7.0/run/cloudera-scm-agent

7、启动服务

master namenode节点

# /opt/cm-5.7.0/etc/init.d/cloudera-scm-server start

除了master namenode节点

# scp -r 172.17.28.1:/opt/cm-5.7.0 /opt

所有节点

# /opt/cm-5.7.0/etc/init.d/cloudera-scm-agent start

8、浏览页面开始安装

1)登陆

打开浏览器,访问maser namenode节点的7180端口

输入用户名和密码admin、admin

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

2)接受协议

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

3)选择版本

本文档选择Cloudera Express免费开源版本

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

4)指定主机

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

5)选择CDH版本

注:选择在制作本地源时选择的版本,如果选择别的版本的话会去官网下载那样安装速度会很慢

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

6)检查主机正确性

确保全部验证通过

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

7)选择安装的服务

本文选择的是安装所有服务

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

8)角色分配

这里应该根据自己的实际需求和服务器的硬件参数自己选择

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

9)数据库设置

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

10)群集设置

可以根据自己的实际情况更改设置,本文采用默认配置。这些配置可以在集群安装完成以后后续更改

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

11)开始安装

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

无错误信息全部通过

12)安装完成

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

第一次安装完成后,会出现一些配置的警告信息。这些可以根据提示信息更改。

四、错误总结

1、pstree: command not found

错误展示

# /opt/cm-5.7.0/etc/init.d/cloudera-scm-server start
/opt/cm-5.7.0/etc/init.d/cloudera-scm-server: line 109: pstree: command not found
Starting cloudera-scm-server:                              [FAILED]
# /opt/cm-5.7.0/etc/init.d/cloudera-scm-agent start
/opt/cm-5.7.0/etc/init.d/cloudera-scm-agent: line 108: pstree: command not found
Starting cloudera-scm-agent:                               [FAILED]

原因

因为系统是最小化安装,默认没有安装

解决方法

# yum -y install psmisc

2、cloudera-scm-agent start 启动失败

错误展示

# /opt/cm-5.7.0/etc/init.d/cloudera-scm-agent start
Starting cloudera-scm-agent:                               [FAILED]

查询日志

# cd /opt/cm-5.7.0/log/cloudera-scm-agent/
# ls
cloudera-scm-agent.out
# cat cloudera-scm-agent.out 
[22/Jun/2016 11:02:09 +0000] 3695 MainThread agent        INFO     SCM Agent Version: 5.7.0
Unable to create the pidfile.

原因

经过查找后发现/opt/cm-5.7.0/run目录下没有cloudera-scm-agent文件夹

# cd /opt/cm-5.7.0/run/
# ls
cloudera-scm-server

解决方法

# mkdir /opt/cm-5.7.0/run/cloudera-scm-agent

3、install: invalid user ‘cloudera-scm’

错误展示

# /opt/cm-5.7.0/etc/init.d/cloudera-scm-server start
install: invalid user ‘cloudera-scm’

原因

没有创建运行server的用户

解决方法

#useradd --system --home=/opt/cm-5.7.0/run/cloudera-scm-server  --shell=/bin/false --comment "Cloudera SCM User" cloudera-scm

4、检查主机正确性警告

警告

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

解决方法

# echo 0 > /proc/sys/vm/swappiness
# echo never > /sys/kernel/mm/transparent_hugepage/defrag
# echo "echo 0 > /proc/sys/vm/swappiness" >>/etc/rc.d/rc.local
# echo "echo never > /sys/kernel/mm/transparent_hugepage/defrag" >>/etc/rc.d/rc.local

5、启动Hive失败

错误展示

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

原因

根据提示发现是没有mysql-java连接的驱动jar包

解决方法

# ls
CDH-5.7.0-1.cdh5.7.0.p0.45-el7.parcel       cloudera-manager-centos7-cm5.7.0_x86_64.tar.gz  manifest.json
CDH-5.7.0-1.cdh5.7.0.p0.45-el7.parcel.sha1     jdk-8u91-linux-x64.rpm        mysql-connector-java-6.0.2.jar
# cp mysql-connector-java-6.0.2.jar /opt/cloudera/parcels/CDH-5.7.0-1.cdh5.7.0.p0.45/lib/hive/lib/

6、启动NFS Gateway失败

错误展示

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

原因

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

解决方法

# yum -y install rpcbind
# systemctl start rpcbind
# echo "systemctl start rpcbind" >> /etc/rc.d/rc.local 
# chmod +x /etc/rc.d/rc.local

7、启动Oozie,impala失败

错误展示

Hadoop-CDH5.7.0 for CentOS7 _ JavaClub全栈架构师技术笔记

原因

没有mysql-java连接jar包

解决方法

# ls
CDH-5.7.0-1.cdh5.7.0.p0.45-el7.parcel       cloudera-manager-centos7-cm5.7.0_x86_64.tar.gz  manifest.json
CDH-5.7.0-1.cdh5.7.0.p0.45-el7.parcel.sha1     jdk-8u91-linux-x64.rpm        mysql-connector-java-6.0.2.jar
# cp mysql-connector-java-6.0.2.jar /usr/share/java/mysql-connector-java.jar

作者:付建功
来源链接:https://www.cnblogs.com/fujiangong/p/5620050.html

版权声明:
1、Java侠(https://www.javaxia.com)以学习交流为目的,由作者投稿、网友推荐和小编整理收藏优秀的IT技术及相关内容,包括但不限于文字、图片、音频、视频、软件、程序等,其均来自互联网,本站不享有版权,版权归原作者所有。

2、本站提供的内容仅用于个人学习、研究或欣赏,以及其他非商业性或非盈利性用途,但同时应遵守著作权法及其他相关法律的规定,不得侵犯相关权利人及本网站的合法权利。
3、本网站内容原作者如不愿意在本网站刊登内容,请及时通知本站(javaclubcn@163.com),我们将第一时间核实后及时予以删除。





本文链接:https://www.javaxia.com/server/124967.html

分享给朋友: