hadoop 集群安装
单机hadoop
集群部署,在单机版基础上进行的,我们需要配置4个机器master,node1,node2,node3
hadoop单机部署 教程 步骤
配置IP
安装完后,需要给4台服务器配置固定IP
教程:VirtualBox最小安装cetnos7后配置固定IP
然后要修改主机名和dns
# 4个依次为master node1 node2 node3
vim /etc/hostname
master
vim /etc/hosts
127.0.0.1 localhost localhost.localdomain localhost4 localhost4.localdomain4
::1 localhost localhost.localdomain localhost6 localhost6.localdomain6
192.168.56.201 master
192.168.56.202 node1
192.168.56.203 node2
192.168.56.204 node3
免密登录
修改完成后,需要配置ssh免登陆
ssh免密登录教程
# 生成秘钥
ssh-keygen -t dsa -P '' -f ~/.ssh/id_dsa
将 id_dsa id_dsa.pub 两个文件复制到 master node1 node2 node3主机上
复制地址 ~/.ssh/ 这个文件夹下
然后都执行此操作
cat ~/.ssh/id_dsa.pub >> ~/.ssh/authorized_keys
基本配置验证
ssh master
ssh node1
ssh node2
ssh node3
配置集群
vim /usr/local/hadoop/etc/hadoop/core-site.xml
修改 master:9000
<configuration>
<property>
<name>fs.default.name</name>
<value>hdfs://master:9000</value>
</property>
</configuration
vim /usr/local/hadoop/etc/hadoop/yarn-site.xml
新增
<property>
<name>yarn.resourcemanager.resource-tracker.address</name>
<value>master:8025</value>
</property>
<property>
<name>yarn.resourcemanager.scheduler.address</name>
<value>master:8030</value>
</property>
<property>
<name>yarn.resourcemanager.address</name>
<value>master:8050</value>
</property>
解释:
resourcemanager 与 nodemanager 连接地址 8025
resourcemanager 与 applicationmaster 连接地址 8030
resourcemanager 与 客户端连接地址 8050
vim /usr/local/hadoop/etc/hadoop/mapred-site.xml
修改为
<property>
<name>mapred.job.tracker</name>
<value>master:54311</value>
</property>
# 除了master是namenode,其它都是datanode
vim /usr/local/hadoop/etc/hadoop/hdfs-site.xml
<property>
<name>dfs.replication</name>
<value>3</value>
</property>
<property>
<name>dfs.namenode.name.dir</name>
<value>file:/usr/local/hadoop/hadoop_data/hdfs/datanode</value>
</property>
master 编辑slaves
vim /usr/local/hadoop/etc/hadoop/slaves
node1
node2
node3
node1 node2 node3操作
rm -rf /usr/local/hadoop/hadoop_data/hdfs
mkdir -p /usr/local/hadoop/hadoop_data/hdfs/datanode
# root 可省略给权限的步骤
# sudo chown -R hduser:hduser /usr/local/hadoop
master操作
rm -rf /usr/local/hadoop/hadoop_data/hdfs
mkdir -p /usr/local/hadoop/hadoop_data/hdfs/namenode
sudo chown -R hduser:hduser /usr/local/hadoop
master 启动集群
# 格式化hdfs
hadoop namenode -format
# 启动
start-dfs.sh 和 start-YARN.sh
或
start-all.sh
网页访问验证
# http://master:8088/
http://192.168.56.201:8088
http://192.168.56.201:50070
集群停止
stop-all.sh
还没有评论,来说两句吧...