hadoop - Cloudera安装疑惑？

我是cloudera的新手，我在我的系统中成功安装了cloudera我有两个疑问，

考虑一台机器的一些节点已经使用 hadoop 处理一些数据，我们可以安装 Cloudera 以使用现有的 Hadoop 而不对现有 hadoop 存储的数据进行任何更改或修改。
我在我的机器上安装了 Cloudera，我还有另外三台机器可以将它们添加为集群，我想知道，在将这些机器添加为集群之前，我是否要在这三台机器上安装 cloudera？，或者我们可以添加一个节点作为集群而不在那个特定节点上安装 cloudera？。

在此先感谢任何人，请提供有关上述问题的一些信息。

最佳答案

回答问题-

1。如果您想从现有的 Apache 发行版迁移到 CDH，您可以 follow this link

摘录:

Overview

The migration process does require a moderate understanding of Linux system administration. You should make a plan before you start. You will be restarting some critical services such as the name node and job tracker, so some downtime is necessary. Given the value of the data on your cluster, you’ll also want to be careful to take recent back ups of any mission-critical data sets as well as the name node meta-data.

Backing up your data is most important if you’re upgrading from a version of Hadoop based on an Apache Software Foundation release earlier than 0.20.

2。需要在所有节点中安装和配置 CDH 二进制文件，以启动并运行基于 CDH 的集群。

关于hadoop - Cloudera安装疑惑？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/17824319/

hadoop - Cloudera安装疑惑？

上一篇：java - 在映射器中写入自定义对象时出错

下一篇：bash - 使用部分文件名添加为字段/列