scala - 从源代码构建 Apache Spark 2.1.0 失败

标签 scala hadoop apache-spark

我正在尝试构建 Apache Spark 2.1.0 源代码,但下面出现的这些错误令我感到困惑...

Hadoop 2.8.0 已安装并正在运行 在执行 Spark 安装之前安装了 Scala 2.12.1(这似乎会自动安装 Scala 2.11.8?!?)

我的构建线是:

build/mvn -Pyarn -Phadoop-2.7 -Dhadoop.version=2.7.0 -DskipTests clean package

有人知道我为什么得到:

user@server:/usr/local/share/spark/spark-2.1.0$ sudo /usr/local/share/spark/spark-2.1.0/build/mvn -Pyarn -Phadoop-2.7 -Dhadoop.version=2.7.0 -DskipTests clean package
[sudo] password for user:
exec: curl --progress-bar -L https://downloads.typesafe.com/zinc/0.3.9/zinc-0.3.9.tgz
######################################################################## 100.0%
exec: curl --progress-bar -L https://downloads.typesafe.com/scala/2.11.8/scala-2.11.8.tgz
######################################################################## 100.0%
exec: curl --progress-bar -L https://www.apache.org/dyn/closer.lua?action=download&filename=/maven/maven-3/3.3.9/binaries/apache-maven-3.3.9-bin.tar.gz
######################################################################## 100.0%
Using `mvn` from path: /usr/local/share/spark/spark-2.1.0/build/apache-maven-3.3.9/bin/mvn
Java HotSpot(TM) 64-Bit Server VM warning: ignoring option MaxPermSize=512M; support was removed in 8.0
[INFO] Scanning for projects...
Downloading: https://repo1.maven.org/maven2/org/apache/apache/14/apache-14.pom
[ERROR] [ERROR] Some problems were encountered while processing the POMs:
[FATAL] Non-resolvable parent POM for org.apache.spark:spark-parent_2.11:2.1.0: Could not transfer artifact org.apache:apache:pom:14 from/to central (https://repo1.maven.org/maven2): repo1.maven.org: Name or service not known and 'parent.relativePath' points at wrong local POM @ line 22, column 11
 @
[ERROR] The build could not read 1 project -> [Help 1]
[ERROR]
[ERROR]   The project org.apache.spark:spark-parent_2.11:2.1.0 (/usr/local/share/spark/spark-2.1.0/pom.xml) has 1 error
[ERROR]     Non-resolvable parent POM for org.apache.spark:spark-parent_2.11:2.1.0: Could not transfer artifact org.apache:apache:pom:14 from/to central (https://repo1.maven.org/maven2): repo1.maven.org: Name or service not known and 'parent.relativePath' points at wrong local POM @ line 22, column 11: Unknown host repo1.maven.org: Name or service not known -> [Help 2]
[ERROR]
[ERROR] To see the full stack trace of the errors, re-run Maven with the -e switch.
[ERROR] Re-run Maven using the -X switch to enable full debug logging.
[ERROR]
[ERROR] For more information about the errors and possible solutions, please read the following articles:
[ERROR] [Help 1] http://cwiki.apache.org/confluence/display/MAVEN/ProjectBuildingException
[ERROR] [Help 2] http://cwiki.apache.org/confluence/display/MAVEN/UnresolvableModelException

我测试了手动下载(以查看是否是导致错误的原因),下载没有问题:

https://repo1.maven.org/maven2/org/apache/apache/14/apache-14.pom

我还测试了访问下面的 URL,它也显示了内容:

https://repo1.maven.org/maven2

希望聪明的人知道如何解决这个...

最佳答案

我发现了问题所在:

我必须在目录中的 SETTINGS.XML 中配置我们的代理设置:

/usr/local/share/spark/spark-2.1.0/build/apache-maven-3.3.9/conf

编辑文件后,构建没有任何问题:)

希望这可以帮助遇到同样问题的其他人...

编辑:需要特别说明的是,仅在 bash 中具有有效的代理配置不足以使 Maven 构建成功。我能够手动从 bash 下载所有文件,但 Maven 还需要 SETTINGS.XML 文件中的代理配置...

关于scala - 从源代码构建 Apache Spark 2.1.0 失败,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/43127684/

相关文章:

generics - 使用泛型的Scala异常签名定义

scala - akka 以哪种方式实时?

hadoop - mapred.job.reduce.markreset.buffer.percent 的含义

bash - Hadoop-2.6 中 Map Reduce 作业的总时间计算

scala - 具有条件计数的 Pivot scala 数据框

scala - 哪些是模板上默认可用的隐式对象?

scala - 所有参数均为默认值的方法与不带参数的方法不同

hadoop - 提交 hadoop-streaming 作业 : yarn or hadoop?

apache-spark - 带有 --files 参数错误的 PySpark spark-submit 命令

apache-spark - 来自 Kafka 源的 Spark Streaming 返回检查点或倒带