hadoop - 如何使用Pig访问远程计算机上的HDFS?

标签 hadoop apache-pig hdfs

在远程计算机(example.host.com)上安装了Hadoop。 Pig也已安装在该计算机上。

如何从其他计算机访问该计算机上的hdfs?

我不想从远程复制文件。我只想对以具有架构的avro格式存储的文件运行查询。

我在本地计算机上安装了Pig,并将以下行添加到pig.properties文件

fs.default.name=hdfs://example.host.com:8020
mapred.job.tracker=example.host.com:8021

但是当我启动Pig时,出现以下错误
2013-02-15 12:35:26,534 [main] INFO  org.apache.pig.Main - Apache Pig version 0.10.1-SNAPSHOT (rexported) compiled Feb 14 2013, 17:55:12
2013-02-15 12:35:26,535 [main] INFO  org.apache.pig.Main - Logging error messages to: /log/path/pig_1360911926530.log
2013-02-15 12:35:26,818 [main] INFO  org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - Connecting to hadoop file system at: hdfs://example.host.com:8020
2013-02-15 12:35:26.907 java[2346:1c03] Unable to load realm info from SCDynamicStore
2013-02-15 12:35:27,574 [main] ERROR org.apache.pig.Main - ERROR 2999: Unexpected internal error. Failed to create DataStorage
Details at logfile: /log/path/pig_1360911926530.log

并且在日志文件“/log/path/pig_1360911926530.log”的内容中
Error before Pig is launched
----------------------------
ERROR 2999: Unexpected internal error. Failed to create DataStorage

java.lang.RuntimeException: Failed to create DataStorage
        at org.apache.pig.backend.hadoop.datastorage.HDataStorage.init(HDataStorage.java:75)
        at org.apache.pig.backend.hadoop.datastorage.HDataStorage.<init>(HDataStorage.java:58)
        at org.apache.pig.backend.hadoop.executionengine.HExecutionEngine.init(HExecutionEngine.java:205)
        at org.apache.pig.backend.hadoop.executionengine.HExecutionEngine.init(HExecutionEngine.java:118)
        at org.apache.pig.impl.PigContext.connect(PigContext.java:208)
        at org.apache.pig.PigServer.<init>(PigServer.java:246)
        at org.apache.pig.PigServer.<init>(PigServer.java:231)
        at org.apache.pig.tools.grunt.Grunt.<init>(Grunt.java:47)
        at org.apache.pig.Main.run(Main.java:487)
        at org.apache.pig.Main.main(Main.java:111)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:601)
        at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
Caused by: org.apache.hadoop.ipc.RPC$VersionMismatch: Protocol org.apache.hadoop.hdfs.protocol.ClientProtocol version mismatch. (client = 61, server = 63)
        at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:401)
        at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:379)
        at org.apache.hadoop.hdfs.DFSClient.createRPCNamenode(DFSClient.java:119)
        at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:238)
        at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:203)
        at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:89)
        at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:1386)
        at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:66)
        at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:1404)
        at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:254)
        at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:123)
        at org.apache.pig.backend.hadoop.datastorage.HDataStorage.init(HDataStorage.java:72)
        ... 14 more
================================================================================

最佳答案

如您所见,您得到的异常是版本不匹配。您确定XXX具有$ HADOOP_HOME初始化的类路径吗?看起来好像指向错误的 jar 。

关于hadoop - 如何使用Pig访问远程计算机上的HDFS?,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/14870862/

相关文章:

hadoop - 有关hadoop联合的同步问题

apache-pig - 如何用 Pig Latin 编写带有 WHERE 子句的左外连接?

hadoop - 将数据/文件从 Windows 复制到 Linux 机器或 HDFS

Hadoop fsck 显示缺少副本

hadoop - 使用 Parquet 文件作为存储在 Hive 中创建外部表后获取 NULL

java - 将大型 gzip 数据文件上传到 HDFS

hadoop - Hive CLI无法启动

hadoop - SAP HANA 智能数据访问与 SAP HANA Vora 之间的区别

hadoop - 通过 Oozie 进行 Map-Reduce

java - Pig UDF 在生成新元组时抛出 NullPointerException