hadoop - 来自 Eclipse 的字数统计

标签 hadoop eclipse-plugin hdfs word-count hadoop-plugins

我正在使用 hadoop 的 eclipse 插件。我可以通过创建一个 hadoop 服务器来查看 HDFS 中的所有文件,但是当我尝试从 eclipse 运行 wordcount.java 文件时,它给了我异常,而从终端运行却很顺利。异常(exception)情况如下。

2/11/14 04:09:06 WARN util.NativeCodeLoader: Unable to load native-hadoop library for  your platform... using builtin-java classes where applicable
12/11/14 04:09:06 WARN mapred.JobClient: No job jar file set.  User classes may not be     found. See JobConf(Class) or JobConf#setJar(String).
12/11/14 04:09:06 WARN snappy.LoadSnappy: Snappy native library not loaded
12/11/14 04:09:06 INFO mapred.JobClient: Cleaning up the staging area file:/tmp/hadoop-hduser/mapred/staging/hduser1728681403/.staging/job_local_0001
12/11/14 04:09:06 ERROR security.UserGroupInformation: PriviledgedActionException as:hduser cause:org.apache.hadoop.mapred.InvalidInputException: Input path does not exist: file:/user/hduser/gutenberg
Exception in thread "main" org.apache.hadoop.mapred.InvalidInputException: Input path does not exist: file:/user/hduser/gutenberg
    at org.apache.hadoop.mapred.FileInputFormat.listStatus(FileInputFormat.java:197)
    at org.apache.hadoop.mapred.FileInputFormat.getSplits(FileInputFormat.java:208)
    at org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.java:989)
    at org.apache.hadoop.mapred.JobClient.writeSplits(JobClient.java:981)
    at org.apache.hadoop.mapred.JobClient.access$600(JobClient.java:174)
    at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:897)
    at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:850)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:396)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121)
    at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:850)
    at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:824)
    at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1261)
    at WordCount.run(WordCount.java:149)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
    at WordCount.main(WordCount.java:155)

最佳答案

我将从调查这个开始:

ERROR security.UserGroupInformation: PriviledgedActionException as:hduser cause:org.apache.hadoop.mapred.InvalidInputException: Input path does not exist: file:/user/hduser/gutenberg

看来是这个问题造成的。您确定这是正确的道路吗?如果是这样,您可能没有访问它的权限。稍后我会尝试尽可能多地消除 WARN。

关于hadoop - 来自 Eclipse 的字数统计,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/13378932/

相关文章:

hadoop - HDFS 在哪里存储它的文件

hadoop:任务不是本地文件?

java - MapReduce - 不调用 Reduce

file - 使用 mapreduce 为不同的输入文件在不同的文件中写入输出

Ubuntu 中的 Hadoop Eclipse 插件错误

java - 获取 Eclipse WebSphere 运行时库

java - 有没有办法打开展开的 JFace ElementTreeSelectionDialog?

java - 无法使用 Eclipse (Neon) 配置 SonarQube

hadoop - HBase WAL文件和HDFS数据暂存

java - 从 HDFS 读取属性文件