hadoop - 无法将数据加载到 Pig 中的 Hortonworks 沙箱

标签 hadoop apache-pig hortonworks-data-platform

嗨,我是 hadoop 的新手,当我第一次运行这个命令时 LOAD 'Pig/iris.csv' using PigStorage (',') 弹出错误:

LOAD 'Pig/iris.csv' using PigStorage (',');
2014-09-05 06:04:04,853 [main] INFO org.apache.pig.Main - Apache Pig version 0.12.1.2.1.1.0-385 (rexported) compiled Apr 16 2014, 15:59:00
2014-09-05 06:04:04,885 [main] INFO org.apache.pig.Main - Logging error messages to: /dev/null
2014-09-05 06:04:07,077 [main] INFO org.apache.pig.impl.util.Utils - Default bootup file /usr/lib/hue/.pigbootup not found
2014-09-05 06:04:14,699 [main] INFO org.apache.hadoop.conf.Configuration.deprecation - mapred.job.tracker is deprecated. Instead, use mapreduce.jobtracker.address
2014-09-05 06:04:14,699 [main] INFO org.apache.hadoop.conf.Configuration.deprecation - fs.default.name is deprecated. Instead, use fs.defaultFS
2014-09-05 06:04:14,699 [main] INFO org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - Connecting to hadoop file system at: hdfs://sandbox.hortonworks.com:8020

2014-09-05 06:05:11,826 [main] INFO org.apache.hadoop.conf.Configuration.deprecation - fs.default.name is deprecated. Instead, use fs.defaultFS
grunt> LOAD 'Pig/iris.csv' using PigStorage (',');
2014-09-05 06:05:13,203 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1000: Error during parsing. Encountered " <IDENTIFIER> "LOAD "" at line 1, column 1.
Was expecting one of:
<EOF>
"cat" ...
"clear" ...
"fs" ...
"sh" ...
"cd" ...
"cp" ...
"copyFromLocal" ...
"copyToLocal" ...
"dump" ...
"\\d" ...
"describe" ...
"\\de" ...
"aliases" ...
"explain" ...
"\\e" ...
"help" ...
"history" ...
"kill" ...
"ls" ...
"mv" ...
"mkdir" ...
"pwd" ...
"quit" ...
"\\q" ...
"register" ...
"rm" ...
"rmf" ...
"set" ...
"illustrate" ...
"\\i" ...
"run" ...
"exec" ...
"scriptDone" ...
"" ...
"" ...
<EOL> ...
";" ...

Details at logfile: /dev/null

有谁知道如何解决这个问题?

最佳答案

LOAD 创建关系。您需要将其分配给一个变量,以便稍后可以对其进行处理:

L = LOAD 'Pig/iris.csv' using PigStorage (',');

DUMP L;

关于hadoop - 无法将数据加载到 Pig 中的 Hortonworks 沙箱,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/25686934/

相关文章:

hadoop - 从 Hive 中拆分数组的末尾进行评估

java - Hadoop 反序列化不适用于列表

hadoop - 如果是,则在 pig 中输入文件名

hadoop - 聚合的 pig 拉丁逻辑测试

hadoop - Windows 平台上是否有 Hortonwork Data 平台的管理器

hadoop - 如何估计 Hortonworks Hadoop 集群上的 spark 执行器数量?

java - 如何使用 Java 客户端 API 连接到 Hortonworks 沙箱 Hbase

apache-spark - HDFS与带有YARN的HDFS的对比,如果我使用spark,可以放置新的资源管理吗?

hadoop - 如何清除 HBase UI 中的死区服务器?

hadoop - pig 0.13 错误 2998 : Unhandled internal error. org/apache/hadoop/mapreduce/task/JobContextImpl