eclipse - 将数据加载到表配置单元

标签 eclipse hadoop jdbc hive

我正在尝试配置 Hive 以使用 JDBC,我在 eclipse 上使用了这个示例:

public class HiveJdbcClient {
    private static String driverName = "org.apache.hadoop.hive.jdbc.HiveDriver";
    /**
    * @param args
    * @throws SQLException
    **/
    public static void main(String[] args) throws SQLException {
        try {
            Class.forName(driverName);
        } catch (ClassNotFoundException e){
            // TODO Auto-generated catch block
            e.printStackTrace();
            System.exit(1);
        }
        Connection con = DriverManager.getConnection("jdbc:hive://localhost:10000/default", "", "");
        Statement stmt = con.createStatement();
        String tableName = "testHiveDriverTable";
        stmt.executeQuery("drop table " + tableName);
        ResultSet res = stmt.executeQuery("create table " + tableName + " (key int, value string)");

        // show tables
        String sql = "show tables '" + tableName + "'";
        System.out.println("Running: " + sql);
        res = stmt.executeQuery(sql);
        if (res.next()) {
            System.out.println(res.getString(1));
        }

        // describe table
        sql = "describe " + tableName;
        System.out.println("Running: " + sql);  
        res = stmt.executeQuery(sql);
        while (res.next()) {
            System.out.println(res.getString(1) + "\t" + res.getString(2));
        }

        // load data into table
        // NOTE: filepath has to be local to the hive server
        // NOTE: /tmp/test_hive_server.txt is a ctrl-A separated file with two fields per line
        String filepath = "/tmp/test_hive_server.txt";
        sql = "load data local inpath '" + filepath + "' into table " + tableName;
        System.out.println("Running: " + sql);
        res = stmt.executeQuery(sql);
        // select * query
        sql = "select * from " + tableName;
        System.out.println("Running: " + sql);
        res = stmt.executeQuery(sql);
        while (res.next()){
            System.out.println(String.valueOf(res.getInt(1)) + "\t" + res.getString(2));
        }
        // regular hive query
        sql = "select count(1) from " + tableName;
        System.out.println("Running: " + sql);
        res = stmt.executeQuery(sql);
        while (res.next()){
            System.out.println(res.getString(1));
        }
    }
}

我能够在配置单元上创建表,但是当我尝试将数据加载到表中时出现错误。所以我的问题是我应该将什么放入“test_hive_server.txt”以使其工作!因为我什么都试过了,每次都犯同样的错误。 谢谢!

错误:

Exception in thread "main" java.sql.SQLException: org.apache.thrift.TApplicationException: Internal error processing execute
    at org.apache.hadoop.hive.jdbc.HiveStatement.executeQuery(HiveStatement.java:196)
    at com.palmyra.nosql.HiveJdbcClient.main(HiveJdbcClient.java:52)

最佳答案

实际上我自己找到了答案:

ROW FORMAT DELIMITED FIELDS TERMINATED BY ' ';

确定我们要加载到表中的“file.txt”中行的格式,Hive 的默认记录和字段分隔符列表是:

\n

^A

^B

^C

press ^V^A could insert a ^A in Vim.

或者你可以这样做: create table tableName (key int, value string) ROW FORMAT DELIMITED FIELDS TERMINATED BY ' ';

'file.txt' 应该是这样的:

value1 value2
value3 value4

在这个例子中value1和value3代表key 而value4和value5代表值

关于eclipse - 将数据加载到表配置单元,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/27783098/

相关文章:

java - 从 Java Web 服务的操作创建 MySql 数据库

java - 我可以将多页编辑器添加到多页编辑器吗

hadoop - 在现有的 Hortonworks HDP 集群中安装 Spark 1.5

java - Glassfish 4.1 + Hibernate 5.2 连接

java - 在 Mapper 中检索当前行的文件名

bash - 如何存储/*url* 的实际名称?

google-app-engine - 为什么我不断收到 "No suitable driver found"错误?

Android 开发 : Keytool, 创建 keystore ?

eclipse - 在 Eclipse 中,如何选择垂直连续的文本框,就像在 Notepad++ 中使用 alt+drag 一样?

java - Eclipse 中的 javaw 命令无效?