arrays - Customizing TwoDArrayWritable in Hadoop and unable to iterate over it in the reducer

Tags: arrays class hadoop mapreduce writable

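I am trying to emit two 2D double arrays as the value from a mapper. Hadoop provides TwoDArrayWritable, which takes one 2D array as input. To fit my use case, I tried editing TwoDArrayWritable so that it takes two 2D arrays as input: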
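import java.io.DataInput;
import java.io.DataOutput;
import java.io.IOException;
import java.lang.reflect.Array;

import org.apache.hadoop.io.DoubleWritable;
import org.apache.hadoop.io.Writable;

/**
 * A Writable for two 2D arrays, each containing a matrix of instances of a class.
 */
public class MyTwoDArrayWritable implements Writable {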


    private Class valueClass;
    private Writable[][] values;


    private Class valueClass1;
    private Writable[][] values1;


    public MyTwoDArrayWritable(Class valueClass,Class valueClass1) {
        this.valueClass = valueClass;
        this.valueClass1 = valueClass1;
    }

    public MyTwoDArrayWritable(Class valueClass, DoubleWritable[][] values,Class valueClass1, DoubleWritable[][] values1) {
        this(valueClass, valueClass1);
        this.values = values;
        this.values1 = values1;
    }

    public Object toArray() {
        int dimensions[] = {values.length, 0};
        Object result = Array.newInstance(valueClass, dimensions);
        for (int i = 0; i < values.length; i++) {
            Object resultRow = Array.newInstance(valueClass, values[i].length);
            Array.set(result, i, resultRow);
            for (int j = 0; j < values[i].length; j++) {
                Array.set(resultRow, j, values[i][j]);
            }
        }
        return result;
    }



    /**
     * @return the valueClass
     */
    public Class getValueClass() {
        return valueClass;
    }

    /**
     * @param valueClass the valueClass to set
     */
    public void setValueClass(Class valueClass) {
        this.valueClass = valueClass;
    }

    /**
     * @return the values
     */
    public Writable[][] getValues() {
        return values;
    }

    /**
     * @param values the values to set
     * @param values1 the values1 to set
     */
    public void setValues(DoubleWritable[][] values,DoubleWritable[][] values1) {
        this.values = values;
        this.values1 = values1;
    }

    /**
     * @return the valueClass1
     */
    public Class getValueClass1() {
        return valueClass1;
    }

    /**
     * @param valueClass1 the valueClass1 to set
     */
    public void setValueClass1(Class valueClass1) {
        this.valueClass1 = valueClass1;
    }

    /**
     * @return the values1
     */
    public Writable[][] getValues1() {
        return values1;
    }


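    public void readFields(DataInput in) throws IOException {
        // read both matrices in the order write() emits them
        values = readMatrix(in, valueClass);
        values1 = readMatrix(in, valueClass1);
    }

    public void write(DataOutput out) throws IOException {
        // write both matrices, values first
        writeMatrix(out, values);
        writeMatrix(out, values1);
    }

    private static Writable[][] readMatrix(DataInput in, Class elementClass) throws IOException {
        // construct matrix: row count, then the length of each row
        Writable[][] matrix = new Writable[in.readInt()][];
        for (int i = 0; i < matrix.length; i++) {
            matrix[i] = new Writable[in.readInt()];
        }

        // construct values
        for (int i = 0; i < matrix.length; i++) {
            for (int j = 0; j < matrix[i].length; j++) {
                Writable value;                             // construct value
                try {
                    value = (Writable) elementClass.newInstance();
                } catch (InstantiationException e) {
                    throw new RuntimeException(e.toString());
                } catch (IllegalAccessException e) {
                    throw new RuntimeException(e.toString());
                }
                value.readFields(in);                       // read a value
                matrix[i][j] = value;                       // store it in the matrix
            }
        }
        return matrix;
    }

    private static void writeMatrix(DataOutput out, Writable[][] matrix) throws IOException {
        out.writeInt(matrix.length);                 // write row count
        for (int i = 0; i < matrix.length; i++) {
            out.writeInt(matrix[i].length);          // write row lengths
        }
        for (int i = 0; i < matrix.length; i++) {
            for (int j = 0; j < matrix[i].length; j++) {
                matrix[i][j].write(out);             // write values
            }
        }
    }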


}

and emitted two 2D double arrays from the mapper:

MyTwoDArrayWritable array = new MyTwoDArrayWritable (DoubleWritable.class,DoubleWritable.class);
DoubleWritable[][] myInnerArray = new DoubleWritable[EtransEkey.length][EtransEkey[0].length];
DoubleWritable[][] myInnerArray1 = new DoubleWritable[EtransDevalue.length][EtransDevalue[0].length];
// set values in myInnerArray
for (int k1 = 0; k1 < EtransEkey.length; k1++) {
    for (int j1 = 0; j1 < EtransEkey[0].length; j1++) {
        myInnerArray[k1][j1] = new DoubleWritable(EtransEkey[k1][j1]);
    }
}

// set values in myInnerArray1
for (int k1 = 0; k1 < EtransDevalue.length; k1++) {
    for (int j1 = 0; j1 < EtransDevalue[0].length; j1++) {
        myInnerArray1[k1][j1] = new DoubleWritable(EtransDevalue[k1][j1]);
    }
}

array.set(myInnerArray,myInnerArray1); 

The line array.set(myInnerArray,myInnerArray1); reports this error:

/*
 * The method set(DoubleWritable[][], DoubleWritable[][]) is undefined for the type MyTwoDArrayWritable
 */

Edit: How can I iterate over these values in the Reducer to get back the myInnerArray and myInnerArray1 matrices?

What I have done so far is:

for (MyTwoDArrayWritable c : values) {
    System.out.println(c.getValues());
    DoubleWritable[][] myInnerArray = new DoubleWritable[KdimRow][KdimCol];
    for (int k1 = 0; k1 < KdimRow; k1++) {
        for (int j1 = 0; j1 < KdimCol; j1++) {
            myInnerArray[k1][j1] = new DoubleWritable();
        }
    }
}

But how do I store them back into double arrays?

Best answer

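You have not defined a set method in MyTwoDArrayWritable, which is why that error is shown. Instead of calling array.set, use the method you have already defined, which does exactly what you need: setValues. So replace

array.set(myInnerArray,myInnerArray1);

with

array.setValues(myInnerArray,myInnerArray1);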
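
For the reducer to receive both matrices, write and readFields have to serialize both of them, in the same order. The following is a minimal, Hadoop-free sketch of that round trip using only java.io and plain double[][]. The names TwoMatrixCodec, writeMatrix, readMatrix, encode and decode are hypothetical and are not part of Hadoop or of the question's code. It assumes the same layout as the write method in the question: row count, then row lengths, then cells.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;
import java.util.Arrays;

/** Hypothetical helper: length-prefixed encoding of two 2D double arrays. */
class TwoMatrixCodec {

    /** Writes the row count, then every row length, then every cell. */
    static void writeMatrix(DataOutput out, double[][] m) throws IOException {
        out.writeInt(m.length);
        for (double[] row : m) {
            out.writeInt(row.length);
        }
        for (double[] row : m) {
            for (double cell : row) {
                out.writeDouble(cell);
            }
        }
    }

    /** Reads one matrix in the layout produced by writeMatrix. */
    static double[][] readMatrix(DataInput in) throws IOException {
        double[][] m = new double[in.readInt()][];
        for (int i = 0; i < m.length; i++) {
            m[i] = new double[in.readInt()];
        }
        for (int i = 0; i < m.length; i++) {
            for (int j = 0; j < m[i].length; j++) {
                m[i][j] = in.readDouble();
            }
        }
        return m;
    }

    /** Serializes both matrices back to back, as a write(DataOutput) body would. */
    static byte[] encode(double[][] first, double[][] second) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        DataOutputStream out = new DataOutputStream(bytes);
        writeMatrix(out, first);
        writeMatrix(out, second);
        out.flush();
        return bytes.toByteArray();
    }

    /** Reads both matrices in the same order, as a readFields(DataInput) body would. */
    static double[][][] decode(byte[] data) throws IOException {
        DataInputStream in = new DataInputStream(new ByteArrayInputStream(data));
        double[][] first = readMatrix(in);
        double[][] second = readMatrix(in);
        return new double[][][] {first, second};
    }

    public static void main(String[] args) throws IOException {
        double[][] a = {{1.0, 2.0}, {3.0, 4.0}};
        double[][] b = {{5.0}, {6.0}, {7.0}};
        double[][][] back = decode(encode(a, b));
        System.out.println(Arrays.deepToString(back[0]));
        System.out.println(Arrays.deepToString(back[1]));
    }
}
```

In MyTwoDArrayWritable the cells are Writable instances rather than primitives, so inside the reducer loop each cell can be unwrapped with ((DoubleWritable) c.getValues()[i][j]).get(), and likewise through getValues1(), to fill a double[][]. Hadoop instantiates value classes reflectively when deserializing, so the class would most likely also need a no-argument constructor that sets both value classes to DoubleWritable.class.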

Regarding "arrays - Customizing TwoDArrayWritable in Hadoop and unable to iterate over it in the reducer", a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/24904782/
