OrientDB ETL 边缘转换器 2 joinFieldName(s)

标签 orientdb etl

通过一个 joinFieldName 和查找,边缘转换器可以完美工作。然而,现在需要两个键,即查找中的复合索引。如何指定两个joinFieldName?

这是脚本化(后处理)版本: 创建边缘从(从 MC 中选择,其中sample=1 和 mkey=6)扩展到(从Event 中选择,其中sample=1 和 mcl=6)

这可以工作,但不适合生产。

有人可以帮忙吗?

最佳答案

您可以简单地添加 2 个 joinFieldName(s),例如

{ "edge": { "class": "Conn",
                "joinFieldName": "b1",
                "lookup": "A.a1",
                "joinFieldName": "b2",
                "lookup": "A.a2",
                "direction": "out"
            }}

请参阅下面我的测试数据:

json1.json

{
  "source": { "file": { "path": "/home/ivan/Scrivania/cose/etl/stak39517796/data1.csv" } },
  "extractor": { "csv": {} },
  "transformers": [
    { "vertex": { "class": "A" } }
  ],
  "loader": {
    "orientdb": {
       "dbURL": "plocal:/home/ivan/OrientDB/db_installati/enterprise/orientdb-enterprise-2.2.10/databases/stack39517796",
       "dbType": "graph",
       "dbAutoCreate": true,
       "classes": [
         {"name": "A", "extends": "V"},
         {"name": "B", "extends": "V"},
         {"name": "Conn", "extends": "E"}
       ]
    }
  }
}

json2.json

{
  "source": { "file": { "path": "/home/ivan/Scrivania/cose/etl/stak39517796/data2.csv" } },
  "extractor": { "csv": {} },
  "transformers": [
    { "vertex": { "class": "B" } },
    { "edge": { "class": "Conn",
                "joinFieldName": "b1",
                "lookup": "A.a1",
                "joinFieldName": "b2",
                "lookup": "A.a2",
                "direction": "out"
            }}
  ],
  "loader": {
    "orientdb": {
       "dbURL": "plocal:/home/ivan/OrientDB/db_installati/enterprise/orientdb-enterprise-2.2.10/databases/stack39517796",
       "dbType": "graph",
       "dbAutoCreate": true,
       "classes": [
         {"name": "A", "extends": "V"},
         {"name": "B", "extends": "V"},
         {"name": "Conn", "extends": "E"}
       ]
    }
  }
}

数据1.csv

a1,a2
1,1
1,2
2,3

数据2.csv

b1,b2
1,1
2,3
1,2

执行顺序:

  1. json1
  2. json2

这是最终结果:

orientdb {db=stack39517796}> select from v                                        

+----+-----+------+----+----+-------+----+----+--------+
|#   |@RID |@CLASS|a1  |a2  |in_Conn|b2  |b1  |out_Conn|
+----+-----+------+----+----+-------+----+----+--------+
|0   |#17:0|A     |1   |1   |[#25:0]|    |    |        |
|1   |#18:0|A     |1   |2   |[#27:0]|    |    |        |
|2   |#19:0|A     |2   |3   |[#26:0]|    |    |        |
|3   |#21:0|B     |    |    |       |1   |1   |[#25:0] |
|4   |#22:0|B     |    |    |       |3   |2   |[#26:0] |
|5   |#23:0|B     |    |    |       |2   |1   |[#27:0] |
+----+-----+------+----+----+-------+----+----+--------+

关于OrientDB ETL 边缘转换器 2 joinFieldName(s),我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/39517796/

相关文章:

testing - 如何在ETL过程中进行测试(单元测试)?

java - Pentaho Kettle Kitchen 找不到插件

sql-server - 保持两个不同的数据库同步

python - 相当于 pandas to_numeric() 的 bool 值

sql - 如何在orientdb中添加约束

MySQL - 行到列

javascript - 如何在 Angular 应用程序中使用 OrientDB HTTP API?

OrientDB 使用 shortestPath() 获取边

scale - 如何对 OrientDB 进行分片?

java - OrientDB顶点标签和顶点类的区别