ArangoDB 慢查询

我是 ArangoDB 的新手，在优化我的查询时遇到了问题，希望得到一些帮助。

我在下面提供的查询是我遇到的一个真实示例，在我的开发数据库上为 758.078 毫秒，但在暂存时，使用更大的数据集，它需要 531.511 s.

我还将提供我在开发和暂存中遍历的每个边缘表的大小。非常感谢任何帮助。

for doc in document
filter repo._key == "my-key"
    for v, e, p in 3 any doc edge1, edge2, edge3
    options {uniqueVertices: 'global', bfs: true}
    filter DATE_ISO8601(p.vertices[2].date) > DATE_ISO8601("2017-09-04T00:00:01Z")
        and DATE_ISO8601(p.vertices[2].date) < DATE_ISO8601("2017-09-15T23:59:59Z")
    limit 1
    return {
        commit: p.vertices[2].hash,
        date: p.vertices[2].date,
        message: p.vertices[2].message,
        author: p.vertices[1].email,
        loc: p.vertices[3].stats.additions
    }

开发

edge1: 2,638
edge2:2,560
edge3: 386

暂存

edge1: 5,438,811
edge2:5,544,028
edge3:423,545

最佳答案

查询可能很慢，因为过滤条件

filter 
  DATE_ISO8601(p.vertices[2].date) > DATE_ISO8601("2017-09-04T00:00:01Z"
and 
  DATE_ISO8601(p.vertices[2].date) < DATE_ISO8601("2017-09-15T23:59:59Z")

在遍历期间不应用，但仅在之后应用。这可能是由于过滤条件中的函数调用(对 DATE_ISO8601 )造成的。如果您的日期值存储为数字，您可以检查以下过滤条件是否可以加快查询速度:

filter 
  p.vertices[2].date > DATE_TIMESTAMP("2017-09-04T00:00:01Z"
and 
  p.vertices[2].date < DATE_TIMESTAMP("2017-09-15T23:59:59Z")

修改后的过滤条件应该允许在遍历中提取过滤条件，因此它会更早地执行。

您可以使用 db._explain(<query string goes here>); 验证查询执行计划在 ArangoShell 或 Web 界面的 AQL 编辑器中。

关于ArangoDB 慢查询，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/46898330/

上一篇：string - 使用列表的字符串/元素为 random.sample 提供范围

下一篇：sql - 减去不同记录中的值