postgresql - 为什么 PostgreSQL 没有正确使用索引？

架构:

create table records(
  id         varchar,
  updated_at bigint
);
create index index1 on records (updated_at, id);

查询。它遍历最近更新的记录。获取 10 条记录，记住最后一条，然后获取下 10 条，依此类推。

select * from objects
where updated_at > '1' or (updated_at = '1' and id > 'some-id')
order by updated_at, id
limit 10;

它使用索引，但没有明智地使用它，还应用过滤器并处理大量记录，请参阅下面查询说明中的 Rows Removed by Filter: 31575。

奇怪的是，如果您删除 或 并保留左或右条件 - 它对两者都适用。但是，如果同时使用 或 两个条件，似乎无法弄清楚如何正确应用索引。

Limit  (cost=0.42..19.03 rows=20 width=1336) (actual time=542.475..542.501 rows=20 loops=1)
   ->  Index Scan using index1 on records  (cost=0.42..426791.29 rows=458760 width=1336) (actual time=542.473..542.494 rows=20 loops=1)
         Filter: ((updated_at > '1'::bigint) OR ((updated_at = '1'::bigint) AND ((id)::text > 'some-id'::text)))
         Rows Removed by Filter: 31575
 Planning time: 0.180 ms
 Execution time: 542.532 ms
(6 rows)

Postgres 版本是 9.6

最佳答案

我会把它作为两个单独的查询来尝试，像这样组合它们的结果:

select *
from
  (
    select   *
    from     objects
    where    updated_at > 1
    order by updated_at, id
    limit    10
    union all
    select   *
    from     objects
    where    updated_at = 1
      and    id > 'some-id'
    order by updated_at, id
    limit    10
  ) t
order by updated_at, id
limit    10

我的猜测是，这两个查询都将各自优化得很好，同时运行这两个查询会比当前的查询更有效率。

如果可能的话，我也会让这些列不为空。

关于postgresql - 为什么 PostgreSQL 没有正确使用索引？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/46389163/

postgresql - 为什么 PostgreSQL 没有正确使用索引？

上一篇：django -/api/accounts/relation 不存在编程错误

下一篇：python - 使用 PostgreSQL/SqlAlchemy 选择 ARRAY 的第一项