我在做什么:
使用事实表和日期维度选择每年支付的美元总额。
我所看到的情况:
当查询运行 1 到 4 年的数据时,优化器充分利用并行性和布隆过滤器,查询在 3 分钟内完成。
当查询运行 5 年或以上,或者根本没有日期过滤器时,不使用并行性,并且查询需要 90 分钟才能运行。
我的问题:
为什么向谓词添加第 5 年或删除日期谓词会导致并行性丧失?
我该如何解决这个问题?
事实表:
8000 万行
平行度 = 8
日期键 (INCUR_DTE_S_KEY) 是从 to_char(INCUR_DATE,'yyyymmdd') 派生的数字
表在 INCUR_DTE_S_KEY 上进行范围分区:
分区“MED_2010”值小于 (20100101)
...
分区“MED_2016”值小于 (20160101)
分区“MED_MAXV”值小于 (MAXVALUE)
INCUR_DTE_S_KEY 是分区 BITMAP 索引
日期维度:
平行度 = 1
27,000 行
无分区
DATE_S_KEY 是位图索引
场景:
- 谓词中包含 1 到 4 年的查询。运行 8 个并行进程,应用布隆过滤器,查询在 3 分钟内完成。
- 谓词中包含 5 年或以上的查询。失去所有并行进程,失去布隆过滤器,运行 1 个进程,查询在 90 分钟内完成。
- 不查询谓词。丢失所有并行进程,运行 1 个进程,丢失布隆过滤器,查询在 90 分钟内完成。
- 将所有表上的维度索引更改为唯一,并将并行度增加到 8。现在获得 4 度并行度,但没有布隆过滤器/分区识别。
-- 场景 1
EXPLAIN PLAN FOR
SELECT D."CAL_YEAR" AS "Incur_Year",
SUM(F."PAID") AS "Paid"
FROM "F_PAID" F
INNER JOIN "D_DATE" D
ON F."INCUR_DTE_S_KEY" = D."DATE_S_KEY"
WHERE D."CAL_YEAR" BETWEEN 2011 AND 2015
GROUP BY D."CAL_YEAR" ;
SELECT PLAN_TABLE_OUTPUT FROM TABLE(DBMS_XPLAN.DISPLAY());
Plan hash value: 1889948660
-------------------------------------------------------------------------------------------------------------------------------------------
| Id | Operation | Name | Rows | Bytes | Cost (%CPU)| Time | Pstart| Pstop | TQ |IN-OUT| PQ Distrib |
-------------------------------------------------------------------------------------------------------------------------------------------
| 0 | SELECT STATEMENT | | 5 | 110 | 117K (2)| 00:23:31 | | | | | |
| 1 | PX COORDINATOR | | | | | | | | | | |
| 2 | PX SEND QC (RANDOM) | :TQ10002 | 5 | 110 | 117K (2)| 00:23:31 | | | Q1,02 | P->S | QC (RAND) |
| 3 | HASH GROUP BY | | 5 | 110 | 117K (2)| 00:23:31 | | | Q1,02 | PCWP | |
| 4 | PX RECEIVE | | 5 | 110 | 117K (2)| 00:23:31 | | | Q1,02 | PCWP | |
| 5 | PX SEND HASH | :TQ10001 | 5 | 110 | 117K (2)| 00:23:31 | | | Q1,01 | P->P | HASH |
| 6 | HASH GROUP BY | | 5 | 110 | 117K (2)| 00:23:31 | | | Q1,01 | PCWP | |
|* 7 | HASH JOIN | | 69M| 1454M| 116K (1)| 00:23:23 | | | Q1,01 | PCWP | |
| 8 | BUFFER SORT | | | | | | | | Q1,01 | PCWC | |
| 9 | PART JOIN FILTER CREATE| :BF0000 | 1826 | 20086 | 241 (1)| 00:00:03 | | | Q1,01 | PCWP | |
| 10 | PX RECEIVE | | 1826 | 20086 | 241 (1)| 00:00:03 | | | Q1,01 | PCWP | |
| 11 | PX SEND BROADCAST | :TQ10000 | 1826 | 20086 | 241 (1)| 00:00:03 | | | | S->P | BROADCAST |
|* 12 | TABLE ACCESS FULL | D_DATE | 1826 | 20086 | 241 (1)| 00:00:03 | | | | | |
| 13 | PX BLOCK ITERATOR | | 80M| 847M| 116K (1)| 00:23:19 |:BF0000|:BF0000| Q1,01 | PCWC | |
| 14 | TABLE ACCESS FULL | F_PAID | 80M| 847M| 116K (1)| 00:23:19 |:BF0000|:BF0000| Q1,01 | PCWP | |
-------------------------------------------------------------------------------------------------------------------------------------------
Predicate Information (identified by operation id):
---------------------------------------------------
7 - access("F"."INCUR_DTE_S_KEY"="D"."DATE_S_KEY")
12 - filter("D"."CAL_YEAR"<=2015 AND "D"."CAL_YEAR">=2011)
-- 场景 2
EXPLAIN PLAN FOR
SELECT D."CAL_YEAR" AS "Incur_Year",
SUM(F."PAID") AS "Paid"
FROM "F_PAID" F
INNER JOIN "D_DATE" D
ON F."INCUR_DTE_S_KEY" = D."DATE_S_KEY"
WHERE D."CAL_YEAR" BETWEEN 2010 AND 2015
GROUP BY D."CAL_YEAR" ;
SELECT PLAN_TABLE_OUTPUT FROM TABLE(DBMS_XPLAN.DISPLAY());
Plan hash value: 433333781
---------------------------------------------------------------------------------------------------------------------
| Id | Operation | Name | Rows | Bytes | Cost (%CPU)| Time | Pstart| Pstop |
---------------------------------------------------------------------------------------------------------------------
| 0 | SELECT STATEMENT | | 3 | 72 | 3640 (1)| 00:00:44 | | |
| 1 | HASH GROUP BY | | 3 | 72 | 3640 (1)| 00:00:44 | | |
| 2 | NESTED LOOPS | | 170 | 4080 | 3639 (1)| 00:00:44 | | |
|* 3 | TABLE ACCESS FULL | D_DATE | 2191 | 24101 | 241 (1)| 00:00:03 | | |
| 4 | VIEW PUSHED PREDICATE | VW_GBC_5 | 1 | 13 | | | | |
|* 5 | FILTER | | | | | | | |
| 6 | SORT AGGREGATE | | 1 | 11 | | | | |
| 7 | PARTITION RANGE SINGLE | | 37958 | 407K| 1206 (1)| 00:00:15 | KEY | KEY |
| 8 | TABLE ACCESS BY LOCAL INDEX ROWID| F_PAID | 37958 | 407K| 1206 (1)| 00:00:15 | KEY | KEY |
| 9 | BITMAP CONVERSION TO ROWIDS | | | | | | | |
|* 10 | BITMAP INDEX SINGLE VALUE | IX2F_PAID | | | | | KEY | KEY |
---------------------------------------------------------------------------------------------------------------------
Predicate Information (identified by operation id):
---------------------------------------------------
3 - filter("D"."CAL_YEAR"<=2015 AND "D"."CAL_YEAR">=2010)
5 - filter(COUNT(*)>0 AND 2010<=2015)
10 - access("F"."INCUR_DTE_S_KEY"="D"."DATE_S_KEY")
-- 场景 3
EXPLAIN PLAN FOR
SELECT D."CAL_YEAR" AS "Incur_Year",
SUM(F."PAID") AS "Paid"
FROM "F_PAID" F
INNER JOIN "D_DATE" D
ON F."INCUR_DTE_S_KEY" = D."DATE_S_KEY"
GROUP BY D."CAL_YEAR" ;
SELECT PLAN_TABLE_OUTPUT FROM TABLE(DBMS_XPLAN.DISPLAY());
Plan hash value: 433333781
---------------------------------------------------------------------------------------------------------------------
| Id | Operation | Name | Rows | Bytes | Cost (%CPU)| Time | Pstart| Pstop |
---------------------------------------------------------------------------------------------------------------------
| 0 | SELECT STATEMENT | | 77 | 1848 | 28859 (1)| 00:05:47 | | |
| 1 | HASH GROUP BY | | 77 | 1848 | 28859 (1)| 00:05:47 | | |
| 2 | NESTED LOOPS | | 2129 | 51096 | 28858 (1)| 00:05:47 | | |
| 3 | TABLE ACCESS FULL | D_DATE | 27397 | 294K| 241 (1)| 00:00:03 | | |
| 4 | VIEW PUSHED PREDICATE | VW_GBC_5 | 1 | 13 | | | | |
|* 5 | FILTER | | | | | | | |
| 6 | SORT AGGREGATE | | 1 | 11 | | | | |
| 7 | PARTITION RANGE SINGLE | | 37958 | 407K| 1206 (1)| 00:00:15 | KEY | KEY |
| 8 | TABLE ACCESS BY LOCAL INDEX ROWID| F_PAID | 37958 | 407K| 1206 (1)| 00:00:15 | KEY | KEY |
| 9 | BITMAP CONVERSION TO ROWIDS | | | | | | | |
|* 10 | BITMAP INDEX SINGLE VALUE | IX2F_PAID | | | | | KEY | KEY |
---------------------------------------------------------------------------------------------------------------------
Predicate Information (identified by operation id):
---------------------------------------------------
5 - filter(COUNT(*)>0)
10 - access("F"."INCUR_DTE_S_KEY"="D"."DATE_S_KEY")
经过一些反馈,我做了一些更改。我现在获得了并行性,但分区仍然被忽略。
-- 场景 4
--
-- 变化:
-- 将 D.DATE_S_KEY 设为唯一索引
-- 将 F.INCUR_DTE_S_KEY 索引从并行 2 更改为并行 8
-- 将日期维度从并行 1 更改为并行 8
--
-- 导致使用并行 4,但仍然丢失布隆过滤器
EXPLAIN PLAN FOR
SELECT D."CAL_YEAR" AS "Incur_Year",
SUM(F."PAID") AS "Paid"
FROM "F_PAID" F
INNER JOIN "D_DATE" D
ON F."INCUR_DTE_S_KEY" = D."DATE_S_KEY"
where D.CAL_YEAR between 2010 and 2015
GROUP BY D."CAL_YEAR" ;
SELECT PLAN_TABLE_OUTPUT FROM TABLE(DBMS_XPLAN.DISPLAY());
Plan hash value: 881547170
--------------------------------------------------------------------------------------------------------------------------------------------------------
| Id | Operation | Name | Rows | Bytes | Cost (%CPU)| Time | Pstart| Pstop | TQ |IN-OUT| PQ Distrib |
--------------------------------------------------------------------------------------------------------------------------------------------------------
| 0 | SELECT STATEMENT | | 4 | 96 | 1515 (1)| 00:00:19 | | | | | |
| 1 | PX COORDINATOR | | | | | | | | | | |
| 2 | PX SEND QC (RANDOM) | :TQ10001 | 4 | 96 | 1515 (1)| 00:00:19 | | | Q1,01 | P->S | QC (RAND) |
| 3 | HASH GROUP BY | | 4 | 96 | 1515 (1)| 00:00:19 | | | Q1,01 | PCWP | |
| 4 | PX RECEIVE | | 4 | 96 | 1515 (1)| 00:00:19 | | | Q1,01 | PCWP | |
| 5 | PX SEND HASH | :TQ10000 | 4 | 96 | 1515 (1)| 00:00:19 | | | Q1,00 | P->P | HASH |
| 6 | HASH GROUP BY | | 4 | 96 | 1515 (1)| 00:00:19 | | | Q1,00 | PCWP | |
| 7 | NESTED LOOPS | | 170 | 4080 | 1514 (1)| 00:00:19 | | | Q1,00 | PCWP | |
| 8 | PX BLOCK ITERATOR | | | | | | | | Q1,00 | PCWC | |
|* 9 | TABLE ACCESS FULL | D_DATE | 2191 | 24101 | 33 (0)| 00:00:01 | | | Q1,00 | PCWP | |
| 10 | VIEW PUSHED PREDICATE | VW_GBC_5 | 1 | 13 | | | | | Q1,00 | PCWP | |
|* 11 | FILTER | | | | | | | | Q1,00 | PCWP | |
| 12 | SORT AGGREGATE | | 1 | 11 | | | | | Q1,00 | PCWP | |
| 13 | PARTITION RANGE SINGLE | | 37958 | 407K| 1206 (1)| 00:00:15 | KEY | KEY | Q1,00 | PCWP | |
| 14 | TABLE ACCESS BY LOCAL INDEX ROWID| F_PAID | 37958 | 407K| 1206 (1)| 00:00:15 | KEY | KEY | Q1,00 | PCWP | |
| 15 | BITMAP CONVERSION TO ROWIDS | | | | | | | | Q1,00 | PCWP | |
|* 16 | BITMAP INDEX SINGLE VALUE | IX2F_PAID | | | | | KEY | KEY | Q1,00 | PCWP | |
--------------------------------------------------------------------------------------------------------------------------------------------------------
Predicate Information (identified by operation id):
---------------------------------------------------
9 - filter("D"."CAL_YEAR"<=2015 AND "D"."CAL_YEAR">=2010)
11 - filter(COUNT(*)>0 AND 2010<=2015)
16 - access("F"."INCUR_DTE_S_KEY"="D"."DATE_S_KEY")
最佳答案
根据 Dani Schnider 的说法,https://danischnider.wordpress.com/2015/03/13/time-dimension-keys-and-partitioning/ ,看来Oracle在使用日期作为分区键时,确实希望数据类型是DATE。我使用基于 to_char(date, ‘yyyymmdd’) 的整数作为代理键,就像 Ralph Kimball 那样。
我将所有日期键切换为实际日期,将索引位图连接起来,并且执行计划适用于所有日期范围。
Plan hash value: 4263234925
----------------------------------------------------------------------------------------------------------------------------------------------------
| Id | Operation | Name | Rows | Bytes | Cost (%CPU)| Time | Pstart| Pstop | TQ |IN-OUT| PQ Distrib |
----------------------------------------------------------------------------------------------------------------------------------------------------
| 0 | SELECT STATEMENT | | 4 | 100 | 5921 (1)| 00:01:12 | | | | | |
| 1 | PX COORDINATOR | | | | | | | | | | |
| 2 | PX SEND QC (RANDOM) | :TQ20001 | 4 | 100 | 5921 (1)| 00:01:12 | | | Q2,01 | P->S | QC (RAND) |
| 3 | HASH GROUP BY | | 4 | 100 | 5921 (1)| 00:01:12 | | | Q2,01 | PCWP | |
| 4 | PX RECEIVE | | 4 | 100 | 5921 (1)| 00:01:12 | | | Q2,01 | PCWP | |
| 5 | PX SEND HASH | :TQ20000 | 4 | 100 | 5921 (1)| 00:01:12 | | | Q2,00 | P->P | HASH |
| 6 | HASH GROUP BY | | 4 | 100 | 5921 (1)| 00:01:12 | | | Q2,00 | PCWP | |
| 7 | NESTED LOOPS | | 195 | 4875 | 5920 (1)| 00:01:12 | | | Q2,00 | PCWP | |
| 8 | PX BLOCK ITERATOR | | 2505 | 30060 | 157 (0)| 00:00:02 | 1 | 15 | Q2,00 | PCWC | |
|* 9 | TABLE ACCESS FULL | D_DATE | 2505 | 30060 | 157 (0)| 00:00:02 | 1 | 15 | Q2,00 | PCWP | |
| 10 | VIEW PUSHED PREDICATE | VW_GBC_5 | 1 | 13 | | | | | Q2,00 | PCWP | |
|* 11 | FILTER | | | | | | | | Q2,00 | PCWP | |
| 12 | SORT AGGREGATE | | 1 | 177 | | | | | Q2,00 | PCWP | |
| 13 | PX COORDINATOR | | | | | | | | | | |
| 14 | PX SEND QC (RANDOM) | :TQ10000 | 1 | 177 | | | | | Q1,00 | P->S | QC (RAND) |
| 15 | SORT AGGREGATE | | 1 | 177 | | | | | Q1,00 | PCWP | |
| 16 | PX BLOCK ITERATOR | | 37958 | 6561K| 5554 (1)| 00:01:07 | KEY | KEY | Q1,00 | PCWC | |
|* 17 | TABLE ACCESS FULL| F_PAID | 37958 | 6561K| 5554 (1)| 00:01:07 | KEY | KEY | Q1,00 | PCWP | |
----------------------------------------------------------------------------------------------------------------------------------------------------
Predicate Information (identified by operation id):
---------------------------------------------------
9 - filter("D_DATE_INCUR"."CAL_YEAR"<=2015 AND "D_DATE_INCUR"."CAL_YEAR">=2009)
11 - filter(COUNT(SYS_OP_CSR(SYS_OP_MSR(COUNT(*),SUM("F_ACTIVE_MED_CLAIMS_PART"."PAID")),0))>0 AND 2009<=2015)
17 - filter("F_ACTIVE_MED_CLAIMS_PART"."INCUR_DTE_S_KEY"="D_DATE_INCUR"."CAL_DATE")
关于当筛选日期范围跨越 4 年以上时,Oracle 查询会失去并行性,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/32226540/