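I have the following statement:
Values = FILTER Input_Data BY Fields > 0;
How can I count the records that pass the filter and the records that are filtered out?
Best answer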
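-- split into two relations; a record whose Fields is null matches neither branch
SPLIT Input_Data INTO A IF Fields > 0, B IF Fields <= 0;
-- count the records that pass the filter (Fields > 0)
A_grp = GROUP A ALL;
A_count = FOREACH A_grp GENERATE COUNT(A);
-- count the records that are filtered out (Fields <= 0)
B_grp = GROUP B ALL;
B_count = FOREACH B_grp GENERATE COUNT(B);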
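If you would rather get both numbers from a single GROUP, one possible alternative is to flag each record and aggregate the flags. This is only a sketch, not part of the original answer: it assumes Input_Data is already loaded with a numeric column named Fields, and the aliases flagged, all_rows, counts, kept, kept_count and dropped_count are made up for illustration.

```pig
-- Assumption: Input_Data is already loaded and Fields is numeric.
-- Flag each record: 1 if it would pass the filter, 0 otherwise (null if Fields is null).
flagged = FOREACH Input_Data GENERATE ((Fields > 0) ? 1L : 0L) AS kept;

-- Put every flag into one bag so the aggregates see the whole input.
all_rows = GROUP flagged ALL;

-- SUM skips null flags; COUNT_STAR counts every record, nulls included.
counts = FOREACH all_rows GENERATE
    SUM(flagged.kept) AS kept_count,
    COUNT_STAR(flagged) - SUM(flagged.kept) AS dropped_count;

DUMP counts;
```

Note that, unlike the SPLIT version, this sketch counts records with a null Fields as dropped, so the two approaches can disagree when the data contains nulls.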
Hope this helps!!
For more on "hadoop - count values after filtering - Apache PIG", see the similar question on Stack Overflow: https://stackoverflow.com/questions/39533846/