任何人都可以帮助我,因为我是 python 新手。我有一个名为购买数据的数据集,其中每个 CaseID 的每个 PO 事件的日志数据都存在于数据集中。
Case Id Activity transactionstatus
1 Create Purchase Requisition Closed
1 Create Request for Quotation Requester Closed
1 Analyze Request for Quotation Closed
1 Send Request for Quotation to Supplier Closed
1 Create Quotation comparison Map Closed
1 Analyze Quotation comparison Map Closed
1 Choose best option Closed
1 Settle conditions with supplier Closed
1 Create Purchase Order Closed
1 Confirm Purchase Order Closed
1 Deliver Goods Services Closed
1 Release Purchase Order Closed
1 Approve Purchase Order for payment Closed
1 Send invoice Closed
1 Release Supplier's Invoice Closed
1 Authorize Supplier's Invoice payment Closed
1 Pay invoice Closed
这里每一个Case Id都被视为一个变量。这样的变量一共有1949个。
例如: 案例 ID:1 被视为从“事件”列“创建采购申请”到“支付发票”的一个变量,然后 TransactionStatus 被视为“已关闭”。 (如上数据)
现在有很多 caseID 的 transactionStatus 为“open”,现在我想做的是尝试获取所有 caseId 和相应的 Activity(从事件“create...”直到“的全部数据”分析....'),交易状态为“打开”并尝试将其放入单独的数据集中
例如:
Case ID Activity TransactionStatus
1941 Create Purchase Requisition Closed
1941 Analyze Purchase Requisition Closed
1941 Create Request for Quotation Requester Manager Closed
1941 Analyze Request for Quotation Open
1949 Create Purchase Requisition Closed
1949 Analyze Purchase Requisition Open
共有 196 条记录处于开放交易状态! 谁能帮我看看我能用什么方式做到这一点
最佳答案
试试这个
df = df[df.loc[:, 'caseID'].isin(df[df['TransactionStatus'] == 'Open']['caseID'])]
关于python dataset - 读取一组列并将其放入单独的数据框中?,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/51979976/