python - 根据最后一行获取新值并检查 ID

当前日期范围。

ID  Date     Start Value    Payment
111 1/1/2018    1000        0
111 1/2/2018                100
111 1/3/2018                500
111 1/4/2018                400
111 1/5/2018                0
222 4/1/2018    2000        200
222 4/2/2018                100
222 4/3/2018                700
222 4/4/2018                0
222 4/5/2018                0
222 4/6/2018                1000
222 4/7/2018                0

这是我要获取的数据框。基本上，我试图为每一行填充星号。如您所见，每个 ID 在第一天都有一个起始值。次日起步价=昨日起步价-昨日支付。

   ID   Date    Start Value     Payment
    111 1/1/2018    1000        0
    111 1/2/2018    1000        100
    111 1/3/2018    900         500
    111 1/4/2018    400         400
    111 1/5/2018    0           0
    222 4/1/2018    2000        200
    222 4/2/2018    1800        100
    222 4/3/2018    1700        700
    222 4/4/2018    1000        0
    222 4/5/2018    1000        0
    222 4/6/2018    1000        1000
    222 4/7/2018    0           0

现在，我将 Excel 与此公式结合使用。起始值=if(本行ID==最后一行ID，最后一行起始值-最后一行付款，起始值)

效果很好，我想知道我是否可以在 Python/Pandas 中完成。谢谢。

最佳答案

我们可以使用groupby 和shift + cumsum，ffill 将为下面的所有行设置初始值相同的Id，那么我们只需要从该行中扣除累计付款直到开始，我们就得到了那一点的剩余值(value)

df.StartValue.fillna(df.groupby('ID').apply(lambda x : x['StartValue'].ffill()-x['Payment'].shift().cumsum()).reset_index(level=0,drop=True))
Out[61]: 
0     1000.0
1     1000.0
2      900.0
3      400.0
4        0.0
5     2000.0
6     1800.0
7     1700.0
8     1000.0
9     1000.0
10    1000.0
11       0.0
Name: StartValue, dtype: float64

通过添加 inplace=Ture 将其重新分配

df.StartValue.fillna(df.groupby('ID').apply(lambda x : x['StartValue'].ffill()-x['Payment'].shift().cumsum()).reset_index(level=0,drop=True),inplace=True)
df
Out[63]: 
     ID      Date  StartValue  Payment
0   111  1/1/2018      1000.0        0
1   111  1/2/2018      1000.0      100
2   111  1/3/2018       900.0      500
3   111  1/4/2018       400.0      400
4   111  1/5/2018         0.0        0
5   222  4/1/2018      2000.0      200
6   222  4/2/2018      1800.0      100
7   222  4/3/2018      1700.0      700
8   222  4/4/2018      1000.0        0
9   222  4/5/2018      1000.0        0
10  222  4/6/2018      1000.0     1000
11  222  4/7/2018         0.0        0

关于python - 根据最后一行获取新值并检查 ID，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/50025210/

python - 根据最后一行获取新值并检查 ID

上一篇：python - 导入错误没有名为kafka的模块

下一篇：python - 在不使用 python 请求模块的情况下进行此 API 调用的最佳替代方法是什么？