python - Pandas 数据框 : For loop that adds a fixed integer if the value exists in previous rows

标签 python pandas dataframe loops for-loop

对于以下数据框

df = pd.DataFrame({'Rounds':[1000,1000,1000,1000,3000,3000,4000,5000,6000,6000]})

我想要一个 for 循环，如果该值已经存在于前面的行中，则将一个固定的 int(在本例中为 25)添加到该值并创建:

df = pd.DataFrame({'Rounds':[1000,1025,1050,1075,3000,3025,4000,5000,6000,6025]})

一开始我试过

for i in df.index:
    if df.iat[i,1] == df.iloc[i-1,1]:
        df.iat[i,1] = df.iat[i-1,1]+25

问题是它不适用于一列中超过两个相似的值，我想给列名称“Rounds”而不是列的索引。

最佳答案

你需要groupby.cumcount :

df['Rounds'] += df.groupby('Rounds').cumcount()*25

输出:

中级:

df.groupby('Rounds').cumcount()

0    0
1    1
2    2
3    3
4    0
5    1
6    0
7    0
8    0
9    1
dtype: int64

关于python - Pandas 数据框 : For loop that adds a fixed integer if the value exists in previous rows，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/73134568/

上一篇：TypeScript 推断数组 [index]

下一篇：r - 使用混合日期格式解析日期列

相关文章：

python - 如何使用包含字段和值的列表过滤模型？

python - Pandas DataFrame上的NaN替换引发TypeError:找不到匹配的签名

python - 如何合并列表中的对象

python - 通过混合列表和数据帧列，数组长度与索引长度不匹配

python - 为 dask 中的行创建唯一的 id

python - 使用 pyinstaller 和 mayavi 导入创建独立的 exe

python - IBEX 中的流水线变压器阶段、Scikit-Learn 和 Pandas 中的列访问问题

python - pandas.read_csv 无法导入路径中带有重音符号的文件

python - 如何根据排序算法获得 pandas 的获胜者选民

python - (Pygame) 鼠标悬停检测的问题