python - 如何将 group by Keys 应用到相关组

我有一个数据框，我使用 group by 对它们进行分组，如下所示

Name      Nationality    age
Peter     UK             28
John      US             29 
Wiley     UK             28 
Aster     US             29 

grouped = self_ex_df.groupby([Nationality, age])

现在我想为每个值附加一个唯一的 ID

我正在尝试这个，但不确定它是否有效？

uniqueID = 'ID_'+ grouped.groups.keys().astype(str)

    uniqueID    Name      Nationality    age
     ID_UK28    Peter       UK             28
     ID_US29    John        US             29 
     ID_UK28    Wiley       UK             28 
     ID_US29    Aster       US             29

我现在想将其合并到一个新的 DF 中，如下所示

 uniqueID   Nationality    age   Text
  ID_UK28     UK           28    Peter and Whiley have a combined age of 56
  ID_US_29    US           29    John and Aster have a combined age of 58

如何实现上述目标？

最佳答案

希望足够接近，无法获得平均年龄:

import pandas as pd

#create dataframe
df = pd.DataFrame({'Name': ['Peter', 'John', 'Wiley', 'Aster'], 'Nationality': ['UK', 'US', 'UK', 'US'], 'age': [28, 29, 28, 29]})

#make uniqueID
df['uniqueID'] = 'ID_' + df['Nationality'] + df['age'].astype(str)

#groupby has agg method that can take dict and preform multiple aggregations
df = df.groupby(['uniqueID', 'Nationality']).agg({'age': 'sum', 'Name': lambda x: ' and '.join(x)})

#to get text you just combine new Name and sum of age
df['Text'] = df['Name'] + ' have a combined age of ' + df['age'].astype(str)

关于python - 如何将 group by Keys 应用到相关组，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/43719179/

上一篇：python - Django filter() 增加结果查询集而不是减少

下一篇：python - 如何从父调用子构造函数？

相关文章：

python - 使用 Cython 实现 Numba 的性能

python - MayaVi:显示的 mlab 段错误

python - pandas.Series.explode 抛出 AttributeError

python - 根据另一个数组填充numpy数组直到位置

python - 在python中乘以大型稀疏矩阵

python - 在Python中解析嵌套的urlencode请求体

python - python 中的编程错误，如何使用 SQL 查询解析表中元组的值？

python - 根据总数的比例删除 pandas 数据框中的行

python - pandas 中的左连接无需创建左右变量

python - numpy 中的 FFT 与 MATLAB 中的 FFT 没有相同的结果