Python Pandas : Create New Columns For Each Instance of A Particular Column Value

我有一个像这样的数据框

----------------
RecID| A  |B
----------------
1    |Dog | x 
2    |Dog | y 
3    |Dog | z
4    |Cat | a 
5    |Cat | b

并且想知道是否有办法将其转换为这样:

-----------------------------
RecID| A |B_1|B_2|B_3|
-----------------------------
1    |Dog| x | y | z |
2    |Cat| a | b | NA|

基本上，为 B 的每个可能值创建新列，并按 A 的特定值分组，并在需要时填充 NA。

最佳答案

一种方法是

In [294]: (df.groupby('A', sort=False).B.apply(list)
             .apply(pd.Series).add_prefix('B_').reset_index())
Out[294]:
     A B_0 B_1  B_2
0  Dog   x   y    z
1  Cat   a   b  NaN

或者，

In [320]: (df.groupby('A', sort=False).B.apply(lambda x: pd.Series(x.values))
             .unstack().rename(columns=lambda x: 'B_{}'.format(int(x)+1))
             .reset_index())
Out[320]:
     A B_1 B_2   B_3
0  Dog   x   y     z
1  Cat   a   b  None

关于Python Pandas : Create New Columns For Each Instance of A Particular Column Value，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/45967900/

上一篇：python - 使用 pandas series.sample() 返回索引？

下一篇：python - 组合 setattr 和 getattr 会导致无限循环

相关文章：

python - Flask SqlAlchemy 开始和结束事务

python - 复制 Pandas DF N 次

python - 循环遍历数据框(列和行)并替换数据

python - 从列表中查找基因名称到数据框

Python 在两次使用一个值时避免使用变量？

python - 欧拉计划 #10(Python)

python - 根据条件将数据框列拆分为不同的列

python - 将 DataFrame.query() 与 pandas.Series.str.contains 一起使用会得到 AttributeError : 'dict' object has no attribute 'append'

pandas - 如何检查具有多种类型的列中的数据类型？ -寻找更好的解决方案

python - Pandas : columns sample based on row value

Python Pandas : Create New Columns For Each Instance of A Particular Column Value

上一篇：python - 使用 pandas series.sample() 返回索引？

下一篇：python - 组合 __setattr__ 和 __getattr__ 会导致无限循环

下一篇：python - 组合 setattr 和 getattr 会导致无限循环