python - 如何选择数据框中列的每个类别？

标签 python pandas dataframe numpy data-science

我有一个类似这样的数据框。

我想通过分组和聚合来计算每个 food_category 的 co2_emission 的方差和标准差。并且必须采用这种格式

print(food_consumption.____(____)['co2_emission'].agg([____]))

这是我到目前为止所做的

print(food_consumption. .....(....)['co2_emission'].agg([np.var(food_consumption['co2_emission'], ddof=1),np.sqrt(np.var(food_consumption['co2_emission'], ddof=1))]))

我必须选择名为 food_category 的列的每个类别。如何做到这一点？

最佳答案

因为 Pandas Series.var和 Series.std默认 ddof=1 将它们传递给 agg:

print(food_consumption.groupby('food_category')['co2_emission'].agg(['var','std']))

关于python - 如何选择数据框中列的每个类别？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/71223167/

上一篇：google-authenticator - 我们可以安装 Google Authenticator 扩展吗？

下一篇：flutter - 无法无条件访问属性 'latitude' ，因为接收者可以是 'null' 。尝试将访问设置为有条件的(使用 '?.' )

相关文章：

python - 如何从 url 获取 int 值

python - Pandas 使用值计数获取类型

python - Python向数据库添加字符串后字母错误

python - Django ManyToMany 字段从没有 'for' 的对象中获取所有值

windows - sklearn : ImportError: DLL load failed: The specified module could not be found

python - 根据所选列过滤重复的行并与 Pandas 中的另一个数据框进行比较

r - 为数据框中的每个日期插入日期序列

python - Pandas 中 boolean 索引的逻辑运算符

python - python 列表理解中的 if-else

python - 如何使用最新的 pandas 版本计算扩展协整？