所以我想做的是重新索引一个数据帧,该数据帧在框架内有一堆单独的组。每个都有自己的索引,我想在其中添加月末索引。我认为最好的例子是:
给定这个数据框:
In [72]: a = {'2014-01-02': {'A': 1}, '2014-02-03': {'A': 1}, '2014-03-05': {'A': 1}}
In [74]: b = {'2015-01-02': {'A': 2}, '2015-02-03': {'A': 2}, '2015-03-05': {'A': 2}}
In [76]: pd.DataFrame.from_dict(a,orient='index').append(pd.DataFrame.from_dict(b,orient='index'))
Out[76]:
A
2014-01-02 1
2014-02-03 1
2014-03-05 1
2015-01-02 2
2015-02-03 2
2015-03-05 2
(注意 2014 年与 2015 年对比)
我想获取“A”列中的两个组,并对每个组重新建立索引,以包含每个组从最小值到索引最大值之后的月末。我想要这样的输出:
A
2014-01-02 1
2014-01-31 1
2014-02-03 1
2014-02-28 1
2014-03-05 1
2014-03-31 1
2015-01-02 2
2015-01-31 2
2015-02-03 2
2015-02-28 2
2015-03-05 2
2015-03-31 2
最佳答案
In [59]:
df.index = pd.to_datetime(df.index , format = '%Y-%m-%d')
df
Out[59]:
A
2014-01-02 1
2014-02-03 1
2014-03-05 1
2015-01-02 2
2015-02-03 2
2015-03-05 2
In [61]:
month_end = df.resample('M').dropna()
month_end
Out[61]:
A
2014-01-31 1
2014-02-28 1
2014-03-31 1
2015-01-31 2
2015-02-28 2
2015-03-31 2
In [64]:
pd.concat([df , month_end]).sortlevel(0)
Out[64]:
A
2014-01-02 1
2014-01-31 1
2014-02-03 1
2014-02-28 1
2014-03-05 1
2014-03-31 1
2015-01-02 2
2015-01-31 2
2015-02-03 2
2015-02-28 2
2015-03-05 2
2015-03-31 2
关于python - 按组重新索引 Pandas 时间序列,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/33683390/