Python:使用 Pandas 从数据框中选择特定日期

以下简短脚本使用 findatapy从 Dukascopy 收集数据网站。请注意，此包使用 Pandas，不需要单独导入。

from findatapy.market import Market, MarketDataRequest, MarketDataGenerator

market = Market(market_data_generator=MarketDataGenerator())
md_request = MarketDataRequest(start_date='08 Feb 2017', finish_date='09 Feb 2017', category='fx', fields=['bid', 'ask'], freq='tick', data_source='dukascopy', tickers=['EURUSD'])

df = market.fetch_market(md_request)

#Group everything by an hourly frequency.
df=df.groupby(pd.TimeGrouper('1H')).head(1)

#Deleting the milliseconds from the Dateframe
df.index =df.index.map(lambda t: t.strftime('%Y-%m-%d %H:%M:%S'))

#Computing Average between columns 1 and 2, and storing it in a new one.
df['Avg'] = (df['EURUSD.bid'] + df['EURUSD.ask'])/2

结果是这样的:

到目前为止，一切都正常运行，但我需要从该数据框中提取特定的时间。比方说，我想在特定时间上午 10:00:00 选择所有值(出价、要价、平均...或其中之一)。

通过查看其他posts ，我想我可以做这样的事情:

match_timestamp = "10:00:00"
df.loc[(df.index.strftime("%H:%M:%S") == match_timestamp)]

但结果是一条错误消息:

AttributeError: 'Index' object has no attribute 'strftime'

我什至无法执行 df.index.hour，它曾经在我删除毫秒的行之前工作(dtype 是 datetime64[ns] 直到那一点)，之后 dtype 是一个“对象”。看来我需要反转此格式才能使用 strftime。

你能帮帮我吗？

最佳答案

你应该看看resample :

df = df.resample('H').first()  # resample for each hour and use first value of hour

然后:

df.loc[df.index.hour == 10]  # index is still a date object, play with it

如果你不喜欢那样，你可以像这样将你的索引设置为一个日期时间对象:

df.index = pd.to_datetime(df.index)

那么你的代码应该可以正常工作了

关于Python:使用 Pandas 从数据框中选择特定日期，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/46674889/

Python:使用 Pandas 从数据框中选择特定日期

上一篇：python - 基于条件在数据框中创建列

下一篇：python - (Python) while 循环打破了我的 random_number