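Based on this answer, I want to write a function that loads a CSV into an OrderedDict(), but I don't know how to pass the key column name as a string instead of hard-coding it. Here is my code to make this clearer: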
dic_key = 'uniqueID'
df.dic_key #this gives AttributeError: 'DataFrame' object has no attribute 'dic_key'
I want this to behave like df.uniqueID, where uniqueID is the name of the column to use as the key.
The full code is below:
def csv_to_OrderedDic1(path, dic_key='uniqueID'):
    '''
    Parameters:
        dic_key: the name of the column to be used as the dictionary key
    '''
    df = pd.DataFrame.from_csv(path, sep='\t', header=0)
    # Get an unordered dictionary
    unordered_dict = df.set_index(dic_key).T.to_dict('list')
    # Then order it
    ordered_dict = OrderedDict((k,unordered_dict.get(k)) for k in df.dic_key)
    return ordered_dict
Best answer
I think it is better to use read_csv, and to select the column with [] rather than dot notation:
from collections import OrderedDict
import pandas as pd

def csv_to_OrderedDic1(path, dic_key='uniqueID'):
    '''
    Parameters:
        dic_key: the name of the column to be used as the dictionary key
    '''
    df = pd.read_csv(path, sep='\t', header=0)
    # Get an unordered dictionary: key column value -> list of the other column values
    unordered_dict = df.set_index(dic_key).T.to_dict('list')
    # Then order it by the key column's original row order
    ordered_dict = OrderedDict((k, unordered_dict.get(k)) for k in df[dic_key])
    return ordered_dict
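To see why the bracket form matters, here is a minimal sketch using a small made-up DataFrame (the column names and values are illustrative assumptions, not data from the question). Dot notation looks for an attribute literally named dic_key, while df[dic_key] uses the string stored in the variable.

```python
import pandas as pd

# Hypothetical sample data, for illustration only
df = pd.DataFrame({'uniqueID': ['a', 'b'], 'value': [1, 2]})
dic_key = 'uniqueID'

# Bracket notation uses the string held by the variable
by_brackets = df[dic_key].tolist()

# getattr also resolves the name at runtime, but only works when the
# column name is a valid Python identifier; brackets are more general
by_getattr = getattr(df, dic_key).tolist()

# Dot notation looks up an attribute literally called "dic_key"
try:
    df.dic_key
    dot_failed = False
except AttributeError:
    dot_failed = True
```

Bracket access also works for column names that contain spaces or clash with DataFrame method names, which dot notation cannot handle.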
Another solution, using zip and removing the key column with drop:
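from collections import OrderedDict
import pandas as pd

def csv_to_OrderedDic1(path, dic_key='uniqueID'):
    '''
    Parameters:
        dic_key: the name of the column to be used as the dictionary key
    '''
    df = pd.read_csv(path, sep='\t', header=0)
    # Pair each key with the list of remaining values in its row
    # (axis must be passed by keyword in recent pandas versions)
    L = zip(df[dic_key], df.drop(dic_key, axis=1).values.tolist())
    ordered_dict = OrderedDict(L)
    return ordered_dict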
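As a quick check of the zip approach, the sketch below runs it on an in-memory tab-separated string instead of a file on disk. The function name csv_to_ordered_dict, the sample data, and the use of io.StringIO are assumptions made for illustration; read_csv accepts a file-like object as well as a path.

```python
import io
from collections import OrderedDict

import pandas as pd


def csv_to_ordered_dict(source, dic_key='uniqueID'):
    """Hypothetical variant of the answer's function that takes a path or file-like object."""
    df = pd.read_csv(source, sep='\t', header=0)
    # Pair each key with the remaining values of its row, preserving row order
    rows = df.drop(dic_key, axis=1).values.tolist()
    return OrderedDict(zip(df[dic_key], rows))


# Made-up tab-separated data; keys are deliberately not in sorted order
sample = "uniqueID\tx\ty\nb\t1\t2\na\t3\t4\n"
result = csv_to_ordered_dict(io.StringIO(sample))
keys_in_order = list(result.keys())
```

The keys come back in the order the rows appear in the file, and passing a different column name as dic_key switches the key column without touching the function body.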
For python - pandas.dataframe to orderedDictionary: using a passed argument to specify the key column name instead of explicitly writing it, a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/46644248/