python - 在 Pandas DataFrame 中构建多重索引

标签 python pandas indexing dataframe multi-index

我正在将两个文件读入 Python，它们的形式如下:

           0.00902317     0.0270695     0.0451159     0.0631622  \   
0000010  6.962980e-05  7.063750e-05  7.165970e-05  7.269680e-05   
1000010  0.000000e+00  0.000000e+00  0.000000e+00  0.000000e+00   
2000010  0.000000e+00  0.000000e+00  0.000000e+00  0.000000e+00

第一行是身份证号码，各列是不同的年龄。这两个文件的组成年龄不同，只有几个共同的 ID#。

最终我将两个数据帧组合起来找到共同的 ID#。但我想要结果数据框

               File 1                      File 2    
           0.00902317     0.0270695     0.0675493     0.1091622  \   
0000010  6.962980e-05  7.063750e-05  0.000000e+00  0.000000e+00   
1000010  0.000000e+00  0.000000e+00  0.000000e+00  0.000000e+00   
2000010  0.000000e+00  0.000000e+00  0.000000e+00  0.000000e+00

有没有办法制作一个看起来像这样的数据框，多索引列？

抱歉，如果这是一个简单的问题，我是使用数据帧的新手。

最佳答案

我认为你可以使用concat :

print (pd.concat([df1, df2], axis=1, keys=['File 1','File 2']))

            File 1                                  File 2            
        0.00902317 0.0270695 0.0451159 0.0631622 0.0675493 0.1091622  
0000010    0.00007  0.000071  0.000072  0.000073       0.0         0.0
1000010    0.00000  0.000000  0.000000  0.000000       0.0         0.0
2000010    0.00000  0.000000  0.000000  0.000000       0.0         0.0

关于python - 在 Pandas DataFrame 中构建多重索引，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/38656625/

上一篇：python - 以内存有效的方式迭代添加 pandas 数据帧

下一篇：python - 在Django home中添加动态生成的matplotlib图片

相关文章：

python - Pandas 使用数据框作为字典或查找

mysql - 如何加快MySQL的搜索速度？是否可以使用特殊字符进行全文搜索？

python - Boost Python 以 std::vectors 作为参数导入 C++ 函数

python - Argparse:根据正则表达式区分参数？

python - Django 3.1 - "OperationalError: no such table"在进行或应用迁移之前使用 ORM 类模型时

python - 使用 Pandas 进行朴素预测

python - 如何更改使用 Matplotlib 绘制的图形的大小？

sql - 非主键列上的聚集索引还是非聚集索引？

python - 如何在不知道索引的情况下查询 pandas 数据帧的特定值

python - Numpy 链式索引 "force"可查看而不是复制