python - Pandas Concat 返回 "InvalidIndexError: Reindexing only valid with uniquely valued Index objects"错误

标签 python python-3.x pandas dataframe concatenation

我有两个具有相同标题的数据框。当我尝试连接 (pd.concat()) 这两个 dfs (df1, df2) 时，出现错误:

"InvalidIndexError: Reindexing only valid with uniquely valued Index objects"

我发现问题是每个数据框中都有重复的列名称。示例:

df1

respondent ID - Column1 - Column2 - Column3 - Column1 - Column2 - Column3

df2

respondent ID - Column1 - Column2 - Column3 - Column1 - Column2 - Column3

理论上我只是想在df1下面添加df2的数据(不带df2 ofc的header)

如何绕过这个？有什么想法吗？

最佳答案

重复的列不应成为问题(即使使用ignore_index=False):

df1 = pd.DataFrame([range(7)], columns=['respondent ID', 'Column1', 'Column2', 'Column3', 'Column1', 'Column2', 'Column3'])
df2 = pd.DataFrame([['2']*7], columns=['respondent ID', 'Column1', 'Column2', 'Column3', 'Column1', 'Column2', 'Column3'])
pd.concat([df1, df2], ignore_index=True)

输出:

  respondent ID Column1 Column2 Column3 Column1 Column2 Column3
0             0       1       2       3       4       5       6
1             2       2       2       2       2       2       2

关于python - Pandas Concat 返回 "InvalidIndexError: Reindexing only valid with uniquely valued Index objects"错误，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/69929277/

上一篇：python - Pandas，如何找到互补的时间范围？

下一篇：apache-spark - 如何将 Great Expectations DataFrame 转换为 Apache Spark DataFrame

Python:获取对象的str时的最大递归深度

python - 有没有办法在Python中使用super()来调用每个基类的__init__方法？

python - Web 抓取导致 403 禁止错误

python-3.x - TypeError : can't pickle _thread. 锁定 Seq2Seq 中的对象

python - Python中x的第n个根有简写吗？

python - 另一个合并列表列表，但最 pythonic 方式

python - Pandas :获取与查询关联的掩码

python - 使用 selenium 读取多个文本元素时出现错误消息

python - 从 pandas cut 中排序垃圾箱