python - 阅读 excel 框架时跳过特定的一组列 - pandas

我事先知道我不需要 excel 文件中的哪些列，并且我想在读取文件时避免使用它们以提高性能。像这样:

import pandas as pd
df = pd.read_excel('large_excel_file.xlsx', skip_cols=['col_a', 'col_b',...,'col_zz'])

文档中没有与此相关的内容。有什么解决方法吗？

最佳答案

如果你的 pandas 版本允许(首先检查你是否可以将函数传递给 usecols)，我会尝试类似的方法:

import pandas as pd
df = pd.read_excel('large_excel_file.xlsx', usecols=lambda x: 'Unnamed' not in x,)

这应该跳过所有没有标题名称的列。您可以将“未命名”替换为您不需要的列名列表。

关于python - 阅读 excel 框架时跳过特定的一组列 - pandas，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/49677313/

相关文章：

excel - 将过滤列复制到另一列但索引相同