python - Google Dataflow Python quickstart error - GcsIO has no attribute

Tags: python google-cloud-dataflow

I have been following the Dataflow Python Quickstart and get an error when running the wordcount example pipeline:

    ...
      File "apache_beam/io/fileio.py", line 281, in glob
        return gcsio.GcsIO().glob(path, limit)
    AttributeError: 'NoneType' object has no attribute 'GcsIO'

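I tried it with my own pipeline and got the same result. I'm not sure what the problem is, since I believe I followed the tutorial exactly. The error seems to be related to the read/write transforms.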

    Traceback (most recent call last):
      File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/runpy.py", line 162, in _run_module_as_main
        "__main__", fname, loader, pkg_name)
      File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/runpy.py", line 72, in _run_code
        exec code in run_globals
      File "/Users/Alex/beam/sdks/python/apache_beam/examples/wordcount.py", line 116, in <module>
        run()
      File "/Users/Alex/beam/sdks/python/apache_beam/examples/wordcount.py", line 87, in run
        lines = p | 'read' >> ReadFromText(known_args.input)
      File "apache_beam/io/textio.py", line 378, in __init__
        skip_header_lines=skip_header_lines)
      File "apache_beam/io/textio.py", line 87, in __init__
        validate=validate)
      File "apache_beam/io/filebasedsource.py", line 97, in __init__
        self._validate()
      File "apache_beam/io/filebasedsource.py", line 171, in _validate
        if len(fileio.ChannelFactory.glob(self._pattern, limit=1)) <= 0:
      File "apache_beam/io/fileio.py", line 281, in glob
        return gcsio.GcsIO().glob(path, limit)
    AttributeError: 'NoneType' object has no attribute 'GcsIO'

Any idea what I'm doing wrong?

Thanks

Best answer

This happens because you don't have the google-apitools package installed (this is mentioned in the code, but it should be better documented).
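To see why a missing package surfaces as an AttributeError rather than an ImportError, here is a minimal sketch of the optional-import pattern that the traceback suggests. It is an illustrative reconstruction, not the actual Beam source: the module name `a_module_that_is_not_installed` and the simplified `glob` function are hypothetical stand-ins, and the assumption is that `fileio.py` falls back to `gcsio = None` when its GCS dependencies cannot be imported.

```python
# Illustrative reconstruction of an optional-import fallback.
# Assumption: Beam's fileio.py does something similar when the GCS
# dependencies (google-apitools) are missing. Names here are hypothetical.
try:
    import a_module_that_is_not_installed as gcsio  # stand-in for the real import
except ImportError:
    gcsio = None  # the failure is swallowed here, so nothing is reported yet


def glob(path):
    # The problem only shows up later, when the None placeholder is used.
    return gcsio.GcsIO().glob(path)


try:
    glob("gs://some-bucket/*")
except AttributeError as exc:
    message = str(exc)
    print(message)
```

Because the import failure is caught silently, the first visible symptom is the attribute lookup on `None`, which matches the last line of the traceback in the question.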

Try running pip install google-apitools in your virtualenv, then re-run the pipeline (note that you will need Google Cloud credentials available on your system).
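A quick way to confirm the install took effect in the active virtualenv is to check that the package can be imported before re-running the pipeline. The sketch below assumes that google-apitools exposes a top-level module named `apitools`. The helper `is_module_available` is a hypothetical name introduced for illustration.

```python
import importlib


def is_module_available(name):
    """Return True if the named module can be imported in this interpreter."""
    try:
        importlib.import_module(name)
    except ImportError:
        return False
    return True


# Assumption: google-apitools installs a top-level module called "apitools".
print("apitools importable: %s" % is_module_available("apitools"))
```

If this prints False inside the virtualenv you use to launch the pipeline, the package was probably installed into a different interpreter. On later Beam releases the GCP dependencies are, to my knowledge, pulled in by installing the `apache-beam[gcp]` extra instead.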

Original question on Stack Overflow: https://stackoverflow.com/questions/42473961/
