python - 无法在 PyCharm 上安装 pyspark

标签 python pyspark pycharm

我尝试在 PyCharm 上安装 pyspark 但最终出现以下错误:

Collecting pyspark
  Using cached https://files.pythonhosted.org/packages/37/98/244399c0daa7894cdf387e7007d5e8b3710a79b67f3fd991c0b0b644822d/pyspark-2.4.3.tar.gz

    ERROR: Complete output from command python setup.py egg_info:
    ERROR: Could not import pypandoc - required to package PySpark
    Download error on https://pypi.org/simple/pypandoc/: [Errno 60] Operation timed out -- Some packages may not be found!
    Couldn't find index page for 'pypandoc' (maybe misspelled?)
    Download error on https://pypi.org/simple/: [Errno 60] Operation timed out -- Some packages may not be found!
    No local packages or working download links found for pypandoc
    Traceback (most recent call last):
      File "<string>", line 1, in <module>
      File "/private/var/folders/m6/x3v1x7255lv_w_vsp0zz_jw8hvmgc7/T/pycharm-packaging/pyspark/setup.py", line 224, in <module>
        'Programming Language :: Python :: Implementation :: PyPy']
      File "/usr/local/lib/python3.7/site-packages/setuptools/__init__.py", line 144, in setup
        _install_setup_requires(attrs)
      File "/usr/local/lib/python3.7/site-packages/setuptools/__init__.py", line 139, in _install_setup_requires
        dist.fetch_build_eggs(dist.setup_requires)
      File "/usr/local/lib/python3.7/site-packages/setuptools/dist.py", line 717, in fetch_build_eggs
        replace_conflicting=True,
      File "/usr/local/lib/python3.7/site-packages/pkg_resources/__init__.py", line 782, in resolve
        replace_conflicting=replace_conflicting
      File "/usr/local/lib/python3.7/site-packages/pkg_resources/__init__.py", line 1065, in best_match
        return self.obtain(req, installer)
      File "/usr/local/lib/python3.7/site-packages/pkg_resources/__init__.py", line 1077, in obtain
        return installer(requirement)
      File "/usr/local/lib/python3.7/site-packages/setuptools/dist.py", line 784, in fetch_build_egg
        return cmd.easy_install(req)
      File "/usr/local/lib/python3.7/site-packages/setuptools/command/easy_install.py", line 673, in easy_install
        raise DistutilsError(msg)
    distutils.errors.DistutilsError: Could not find suitable distribution for Requirement.parse('pypandoc')
    ----------------------------------------
ERROR: Command "python setup.py egg_info" failed with error code 1 in /private/var/folders/m6/x3v1x7255lv_w_vsp0zz_jw8hvmgc7/T/pycharm-packaging/pyspark/

我尝试通过首选项 -> 项目解释器 -> 单击“+” -> 搜索 pyspark -> 单击安装包来安装它

已安装相关软件包:

pip v19.1.1(最新) setuptools v41.0.1(最新) ez-setup v0.9(最新)

请注意,我可以安装其他软件包。

有人可以在这里解释一下吗?

最佳答案

如错误所示:

No local packages or working download links found for pypandoc

您应该首先安装pypandoc,然后尝试安装pyspark软件包。

关于python - 无法在 PyCharm 上安装 pyspark,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/56024901/

相关文章:

python-3.x - py4JJava 错误 - 使用 select 语句时出错

python - 多个 pyspark "window()"调用在执行 "groupBy()"时显示错误

python - 从另一个 python 文件导入对象作为列表

pycharm ctrl pageup/pagedown 在打开的文件之间切换

pycharm - 如何将 Pycharm 'copy' 和 'paste' 函数更改为 ctrl+c 和 ctrl+v?

python - 在 Mac OS X 上安装 libpq-dev

python - 标签的加权聚类

python - 为什么当 quizform.ans1 在访问动态创建的表单时工作时 getattr(quizform, 'ans1' ) 会失败?

python - 在 Django 中使用 '_id'

python - PySpark 应用程序失败,出现 java.lang.OutOfMemoryError : Java heap space