我已经通过VMware在ubuntu 14上安装了hadoop和spark。我正在尝试在独立模式下在spark / examples / ...中运行wordcount的python脚本,但是它给出了语法错误。
./bin/spark-submit --master yarn --deploy-mode client --executor-memory 2g usr/local/spark/examples/src/main/python/wordcount.py '/usr/local/spark/README.md'
File "<stdin>", line 1
./bin/spark-submit --master yarn --deploy-mode client --executor-memory 1g
/usr/local/spark/examples/src/main/python/wordcount.py '/usr/local/README.md'
^
SyntaxError: invalid syntax
我是Spark的初学者,请告诉我如何解决它。
最佳答案
wordcount.py
需要两个输入参数,请参见here
关于python - 提交pyspark作业时出现语法错误,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/41039178/