I've been trying to set up Nutch 2.3 with Elasticsearch 5.4. The problem is in Nutch, as I can't get it to inject my URLs. The Hadoop log shows the following warning:
Console:
aurora apache-nutch-2.3.1 # runtime/local/bin/nutch inject urls/seed.txt
InjectorJob: starting at 2017-06-14 17:08:28
InjectorJob: Injecting urlDir: urls/seed.txt
**It hangs here**
and
Hadoop log:
aurora apache-nutch-2.3.1 # cat runtime/local/logs/hadoop.log
2017-06-14 17:08:28,339 INFO crawl.InjectorJob - InjectorJob: starting at 2017-06-14 17:08:28
2017-06-14 17:08:28,340 INFO crawl.InjectorJob - InjectorJob: Injecting urlDir: urls/seed.txt
2017-06-14 17:08:28,992 WARN util.NativeCodeLoader - Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
I have already tried setting my Hadoop environment variables as described in this thread (Hadoop "Unable to load native-hadoop library for your platform" warning), but I still get the same warning.
Any ideas?
Regarding hadoop - Apache Nutch 2.3: won't inject urls (hangs) & hadoop log shows warning, a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/44708292/