java - 在 Google Dataproc Spark 中链接原生 .so

标签 java apache-spark google-cloud-dataproc freeling

我有一个本地库 (Freeling),我使用 cmake 编译并本地制作并通过集群启动操作安装(因此,它应该存在于 master 和每个 worker 中)

即便如此,我在调用 System.loadLibrary 时仍收到此错误

Exception in thread "main" java.lang.UnsatisfiedLinkError: no Jfreeling in java.library.path
    at java.lang.ClassLoader.loadLibrary(ClassLoader.java:1867)
    at java.lang.Runtime.loadLibrary0(Runtime.java:870)
    at java.lang.System.loadLibrary(System.java:1122)

我尝试使用以下属性让程序找到库(它在静态代码块中调用)

      "properties": {
        "spark.driver.extraClassPath": "/usr/local/share/freeling/APIs/java/Jfreeling.jar:/usr/local/lib/libfreeling.so",
        "spark.executor.extraClassPath": "/usr/local/share/freeling/APIs/java/Jfreeling.jar:/usr/local/lib/libfreeling.so",
        "spark.executor.extraLibraryPath": "/usr/local/lib/libfreeling.so",
        "spark.driver.extraLibraryPath": "/usr/local/lib/libfreeling.so",
        "spark.executorEnv.LD_PRELOAD": "/usr/local/lib/libfreeling.so",
        "spark.yarn.dist.files": "/usr/local/lib/libfreeling.so",
        "spark.yarn.appMasterEnv.LD_PRELOAD": "libfreeling.so",
        "spark.files": "/usr/local/lib/libfreeling.so",
        "spark.executorEnv.LD_LIBRARY_PATH": "libfreeling.so"
      },
      "jarFileUris": [
        "file:///usr/local/share/freeling/APIs/java/Jfreeling.jar",
        "file:///usr/local/lib/libfreeling.so"
      ],

最佳答案

你能试着把你的库放在 /usr/lib/hadoop/lib/native/ 下吗?在/etc/spark/conf/spark-env.sh中,它有

# Spark got rid of SPARK_LIBRARY_PATH in 1.0
# It has properties for extraLibraryPaths, but this is more extensible
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:${HADOOP_HOME}/lib/native

关于java - 在 Google Dataproc Spark 中链接原生 .so,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/57149660/

相关文章:

python - 运行PySpark命令时出错

apache-spark - GCP Dataproc:使用存储连接器的网络带宽不足

scala - 在大量分区上处理 upsert 不够快

java - 如何对数组列表进行计数?

java - 通过编程语言访问 Skype

java - 结果集未正确定位

scala - 如何在连接中将 Column.isin 与数组列一起使用?

google-cloud-platform - GCP : You do not have sufficient permissions to SSH into this instance

google-cloud-dataproc - Dataproc 中的 Hive 授权

Java 泛型 : Accessing Generic Type at runtime