apache-spark - 加载包时禁止来自 spark-submit 的消息

标签 apache-spark ivy spark-submit

如果你试试这个:

spark-submit \
  --packages "org.apache.hadoop:hadoop-aws:2.7.4" \
  pyspark-example.py

您将获得大量噪声输出为 spark-submit解决了 hadoop-aws 的所有依赖项打包并下载它们。如果包已经下载,你得到的输出会稍微减少,但仍然很多:
org.apache.hadoop:hadoop-aws:2.7.4 pyspark-example.py
Ivy Default Cache set to: /home/ec2-user/.ivy2/cache
The jars for the packages stored in: /home/ec2-user/.ivy2/jars
:: loading settings :: url = jar:file:/hadoop/spark/spark-2.2.1-bin-hadoop2.7/jars/ivy-2.4.0.jar!/org/apache/ivy/core/settings/ivysettings.xml
org.apache.hadoop#hadoop-aws added as a dependency
:: resolving dependencies :: org.apache.spark#spark-submit-parent;1.0
    confs: [default]
    found org.apache.hadoop#hadoop-aws;2.7.4 in central
    found org.apache.hadoop#hadoop-common;2.7.4 in central
    found org.apache.hadoop#hadoop-annotations;2.7.4 in central
    found com.google.guava#guava;11.0.2 in central
    found com.google.code.findbugs#jsr305;3.0.0 in central
    found commons-cli#commons-cli;1.2 in central
    found org.apache.commons#commons-math3;3.1.1 in central
    found xmlenc#xmlenc;0.52 in central
    found commons-httpclient#commons-httpclient;3.1 in central
    found commons-logging#commons-logging;1.1.3 in central
    found commons-codec#commons-codec;1.4 in central
    found commons-io#commons-io;2.4 in central
    found commons-net#commons-net;3.1 in central
    found commons-collections#commons-collections;3.2.2 in central
    found javax.servlet#servlet-api;2.5 in central
    found org.mortbay.jetty#jetty;6.1.26 in central
    found org.mortbay.jetty#jetty-util;6.1.26 in central
    found org.mortbay.jetty#jetty-sslengine;6.1.26 in central
    found com.sun.jersey#jersey-core;1.9 in central
    found com.sun.jersey#jersey-json;1.9 in central
    found org.codehaus.jettison#jettison;1.1 in central
    found com.sun.xml.bind#jaxb-impl;2.2.3-1 in central
    found javax.xml.bind#jaxb-api;2.2.2 in central
    found javax.xml.stream#stax-api;1.0-2 in central
    found javax.activation#activation;1.1 in central
    found org.codehaus.jackson#jackson-core-asl;1.9.13 in central
    found org.codehaus.jackson#jackson-mapper-asl;1.9.13 in central
    found org.codehaus.jackson#jackson-jaxrs;1.9.13 in central
    found org.codehaus.jackson#jackson-xc;1.9.13 in central
    found com.sun.jersey#jersey-server;1.9 in central
    found asm#asm;3.2 in central
    found log4j#log4j;1.2.17 in central
    found net.java.dev.jets3t#jets3t;0.9.0 in central
    found org.apache.httpcomponents#httpclient;4.2.5 in central
    found org.apache.httpcomponents#httpcore;4.2.5 in central
    found com.jamesmurty.utils#java-xmlbuilder;0.4 in central
    found commons-lang#commons-lang;2.6 in central
    found commons-configuration#commons-configuration;1.6 in central
    found commons-digester#commons-digester;1.8 in central
    found commons-beanutils#commons-beanutils;1.7.0 in central
    found commons-beanutils#commons-beanutils-core;1.8.0 in central
    found org.slf4j#slf4j-api;1.7.10 in central
    found org.apache.avro#avro;1.7.4 in central
    found com.thoughtworks.paranamer#paranamer;2.3 in central
    found org.xerial.snappy#snappy-java;1.0.4.1 in central
    found org.apache.commons#commons-compress;1.4.1 in central
    found org.tukaani#xz;1.0 in central
    found com.google.protobuf#protobuf-java;2.5.0 in central
    found com.google.code.gson#gson;2.2.4 in central
    found org.apache.hadoop#hadoop-auth;2.7.4 in central
    found org.apache.directory.server#apacheds-kerberos-codec;2.0.0-M15 in central
    found org.apache.directory.server#apacheds-i18n;2.0.0-M15 in central
    found org.apache.directory.api#api-asn1-api;1.0.0-M20 in central
    found org.apache.directory.api#api-util;1.0.0-M20 in central
    found org.apache.zookeeper#zookeeper;3.4.6 in central
    found org.slf4j#slf4j-log4j12;1.7.10 in central
    found io.netty#netty;3.6.2.Final in central
    found org.apache.curator#curator-framework;2.7.1 in central
    found org.apache.curator#curator-client;2.7.1 in central
    found com.jcraft#jsch;0.1.54 in central
    found org.apache.curator#curator-recipes;2.7.1 in central
    found org.apache.htrace#htrace-core;3.1.0-incubating in central
    found org.mortbay.jetty#servlet-api;2.5-20081211 in central
    found javax.servlet.jsp#jsp-api;2.1 in central
    found jline#jline;0.9.94 in central
    found junit#junit;4.11 in central
    found org.hamcrest#hamcrest-core;1.3 in central
    found com.fasterxml.jackson.core#jackson-databind;2.2.3 in central
    found com.fasterxml.jackson.core#jackson-annotations;2.2.3 in central
    found com.fasterxml.jackson.core#jackson-core;2.2.3 in central
    found com.amazonaws#aws-java-sdk;1.7.4 in central
    found joda-time#joda-time;2.9.9 in central
    [2.9.9] joda-time#joda-time;[2.2,)
:: resolution report :: resolve 2170ms :: artifacts dl 65ms
    :: modules in use:
    asm#asm;3.2 from central in [default]
    com.amazonaws#aws-java-sdk;1.7.4 from central in [default]
    com.fasterxml.jackson.core#jackson-annotations;2.2.3 from central in [default]
    com.fasterxml.jackson.core#jackson-core;2.2.3 from central in [default]
    com.fasterxml.jackson.core#jackson-databind;2.2.3 from central in [default]
    com.google.code.findbugs#jsr305;3.0.0 from central in [default]
    com.google.code.gson#gson;2.2.4 from central in [default]
    com.google.guava#guava;11.0.2 from central in [default]
    com.google.protobuf#protobuf-java;2.5.0 from central in [default]
    com.jamesmurty.utils#java-xmlbuilder;0.4 from central in [default]
    com.jcraft#jsch;0.1.54 from central in [default]
    com.sun.jersey#jersey-core;1.9 from central in [default]
    com.sun.jersey#jersey-json;1.9 from central in [default]
    com.sun.jersey#jersey-server;1.9 from central in [default]
    com.sun.xml.bind#jaxb-impl;2.2.3-1 from central in [default]
    com.thoughtworks.paranamer#paranamer;2.3 from central in [default]
    commons-beanutils#commons-beanutils;1.7.0 from central in [default]
    commons-beanutils#commons-beanutils-core;1.8.0 from central in [default]
    commons-cli#commons-cli;1.2 from central in [default]
    commons-codec#commons-codec;1.4 from central in [default]
    commons-collections#commons-collections;3.2.2 from central in [default]
    commons-configuration#commons-configuration;1.6 from central in [default]
    commons-digester#commons-digester;1.8 from central in [default]
    commons-httpclient#commons-httpclient;3.1 from central in [default]
    commons-io#commons-io;2.4 from central in [default]
    commons-lang#commons-lang;2.6 from central in [default]
    commons-logging#commons-logging;1.1.3 from central in [default]
    commons-net#commons-net;3.1 from central in [default]
    io.netty#netty;3.6.2.Final from central in [default]
    javax.activation#activation;1.1 from central in [default]
    javax.servlet#servlet-api;2.5 from central in [default]
    javax.servlet.jsp#jsp-api;2.1 from central in [default]
    javax.xml.bind#jaxb-api;2.2.2 from central in [default]
    javax.xml.stream#stax-api;1.0-2 from central in [default]
    jline#jline;0.9.94 from central in [default]
    joda-time#joda-time;2.9.9 from central in [default]
    junit#junit;4.11 from central in [default]
    log4j#log4j;1.2.17 from central in [default]
    net.java.dev.jets3t#jets3t;0.9.0 from central in [default]
    org.apache.avro#avro;1.7.4 from central in [default]
    org.apache.commons#commons-compress;1.4.1 from central in [default]
    org.apache.commons#commons-math3;3.1.1 from central in [default]
    org.apache.curator#curator-client;2.7.1 from central in [default]
    org.apache.curator#curator-framework;2.7.1 from central in [default]
    org.apache.curator#curator-recipes;2.7.1 from central in [default]
    org.apache.directory.api#api-asn1-api;1.0.0-M20 from central in [default]
    org.apache.directory.api#api-util;1.0.0-M20 from central in [default]
    org.apache.directory.server#apacheds-i18n;2.0.0-M15 from central in [default]
    org.apache.directory.server#apacheds-kerberos-codec;2.0.0-M15 from central in [default]
    org.apache.hadoop#hadoop-annotations;2.7.4 from central in [default]
    org.apache.hadoop#hadoop-auth;2.7.4 from central in [default]
    org.apache.hadoop#hadoop-aws;2.7.4 from central in [default]
    org.apache.hadoop#hadoop-common;2.7.4 from central in [default]
    org.apache.htrace#htrace-core;3.1.0-incubating from central in [default]
    org.apache.httpcomponents#httpclient;4.2.5 from central in [default]
    org.apache.httpcomponents#httpcore;4.2.5 from central in [default]
    org.apache.zookeeper#zookeeper;3.4.6 from central in [default]
    org.codehaus.jackson#jackson-core-asl;1.9.13 from central in [default]
    org.codehaus.jackson#jackson-jaxrs;1.9.13 from central in [default]
    org.codehaus.jackson#jackson-mapper-asl;1.9.13 from central in [default]
    org.codehaus.jackson#jackson-xc;1.9.13 from central in [default]
    org.codehaus.jettison#jettison;1.1 from central in [default]
    org.hamcrest#hamcrest-core;1.3 from central in [default]
    org.mortbay.jetty#jetty;6.1.26 from central in [default]
    org.mortbay.jetty#jetty-sslengine;6.1.26 from central in [default]
    org.mortbay.jetty#jetty-util;6.1.26 from central in [default]
    org.mortbay.jetty#servlet-api;2.5-20081211 from central in [default]
    org.slf4j#slf4j-api;1.7.10 from central in [default]
    org.slf4j#slf4j-log4j12;1.7.10 from central in [default]
    org.tukaani#xz;1.0 from central in [default]
    org.xerial.snappy#snappy-java;1.0.4.1 from central in [default]
    xmlenc#xmlenc;0.52 from central in [default]
    ---------------------------------------------------------------------
    |                  |            modules            ||   artifacts   |
    |       conf       | number| search|dwnlded|evicted|| number|dwnlded|
    ---------------------------------------------------------------------
    |      default     |   72  |   1   |   0   |   0   ||   72  |   0   |
    ---------------------------------------------------------------------
:: retrieving :: org.apache.spark#spark-submit-parent
    confs: [default]
    0 artifacts copied, 72 already retrieved (0kB/17ms)
hadoop-aws是一个比较常用的包,可以让 Spark 通过 S3A 与 S3 交互.每次有人跑spark-submit有了那个包裹,他们就会看到上面的文字墙。

除非出现问题,否则有没有办法让所有这些输出安静下来?解决方案 discussed here , 喜欢设置 log4j.rootCategory=ERROR , 似乎不影响上述输出。

最佳答案

从评论中提取:

Spark uses Ivy API ,应该可以通过在 Ivy 实例化之前调用以下命令来更改默认记录器

org.apache.ivy.util.Message.setDefaultLogger(new org.apache.ivy.util.DefaultMessageLogger(org.apache.ivy.util.Message.Message.MSG_WARN));

我在这里使用了警告,但它可以是 any of the message levels .

关于apache-spark - 加载包时禁止来自 spark-submit 的消息,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/49000342/

相关文章:

java - 无法解析..使用 ant 和 ivy 构建 red5 时解析会永远等待

java - 将ant ivy项目中的jar导入到maven中

apache-spark - Kubernetes 上的 Spark 提交,即使 Spark 作业完成后,执行程序 Pod 仍在运行。因此,资源不能免费用于新工作

scala - 未检测到项目中的 SLF4J : simplelogger. 属性

apache-spark - 使用 pyspark 分层采样

apache-spark - 老年代堆内存逐渐增加

branch - 有没有办法配置 Ivy 以从分支获取依赖项,而无需编辑每个相关依赖项?

apache-spark - spark-submit 中的 Spark : How to set spark. yarn.executor.memoryOverhead 属性

scala - 从数据帧列读取路径并从数据帧添加另一列

java - 如何使用数据集聚合 spark(java) 中的字符串计数