mysql - 我无法用 solr 索引我的数据库。我总是得到 `Indexing failed. Rolled back all changes.`

标签 mysql docker solr solr4 dataimporthandler

我无法使用 solr (4.0) 索引我的 mysql 数据库。我总是得到 Indexing failed。回滚所有更改。 我已经查看了以前的答案,但我无法弄清楚这是什么问题。我在 Docker(passenger phusion 和 ubuntu 14.04)容器中运行它。有任何想法吗?我已经花了几天时间试图弄清楚这一点。

docker 组成

version: '2'

services:
  search:
    env_file: .env
    build: .
    ports:
      - "8080:8080"
    volumes:
      - ~/.m2:/root/.m2
      # - ~/.solr:/data/solr/
      - ./docker:/home/app/docker
  db:
    env_file: .env
    image: mysql:5.6
    ports:
      - "3307:3306"

SOLR_HOME

ls -l /data/solr/collection1/conf/
-rw-r--r-- 1 tomcat7 tomcat7  7601 Oct 17 15:35 data-config.xml
-rw-r--r-- 1 tomcat7 tomcat7     0 Oct 17 15:35 dataimport.properties
-rw-r--r-- 1 tomcat7 tomcat7   561 Oct 17 15:35 log4j.properties
-rw-r--r-- 1 tomcat7 tomcat7   707 Oct 17 15:35 log4j.xml
-rw-r--r-- 1 tomcat7 tomcat7 12302 Oct 17 15:35 schema.xml
-rw-r--r-- 1 tomcat7 tomcat7   479 Oct 17 15:35 solrconfig-qf.xml
-rw-r--r-- 1 tomcat7 tomcat7 41383 Oct 17 15:35 solrconfig.xml
-rw-r--r-- 1 tomcat7 tomcat7   148 Oct 17 15:35 solrcore.properties
-rw-r--r-- 1 tomcat7 tomcat7   138 Oct 17 15:35 solrcore.properties.template

Solr 管理员

Solr Admin

data-config.xml https://gist.github.com/anonymous/ce99aa9277f0295a2a52768fb7866e6a

<dataConfig>
  <dataSource name="db" type="JdbcDataSource" driver="com.mysql.jdbc.Driver"
    url="jdbc:mysql://localhost:3306/datacite" user="root"
    password="" readonly="true"
    batchSize="-1" />
  <!-- for batchSize=-1 see DIH FAQ -->
  <dataSource name="field" type="FieldReaderDataSource" />
  <document>
    <!-- SOLR-2104 -->
    <!-- using delta import as proposed in http://wiki.apache.org/solr/DataImportHandlerDeltaQueryViaFullImport -->

Sorl 核心属性

mds.db.url=jdbc:mysql://localhost:3306/datacite?useUnicode=true&characterEncoding=UTF8
mds.db.user=datacite
mds.db.password=
mds.testprefix=10.5072

Solrconfig.xml

<!-- AutoCommit

     Perform a hard commit automatically under certain conditions.
     Instead of enabling autoCommit, consider using "commitWithin"
     when adding documents.

     http://wiki.apache.org/solr/UpdateXmlMessages

     maxDocs - Maximum number of documents to add since the last
               commit before automatically triggering a new commit.

     maxTime - Maximum amount of time in ms that is allowed to pass
               since a document was added before automaticly
               triggering a new commit.
     openSearcher - if false, the commit causes recent index changes
     to be flushed to stable storage, but does not cause a new
     searcher to be opened to make those changes visible.
  -->
 <autoCommit>
   <maxTime>15000</maxTime>
   <openSearcher>false</openSearcher>
 </autoCommit>

<!-- softAutoCommit is like autoCommit except it causes a
     'soft' commit which only ensures that changes are visible
     but does not ensure that data is synced to disk.  This is
     faster and more near-realtime friendly than a hard commit.
  -->
 <!--
   <autoSoftCommit>
     <maxTime>1000</maxTime>
   </autoSoftCommit>
  -->

<!-- Update Related Event Listeners

     Various IndexWriter related events can trigger Listeners to
     take actions.

     postCommit - fired after every commit or optimize command
     postOptimize - fired after every optimize command
  -->
<!-- The RunExecutableListener executes an external command from a
     hook such as postCommit or postOptimize.

     exe - the name of the executable to run
     dir - dir to use as the current working directory. (default=".")
     wait - the calling thread waits until the executable returns.
            (default="true")
     args - the arguments to pass to the program.  (default is none)
     env - environment variables to set.  (default is none)
  -->
<!-- This example shows how RunExecutableListener could be used
     with the script based replication...
     http://wiki.apache.org/solr/CollectionDistribution
  -->
<!--
   <listener event="postCommit" class="solr.RunExecutableListener">
     <str name="exe">solr/bin/snapshooter</str>
     <str name="dir">.</str>
     <bool name="wait">true</bool>
     <arr name="args"> <str>arg1</str> <str>arg2</str> </arr>
     <arr name="env"> <str>MYVAR=val1</str> </arr>
   </listener>
  -->

<!-- Enables a transaction log, currently used for real-time get.
     "dir" - the target directory for transaction logs, defaults to the
        solr data directory.  -->
<updateLog>
  <str name="dir">/data/solr/collection1/data</str>
</updateLog>

  <dataDir>/data/solr/collection1/data</dataDir>


  <lib dir="../../../contrib/dataimporthandler/lib/" regex=".*\.jar" />

  <lib dir="../../../dist/" regex="solr-dataimporthandler-\d.*\.jar" />

  <lib dir="../../../lib/" regex="mysql-connector-java-5.0.8-bin.jar" />

  <requestHandler name="/admin/dataimport" class="org.apache.solr.handler.dataimport.DataImportHandler">
    <lst name="defaults">
      <str name="config">data-config.xml</str>
    </lst>
    <lst name="invariants">
      <str name="db.url">jdbc:mysql://localhost:3306/datacite?useUnicode=true&amp;amp;characterEncoding=UTF8</str>
      <str name="db.user">root</str>
      <str name="db.password"></str>
      <str name="testprefix">10.5072</str>
    </lst>
  </requestHandler>

最佳答案

问题是我没有连接到 mysql 数据库。 mysql 数据库主机错误。感谢@MatsLindh,我弄明白了。

docker compose 中,您的服务 的名称是它们的主机 的名称。因此,在我的例子中,mysql 数据库主机的名称是 db(请参阅上面问题中的 docker compose)。我的错误是我在环境变量中将 mysql 数据库主机的名称设置为 $DB_HOST = localhost。

当我尝试从 solr 容器连接到 mysql 容器时,我意识到了。我不得不做

mysql --host=db --user=root --password= datacite

因为

mysql --host=localhost --user=root --password= datacite

没有工作。

就这样吧。 在 docker compose 中执行此操作时检查数据库主机的名称。

关于mysql - 我无法用 solr 索引我的数据库。我总是得到 `Indexing failed. Rolled back all changes.`,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/40091152/

相关文章:

MySql - 删除一行

ubuntu - 如何让mysql服务器停止作为slave启动

Docker 使用 ansible 'Docker login' 和 _json_key 登录 GCE

mongodb - 将 ObjectID 作为字符串导入

java - 标题中的句点在 SOLR 中不可搜索

php - Mysql DELETE 查询在 PMA 中有效,但在我的 PHP 脚本中无效

mysql - 如何编码日期以在 mysql 中进行高效查询

nginx - 如何在docker Registry v2中公开发布和推送要求登录的信息?

python - docker-py : attach to network before starting the container

solr - 在哪里可以找到 Apache Lucene/Solr 的性能基准