lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Yangrui Guo <guoyang...@gmail.com>
Subject Data import handler not indexing all data
Date Sat, 07 Nov 2015 23:21:52 GMT
Hello

I'm being troubled by solr's data import handler. My solr version is 5.3.1
and mysql is 5.5. I tried to index imdb data but found solr only partially
indexed. I ran "SELECT DISTINCT COUNT(*) FROM imdb.director" and the query
result was 1636549. However DIH only fetched and indexed 287041 rows. I
didn't see any error in the log. Why was this happening?

Here's my data-config.xml

<dataConfig>
<dataSource type="JdbcDataSource" driver="com.mysql.jdbc.Driver"
url="jdbc:mysql://localhost:3306/imdb" user="root" password="password" />
<document>
<entity name="director" transformer="RegexTransformer" query="SELECT
DISTINCT * FROM imdb.director">
<field name="id" column="id" />
<field name="content_type" column="content_type" />
</entity>
</document>
</dataConfig>

Yangrui Guo

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message