lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From KP Sanjailal <kpsanjai...@gmail.com>
Subject Indexing & Searching MySQL table with Hindi and English data
Date Thu, 17 May 2012 09:55:28 GMT
Hi,

I tried to setup indexing of MySQL tables in Apache Solr 3.6.

Everything works fine but text in Hindi script (only some 10% of total
records) not getting indexed properly.

A search with keyword in Hindi retrieve emptly result set.  Also a
retrieved hindi record displays junk characters.

The database tables contains bibliographical details of books such as
title, author, publisher, isbn, publishing place, series etc. and out of
the total records about 10% of records contains text in Hindi in title,
author, publisher fields.

Example:

*Search Results from MySQL using PHP*

   1.
<http://192.168.0.132/shared/biblio_view.php?bibid=26913&tab=opac>
  *Title:* सौर ऊर्जा Saur
oorja<http://192.168.0.132/shared/biblio_view.php?bibid=26913&tab=opac>
*Author(s):* विनोद कुमार मिश्र MISHRA (VK) *Material:* Books
**  **
*Search Results from Apache Solr (searched using keyword in English)*

  1.
<http://192.168.0.132/test/biblio_view.php?bibid=26913&tab=opac>
  *Title:* सौर ऊर्जा Saur
oorja<http://192.168.0.132/test/biblio_view.php?bibid=26913&tab=opac>
*Author(s):* विनोद कुमार मिश्र
MISHRA (VK) *
Material:* Books


How do I go about solving this language problem.

Thanks in advace.

K. P. Sanjailal
--

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message