lucene-solr-user mailing list archives

From Lance Norskog <goks...@gmail.com>
Subject Re: Wrong categorization with DIH
Date Fri, 20 Apr 2012 10:09:05 GMT
Working with the DIH is a little easier if you create a database view
and load from that. You can set all of the field names in the view and
see exactly what the DIH gets.
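
As a rough sketch of what that could look like: the view name
product_solr_view, the entity name, and an updated_at column on the
view are all assumptions here, not something taken from your setup.
The idea is a view over the product/category join that exposes one
uniquely named column per Solr field, with the entity pointed at the
view and each column mapped explicitly:

```xml
<!-- Sketch only. Assumes a hypothetical view "product_solr_view" that
     performs the product/category join and exposes exactly the columns
     id, title, pic_thumb, category, category_id and updated_at. -->
<entity name="product"
        query="SELECT id, title, pic_thumb, category, category_id
               FROM product_solr_view
               WHERE '${dataimporter.request.clean}' != 'false'
                  OR updated_at &gt; '${dataimporter.last_index_time}'">
  <!-- Explicit column-to-field mapping; the field names are assumed to
       exist in schema.xml under these names. -->
  <field column="id"          name="id"/>
  <field column="title"       name="title"/>
  <field column="pic_thumb"   name="pic_thumb"/>
  <field column="category"    name="category"/>
  <field column="category_id" name="category_id"/>
</entity>
```

With the join inside the view, the WHERE clause in the DIH query only
has to carry the delta condition, and selecting from the view in a SQL
client shows the same rows the DIH will receive.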

On Thu, Apr 19, 2012 at 10:11 AM, Ramo Karahasan
<ramo.karahasan@googlemail.com> wrote:
> Hi,
>
> Yes, I use every one of them.
>
> Thanks for your hint... I'll have a look at this and try to configure it
> correctly.
>
> Thank you,
> Ramo
>
> -----Original Message-----
> From: Jeevanandam Madanagopal [mailto:jeeva@myjeeva.com]
> Sent: Thursday, 19 April 2012 18:42
> To: solr-user@lucene.apache.org
> Subject: Re: Wrong categorization with DIH
>
> Ramo -
>
> Are you using all the selected columns from the query?
>
> select p.title as title, p.id, p.category_id, p.pic_thumb, c.name as
> category, c.id as category_id from product p, category c ...
>
> I see that the columns 'p.id', 'p.category_id' and 'p.pic_thumb' don't
> have an alias defined.
>
> Pointers:
> ------------
> - Select only the required fields in the SQL query
> - Ensure that each SQL alias name matches the field name in schema.xml
>      or
> - Define an explicit mapping for every column in the DIH config, as
> follows: <field column="SQL-COLUMN-NAME-HERE"
> name="SOLR-SCHEMA-ATTRIBUTE-NAME-HERE" />
>
> For details, see: http://wiki.apache.org/solr/DataImportHandler
>
> -Jeevanandam
>
>
> On Apr 19, 2012, at 9:37 PM, Ramo Karahasan wrote:
>
>> Hi,
>>
>> my config is just the following:
>>
>> <dataConfig>
>>  <dataSource type="JdbcDataSource"
>>              driver="com.mysql.jdbc.Driver"
>>              url="jdbc:mysql://xx/asdx"
>>              user="xxxx"
>>              password="xxxx"/>
>>  <document>
>>   <entity name="id"
>>            query="select p.title as title, p.id, p.category_id,
>> p.pic_thumb, c.name as category, c.id as category_id from product p,
>> category c WHERE p.category_id = c.id AND  '${dataimporter.request.clean}'
>> != 'false' OR updated_at > '${dataimporter.last_index_time}' ">
>>    </entity>
>>  </document>
>> </dataConfig>
>>
>> I'm doing it as described on:
>>
>> http://wiki.apache.org/solr/DataImportHandlerDeltaQueryViaFullImport
>>
>> Any ideas?
>>
>> Best regards,
>> Ramo
>>
>> -----Original Message-----
>> From: Jeevanandam Madanagopal [mailto:jeeva@myjeeva.com]
>> Sent: Thursday, 19 April 2012 17:44
>> To: solr-user@lucene.apache.org
>> Subject: Re: Wrong categorization with DIH
>>
>> Ramo -
>>
>> Please share your DIH configuration with us.
>>
>> -Jeevanandam
>>
>> On Apr 19, 2012, at 7:46 PM, Ramo Karahasan wrote:
>>
>>> Does anyone have an idea what's going wrong here?
>>>
>>> Thanks,
>>> Ramo
>>>
>>> -----Original Message-----
>>> From: Gora Mohanty [mailto:gora@mimirtech.com]
>>> Sent: Tuesday, 17 April 2012 11:34
>>> To: solr-user@lucene.apache.org
>>> Subject: Re: Wrong categorization with DIH
>>>
>>> On 17 April 2012 14:47, Ramo Karahasan
>>> <ramo.karahasan@googlemail.com>
>>> wrote:
>>>> Hi,
>>>>
>>>>
>>>>
>>>> I am currently facing the following issue:
>>>>
>>>> The following SQL statement, which is also used in Solr (DIH),
>>>> leads to a wrong categorization in Solr:
>>>>
>>>> select p.title as title, p.id, p.category_id, p.pic_thumb, c.name as
>>>> category, c.id as category_id from product p, category c WHERE
>>>> p.category_id = c.id AND p.id = 3091328
>>>>
>>>>
>>>>
>>>> In my SQL client, this returns:
>>>>
>>>> Apple MacBook Pro MD313D/A 33,8 cm (13,3 Zoll) Notebook (Intel Core
>>>> i5-2435M, 2,4GHz, 4GB RAM, 500GB HDD, Intel HD 3000, Mac OS),
>>>> 3091328, 1003,
>>>> http://m-d.ww.cdn.com/images/I/41teWbp-uAL._SL75_.jpg,
>>>> Computer,
>>>> 1003
>>>>
>>>>
>>>>
>>>> As you can see, category_id 1003 points to "Computer".
>>>>
>>>>
>>>>
>>>> Via the Solr admin search interface, I get the following result
>>>> when searching for
>>>> id:3091328
>>>>
>>>> <str name="category">Sport</str>
>>>>
>>>> <int name="category_id">1003</int>
>>> [...]
>>>
>>> Please share with us the rest of the DIH configuration file, i.e.,
>>> the part where these data are saved to the Solr index.
>>>
>>> Regards,
>>> Gora
>>>
>>
>>
>
>



-- 
Lance Norskog
goksron@gmail.com
