lucene-solr-user mailing list archives

From It-forum <it-fo...@meseo.fr>
Subject Re: Indexed data not searchable
Date Mon, 08 Apr 2013 16:05:48 GMT
Hi,

I use the DataImportHandler.

The current entity contains this:
<field column="id_product" name="id" />
<field column="quantity" name="inStock" />
<field column="reference" name="ref" />
<field column="supplier" name="brand" />
<field column="manufacturer" name="brand" />
<field column="name" name="brand" />
<field column="comptabible_model" regex="Piéce détachée pour ([\w 0-9éèêîûô]+) Modèle" sourceColName="description_short" />
<field column="version_model" regex="Modèle:([0-9a-zA-Zéèêîûô-]+),?" sourceColName="description_short" />

Data sample:
Piéce détachée pour Skimmer COFIES
Modèle:Premium-Design-Omega, Zipper5 Piéce détachée pour Régulateur de 
niveau modèle 3150
Modèle:3150 depuis 2003

Ideal result:
"name" => "Couvercle SK siglé - HAYWARD"
"manufacturer" => "HAYWARD"
"compatibility" => ["Skimmer COFIES"] -> ["Premium-Design-Omega", "Zipper5"]
                   ["Régulateur de niveau modèle 3150"] -> ["3150 depuis 2003"]
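
To make the target structure concrete, here is a minimal Python sketch that turns the data sample above into the "compatibility" mapping. It is only an illustration under assumptions: the pattern `BLOCK` and the function `parse_compatibility` are hypothetical names, the pattern is not the regex from the entity above, and the exact whitespace of the real description_short column is guessed from the sample.

```python
import re

# Sample text copied from the data sample above; the exact whitespace in the
# real description_short column is an assumption.
description_short = (
    "Piéce détachée pour Skimmer COFIES\n"
    "Modèle:Premium-Design-Omega, Zipper5 "
    "Piéce détachée pour Régulateur de niveau modèle 3150\n"
    "Modèle:3150 depuis 2003"
)

# Hypothetical pattern: a product name after the fixed prefix, then its
# comma-separated models after "Modèle:" (capital M, followed by a colon),
# ending at the next prefix or at the end of the text.
BLOCK = re.compile(
    r"Piéce détachée pour\s+(?P<product>.+?)\s+Modèle:(?P<models>.+?)"
    r"(?=\s*Piéce détachée pour|\s*$)",
    re.DOTALL,
)


def parse_compatibility(text):
    """Map each compatible product to the list of its models."""
    result = {}
    for match in BLOCK.finditer(text):
        models = [m.strip() for m in match.group("models").split(",") if m.strip()]
        result[match.group("product").strip()] = models
    return result


compatibility = parse_compatibility(description_short)
```

The match is case-sensitive on purpose, so the lowercase "modèle 3150" inside a product name is not mistaken for the "Modèle:" marker.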



I then want to be able to get all results for all products with HAYWARD
as manufacturer, then retrieve the list of all compatible products, and
finally the list of available models.
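
One way this kind of request could be expressed is a filtered query with field faceting. The Python sketch below only builds the request URL. The field names `manufacturer`, `compatible_model` and `version_model` are assumptions (they are not in my current schema), the helper `build_facet_query` is hypothetical, and the facet fields would probably need to be of type "string" so that facets return whole values rather than analyzed tokens.

```python
from urllib.parse import urlencode


def build_facet_query(base_url, manufacturer, rows=10):
    """Build a Solr /select URL: filter on one manufacturer, facet on two fields."""
    params = [
        ("q", "*:*"),
        # Assumed field name; quotes inside the value are not escaped here.
        ("fq", 'manufacturer:"%s"' % manufacturer),
        ("rows", str(rows)),
        ("facet", "true"),
        ("facet.field", "compatible_model"),  # assumed field name
        ("facet.field", "version_model"),     # assumed field name
        ("facet.mincount", "1"),
        ("wt", "json"),
    ]
    return base_url + "/select?" + urlencode(params)


url = build_facet_query("http://localhost:8983/solr", "HAYWARD")
```

The facet counts in the response would then give the list of compatible products and the list of models for the matching documents.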

schema.xml contains:
<field name="ref" type="string" indexed="true" stored="true" omitNorms="true" multiValued="false"/>
<field name="name" type="text_fr" indexed="true" stored="true" />
<field name="cat" type="text_fr" indexed="true" stored="true" multiValued="true" />
<field name="brand" type="text_fr" indexed="true" stored="true" multiValued="true" />
<field name="features" type="text_fr" indexed="true" stored="true" multiValued="true" />
where

<fieldType name="text_fr" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.StandardTokenizerFactory" />
    <!-- removes l', etc -->
    <filter class="solr.ElisionFilterFactory" ignoreCase="true" articles="lang/contractions_fr.txt" />
    <filter class="solr.LowerCaseFilterFactory" />
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="lang/stopwords_fr.txt" format="snowball" enablePositionIncrements="true" />
    <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="1" catenateNumbers="1" catenateAll="0" splitOnCaseChange="1"/>
    <filter class="solr.StandardFilterFactory" />
    <filter class="solr.FrenchLightStemFilterFactory" />
    <filter class="solr.FrenchMinimalStemFilterFactory" />
    <!-- more aggressive: <filter class="solr.SnowballPorterFilterFactory" language="French"/> -->
  </analyzer>
</fieldType>

<fieldType name="text_html_fr" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <charFilter class="solr.HTMLStripCharFilterFactory" />
    <tokenizer class="solr.StandardTokenizerFactory" />
    <!-- removes l', etc -->
    <filter class="solr.ElisionFilterFactory" ignoreCase="true" articles="lang/contractions_fr.txt" />
    <filter class="solr.LowerCaseFilterFactory" />
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="lang/stopwords_fr.txt" format="snowball" enablePositionIncrements="true" />
    <filter class="solr.ASCIIFoldingFilterFactory" />
    <filter class="solr.FrenchLightStemFilterFactory" />
    <filter class="solr.FrenchMinimalStemFilterFactory" />
    <filter class="solr.SnowballPorterFilterFactory" language="French"/>
    <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
  </analyzer>
</fieldType>


I do not see how to model this specification correctly in Solr.

Regards,

Eric

On 08/04/2013 16:36, Gora Mohanty wrote:
> On 8 April 2013 19:26, Max Bo <maximilian.brodhun@gmail.com> wrote:
>> Thanks for your help:
>>
>> The URL I am posting to is: http://localhost:8983/solr/update?commit=true
>>
>>
>> The XML files I've added contain fields like "author", so I thought they
>> would be searchable, since it is declared as "indexed" in the example schema.
> Please include an example of your .xml file and of Solr's
> schema.xml. It is difficult to keep guessing in the dark.
>
> Regards,
> Gora

