Return-Path: Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: (qmail 10622 invoked from network); 1 Apr 2011 15:31:58 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 1 Apr 2011 15:31:58 -0000 Received: (qmail 32381 invoked by uid 500); 1 Apr 2011 15:31:55 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 32340 invoked by uid 500); 1 Apr 2011 15:31:55 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 32332 invoked by uid 99); 1 Apr 2011 15:31:55 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 01 Apr 2011 15:31:55 +0000 X-ASF-Spam-Status: No, hits=2.2 required=5.0 tests=HTML_MESSAGE,NORMAL_HTTP_TO_IP,RCVD_IN_DNSWL_LOW,SPF_NEUTRAL,T_FILL_THIS_FORM_SHORT X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [209.85.214.194] (HELO mail-iw0-f194.google.com) (209.85.214.194) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 01 Apr 2011 15:31:50 +0000 Received: by iwc10 with SMTP id 10so826475iwc.5 for ; Fri, 01 Apr 2011 08:31:29 -0700 (PDT) MIME-Version: 1.0 Received: by 10.42.142.135 with SMTP id s7mr3286723icu.247.1301671888672; Fri, 01 Apr 2011 08:31:28 -0700 (PDT) Received: by 10.42.220.197 with HTTP; Fri, 1 Apr 2011 08:31:28 -0700 (PDT) In-Reply-To: References: Date: Fri, 1 Apr 2011 11:31:28 -0400 Message-ID: Subject: Re: Problems with DIH and missing fields. From: Marcelo Iturbe To: solr-user@lucene.apache.org Content-Type: multipart/alternative; boundary=90e6ba6e819089d811049fdd1a38 --90e6ba6e819089d811049fdd1a38 Content-Type: text/plain; charset=ISO-8859-1 Solved it! commonField="true" should be commonField="false" mistakes that happen when copying source a sample proyect... Thanks for your help. On Fri, Apr 1, 2011 at 10:29 AM, Marcelo Iturbe wrote: > > Hello, > I was able to repeat this behaviour in Solr 3.1.0 > > The procedure is > - rename the directory example-DIH/rss to example-DIH/gcontacts > - modify solrconfig.xml to only load gcontacts > - rename rss-data-config.xml to gcontacts-data-config.xml and modify (see > content below) > - modify schema.xml > > This is from my schema.xml > > /> > > > > > multiValued="true" default=" "/> > multiValued="true" default=" "/> > multiValued="true" default=" "/> > > multiValued="true" default=" "/> > > multiValued="true" /> > > > > > > > this is my gcontacts-data-config.xml file > > > > pk="link" > url="http://172.16.0.30/sayt2/contacts/testtim.xml" > processor="XPathEntityProcessor" > forEach="/feed/entry" > > > > commonField="true" /> > xpath="/feed/entry/link[@rel='edit']/@href" commonField="true" /> > > commonField="true"/> > xpath="/feed/entry/link[@rel='edit']/@href" /> > commonField="true"/> > commonField="true"/> > commonField="true"/> > xpath="/feed/entry/postalAddress" commonField="true"/> > > > > > This is from my solrconfig.xml file > > > > > > > > Thanks for your help. > > Regards > > > On Fri, Apr 1, 2011 at 4:27 AM, Stefan Matheis < > matheis.stefan@googlemail.com> wrote: > >> Marcelo, >> >> could you paste the relevant parts of your DIH config? >> >> Regards >> Stefan >> >> On Thu, Mar 31, 2011 at 9:55 PM, Marcelo Iturbe >> wrote: >> > Hello, >> > I have an XML which contains personal contacts. Not all contacts have >> the >> > same fields (email, phone, postal). >> > >> > The problem is that when certain fields are NOT present, SOLR is >> injecting >> > the previous contacts data. >> > >> > For example, assume the following from the XML feed: >> > >> > Jane Doe >> > >> > Santiago >> > Region Metropolitana >> > Chile >> > >> > >> > Jeff Smith >> > >> > >> > >> > Ana Mercurio >> > > > primary='true'>+56912345678 >> > >> > >> > The second contact, will have the first contacts postal address. >> > The third contact, will have Janes Postal Address and Jeffs email >> address: >> > >> > >> > >> > Ana Mercurio >> > >> > >> > +5612345678 >> > >> > >> > jeff.smith@gmail.com >> > >> > >> > Santiago >> > Region Metropolitana >> > Chile >> > >> > >> > >> > This is how I have the fields specified in the schema.xml file: >> > > > multiValued="true" default=" "/> >> > > > multiValued="true" default=" "/> >> > > stored="true" >> > multiValued="true" default=" "/> >> > >> > What did I miss? >> > >> > Thanks for your help. >> > >> > > --90e6ba6e819089d811049fdd1a38--