lucene-solr-user mailing list archives

From Ameya Aware <ameya.aw...@gmail.com>
Subject Re: Query in metadata sent to Solr
Date Thu, 10 Jul 2014 18:15:59 GMT
Hi,

Please find the Solr log below:

INFO  - 2014-07-09 15:30:56.243;
org.apache.solr.update.processor.LogUpdateProcessor;
[collection1] webapp=/solr path=/update/extract params={literal.deny_token_document=DEAD_AUTHORITY&literal.DocIcon=docx&resource.name=Anarchism-201310091123505625.docx&literal.allow_token_document=S-1-5-21-1482846375-2276666860-3536682573-500&literal.allow_token_document=S-1-5-21-1482846375-2276666860-3536682573-68651&literal.FolderChildCount=0&version=2.2&literal.ItemChildCount=0&literal.GUID=Ameya&literal.ParentVersionString=&literal._CopySource=&literal.cat=&literal.FileSizeDisplay=1264155&literal._CheckinComment=&literal.Edit=0&literal.id=http://sharepointten:10800/sites/siteecho/Shared%2520Documents/Anarchism-201310091123505625.docx&literal.LinkFilenameNoMenu=Anarchism-201310091123505625.docx&literal.Created=2014-06-03+11:21:53&literal._UIVersionString=1.0&wt=xml&literal.Title=Anarchism&literal.Modified=2014-06-03+11:21:53&literal.Author=Sharepoint+Backup&literal.FileLeafRef=Anarchism-201310091123505625.docx&literal.LinkFilename=Anarchism-201310091123505625.docx&literal.lcf_metadata_id=81&literal.Editor=Administrator&literal.ParentLeafName=&literal.CheckoutUser=&literal.ContentType=Document


From the log above, we can see that MCF is
sending literal.Modified=2014-06-03+11:21:53, which is correct. But when I
updated my Solr schema file to change the field name from 'last_modified'
to 'modified',
I got this error: *multiple values encountered for non multiValued field
modified: [2014-06-03T00:00:00.000Z, 2013-10-09T11:24:25.000Z].*

So 2014-06-03 is coming from MCF, and 2013-10-09 might be coming from Tika.
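
To illustrate the suspected collision, here is a minimal Python sketch. It is a toy model, not Solr's actual code: it assumes that field names are lowercased with non-alphanumeric characters turned into underscores (as the lowernames option is documented to do), that unknown fields receive the uprefix, and that Tika emits a metadata key named "modified". That key name and the helper functions are hypothetical and used only for illustration.

```python
from collections import defaultdict

def map_field_name(name, lowernames=True):
    """Toy name normalisation: lowercase, non-alphanumerics become '_'."""
    if not lowernames:
        return name
    return "".join(c.lower() if c.isalnum() else "_" for c in name)

def build_document(literals, extracted, schema_fields, uprefix=None):
    """Merge literal.* values and extracted metadata into one document.

    Fields not declared in schema_fields get the uprefix, if one is set.
    """
    doc = defaultdict(list)
    for source in (literals, extracted):
        for name, value in source.items():
            field = map_field_name(name)
            if field not in schema_fields and uprefix is not None:
                field = uprefix + field
            doc[field].append(value)
    return dict(doc)

def conflicts(doc, single_valued):
    """Return single-valued fields that ended up with more than one value."""
    return [f for f, vals in doc.items() if f in single_valued and len(vals) > 1]

# Value sent by MCF as literal.Modified (taken from the error message).
literals = {"Modified": "2014-06-03T00:00:00.000Z"}
# Assumption: Tika extracts a key that also normalises to "modified".
extracted = {"modified": "2013-10-09T11:24:25.000Z"}

# Schema declares last_modified only: both values land in ignored_modified.
before = build_document(literals, extracted, {"id", "last_modified"}, "ignored_")
# Schema declares modified: both values now target the same real field.
after = build_document(literals, extracted, {"id", "modified"}, "ignored_")

print(before)
print(after)
print(conflicts(after, {"modified"}))
```

Under these assumptions, renaming the schema field to 'modified' is what exposes the second value: before the rename, both values would have been routed to an ignored field.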

So I do not think that, in my case, <str name="uprefix">ignored_</str> should be
removed from solrconfig.xml.

I just don't want Tika to send the value 2013-10-09. How can I manage that?

Please let me know in case of any concerns.


Thanks,
Ameya




On Thu, Jul 10, 2014 at 12:43 PM, Ahmet Arslan <iorixxx@yahoo.com.invalid>
wrote:

> Hi,
>
> Okay, then remove this line from solrconfig.xml:
>
> <str name="uprefix">ignored_</str>
>
> and explicitly define a field named Modified in schema.xml with
> stored=true.
>
> With this you should be able to see the Modified field. Please see the
> definition of the uprefix parameter:
> http://wiki.apache.org/solr/ExtractingRequestHandler#Input_Parameters
>
>
>
> By the way, when I work with solr-cell I add the following dynamic field
>
>  <dynamicField name="*" type="string" multiValued="true" stored="true"
> indexed="true" />
>
> for debugging purposes. This will show you all fields generated by solr-cell
> and MCF. I note the ones I want to keep and then define them in
> schema.xml. I ignore the rest using the uprefix trick. Of course, remove the
> dynamic field too at the very end.
>
> Ahmet
>
>
> On Thursday, July 10, 2014 5:20 PM, Ameya Aware <ameya.aware@gmail.com>
> wrote:
> Hi
>
> I am using the same solrconfig.xml that comes with the default Solr package.
>
> Do I need to make a change anywhere?
>
> Thanks,
> Ameya
>
>
>
>
>
> On Wed, Jul 9, 2014 at 5:59 PM, Ahmet Arslan <iorixxx@yahoo.com.invalid>
> wrote:
>
> > Hi,
> >
> > The field name sent with literal is Modified. In your screenshot, it is
> > last_modified. Do you use an fmap setting in solrconfig.xml?
> >
> > I think it would be better to send us the part of the solrconfig.xml file
> > where the Solr Cell handler is defined.
> >
> >
> > On Thursday, July 10, 2014 12:18 AM, Ameya Aware <ameya.aware@gmail.com>
> > wrote:
> >
> >
> >
> > Hi,
> >
> > Please have a look at the part below, taken from the solr.log file.
> >
> > INFO  - 2014-07-09 15:30:56.243;
> > org.apache.solr.update.processor.LogUpdateProcessor; [collection1]
> > webapp=/solr path=/update/extract
> > params={literal.deny_token_document=DEAD_AUTHORITY&literal.DocIcon=docx&resource.name=Anarchism-201310091123505625.docx&literal.allow_token_document=S-1-5-21-1482846375-2276666860-3536682573-500&literal.allow_token_document=S-1-5-21-1482846375-2276666860-3536682573-68651&literal.FolderChildCount=0&version=2.2&literal.ItemChildCount=0&literal.GUID=Ameya&literal.ParentVersionString=&literal._CopySource=&literal.cat=&literal.FileSizeDisplay=1264155&literal._CheckinComment=&literal.Edit=0&literal.id=http://sharepointten:10800/sites/siteecho/Shared%2520Documents/Anarchism-201310091123505625.docx&literal.LinkFilenameNoMenu=Anarchism-201310091123505625.docx&literal.Created=2014-06-03+11:21:53&literal._UIVersionString=1.0&wt=xml&literal.Title=Anarchism&literal.Modified=2014-06-03+11:21:53&literal.Author=Sharepoint+Backup&literal.FileLeafRef=Anarchism-201310091123505625.docx&literal.LinkFilename=Anarchism-201310091123505625.docx&literal.lcf_metadata_id=81&literal.Editor=Administrator&literal.ParentLeafName=&literal.CheckoutUser=&literal.
> >
> >
> > In the log, the last_modified date comes out as 2014-06-03, which is the
> > correct date.
> > But when I look at it in the Solr UI, it appears to be different.
> >
> > Please find attached screenshot for it.
> >
> > Can you please let me know the cause of this?
> >
> > Is that field being extracted by Solr Cell?
> >
> > Thanks,
> > Ameya
> >
>
