lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Walter Underwood <wun...@wunderwood.org>
Subject Re: Unstructured/Structured data for indexing
Date Wed, 09 Dec 2015 16:05:09 GMT
Often Solr documents are “semi-structured”. They have some structured fields and some free-text
fields. e-mail messages are like that, with structured headers and an unstructured body.

wunder
Walter Underwood
wunder@wunderwood.org
http://observer.wunderwood.org/  (my blog)


> On Dec 9, 2015, at 4:13 AM, Alexandre Rafalovitch <arafalov@gmail.com> wrote:
> 
> Don't think about indexing so much, think about searching.
> 
> Say you are searching a video? What does that mean? Do you want to
> match random sequence of binary values that represent inter-frame
> change? Probably not. When you answer what you want to actually search
> (title? length? subscripts?), you will discover that structure. What
> do you want to return? A whole video, a segment, a description with a
> link?
> 
> So, you pre-process/index your data to give you the things you want to
> search for and in the form you want them to receive.
> 
> Regards,
>   Alex.
> ----
> Newsletter and resources for Solr beginners and intermediates:
> http://www.solr-start.com/
> 
> 
> On 9 December 2015 at 03:09, subinalex <alexkutty19@gmail.com> wrote:
>> Hi,
>> 
>> I am a solr newbie,just got a quick question.
>> 
>> SOLR is designed for querying unstructured data,but then why we have to send
>> it in a structured form(json,xml) for indexing?.
>> 
>> Thanks & Regards,S
>> Subin
>> 
>> 
>> 
>> --
>> View this message in context: http://lucene.472066.n3.nabble.com/Unstructured-Structured-data-for-indexing-tp4244406.html
>> Sent from the Solr - User mailing list archive at Nabble.com.


Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message