lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From sam ” <skyn...@gmail.com>
Subject document inside document?
Date Mon, 26 Mar 2012 19:00:07 GMT
Hey,

I am making an image search engine where people can tag images with various
items that are themselves tagged.
For example, http://example.com/abc.jpg is tagged with the following three
items:
- item1 that is tagged with: tall blond woman
- item2 that is tagged with: yellow purse
- item3 that is tagged with: gucci red dress

Querying for +yellow +purse  will return the example image. But, querying
for +gucci +purse will not because the image does not have an item tagged
with both gucci and purse.

In addition to "items", each image has various metadata such as alt text,
location, description, photo credit.. etc  that should be available for
search.

How should I write my schema.xml ?
If imageUrl is primary key, do I implement my own fieldType for items, so
that I can write:
<field name="items" type="myItemType" multiValued="true"/>
What would myItemType look like so that solr would know the example image
will not be part of the query, +gucci +purse??

If itemId is primary key, I can use result grouping (
http://wiki.apache.org/solr/FieldCollapsing). But, I need to repeat alt
text and other image metadata for each item.

Or, should I create different schema for item search and metadata search?

Thanks.
Sam.

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message