Message-ID: <1058785119.1235388902403.JavaMail.jira@brutus>
Date: Mon, 23 Feb 2009 03:35:02 -0800 (PST)
From: "Fergus McMenemie (JIRA)"
To: solr-dev@lucene.apache.org
Reply-To: solr-dev@lucene.apache.org
Subject: [jira] Issue Comment Edited: (SOLR-1033) DIH transformers cannot reuse output from previous transformations
In-Reply-To: <802130496.1235223122224.JavaMail.jira@brutus>

    [ https://issues.apache.org/jira/browse/SOLR-1033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12675856#action_12675856 ]

fergus edited comment on SOLR-1033
at 2/23/09 3:33 AM:
-----------------------------------------------------------------

OK here goes. My document contains references to embedded imagery. For each image there is the image itself along with a thumbnail and a caption. The source document contains:-

I have a search application that searches only the captions associated with a given image. It would be nice to populate Solr fields with the correct relative path to each image and its thumbnail at index time. The problem is that although the thumbnail is:

s${e.vurl}.jpg

the name of the image itself varies depending on the first letter of the image type (imageType). It could be one of 'picture', 'graphic', 'lineDrawing' or 'map', i.e.:-

p${e.vurl}.jpg
g${e.vurl}.jpg
l${e.vurl}.jpg
m${e.vurl}.jpg

My patch would allow the following sort of thing to be added to a data-config. I feel this considerably increases its power and usefulness.

{code}
{code}

> DIH transformers cannot reuse output from previous transformations
> ------------------------------------------------------------------
>
>                 Key: SOLR-1033
>                 URL: https://issues.apache.org/jira/browse/SOLR-1033
>             Project: Solr
>          Issue Type: Improvement
>          Components: contrib - DataImportHandler
>    Affects Versions: 1.4
>         Environment: All operating systems and software platforms
>            Reporter: Fergus McMenemie
>             Fix For: 1.4
>
>         Attachments: SOLR-1033.patch, SOLR-1033.patch
>
>   Original Estimate: 24h
>  Remaining Estimate: 24h
>
> It can be very useful to reuse the output from a DIH template in other templates and/or regex transformers. Currently this cannot be done. The resolver is initialized at the start of the transformer run with whatever values exist for a column name at that instant. As the transformer executes it may define new values for column names. My change is intended to update the hash used by the resolver after each successful transformation.
> This only applies to the template and regex transformers.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
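
To illustrate the resolver behaviour described in the issue summary, here is a minimal Java sketch. It is not DataImportHandler code and not the attached patch: the class TemplateChainSketch, its methods, the column names e.base and e.thumb and the value "0042" are all hypothetical, chosen only to show the difference between a variable map that is filled once at the start and one that is updated after each transformation.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration only; none of these names are DataImportHandler classes.
class TemplateChainSketch {
    // Matches placeholders of the form ${name}.
    private static final Pattern PLACEHOLDER = Pattern.compile("\\$\\{([^}]+)\\}");

    /** Replaces each ${name} with its value from vars; unknown names become "". */
    static String resolve(String template, Map<String, String> vars) {
        Matcher m = PLACEHOLDER.matcher(template);
        StringBuffer out = new StringBuffer();
        while (m.find()) {
            String value = vars.getOrDefault(m.group(1), "");
            m.appendReplacement(out, Matcher.quoteReplacement(value));
        }
        m.appendTail(out);
        return out.toString();
    }

    /**
     * Applies the templates in declaration order and returns the resulting row.
     * With refresh == false the variable map keeps only the values present at the
     * start of the run; with refresh == true it is updated after every template,
     * so later templates can see earlier outputs.
     */
    static Map<String, String> applyTemplates(Map<String, String> row,
                                              Map<String, String> templates,
                                              boolean refresh) {
        Map<String, String> vars = new LinkedHashMap<>(row);   // snapshot taken at the start
        Map<String, String> result = new LinkedHashMap<>(row);
        for (Map.Entry<String, String> t : templates.entrySet()) {
            String value = resolve(t.getValue(), vars);
            result.put(t.getKey(), value);
            if (refresh) {
                vars.put(t.getKey(), value);                    // make the new column visible
            }
        }
        return result;
    }

    public static void main(String[] args) {
        Map<String, String> row = new LinkedHashMap<>();
        row.put("e.vurl", "0042");                              // made-up sample value

        Map<String, String> templates = new LinkedHashMap<>();
        templates.put("e.base", "${e.vurl}.jpg");
        templates.put("e.thumb", "s${e.base}");                 // depends on the previous output

        System.out.println(applyTemplates(row, templates, false).get("e.thumb")); // prints "s"
        System.out.println(applyTemplates(row, templates, true).get("e.thumb"));  // prints "s0042.jpg"
    }
}
```

In this sketch the second template resolves to a bare "s" when the map is only a start-of-run snapshot, because e.base does not exist yet at that instant. With the map refreshed after each step it resolves to "s0042.jpg", which is the kind of reuse the issue asks for.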