Mailing-List: contact general-help@lucene.apache.org; run by ezmlm
Precedence: bulk
Reply-To: general@lucene.apache.org
Received-SPF: neutral (nike.apache.org: local policy)
Date: Sun, 27 Nov 2011 16:22:47 -0800 (PST)
From: Jan <jan.richter@dsto.defence.gov.au>
To: general@lucene.apache.org
Message-ID: <1322439767608-3541066.post@n3.nabble.com>
In-Reply-To: <alpine.DEB.2.00.1111231713400.11793@bester>
References: <1321508548770-3514857.post@n3.nabble.com>
 <alpine.DEB.2.00.1111231713400.11793@bester>
Subject: Re: Populating a custom Solr field with text extracted from
 document
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit

Thank you for your reply.

: what are you using to do the crawling?

I'm using Solr within LucidWorks Enterprise. As far as I know LucidWorks
provides a default crawler called Aperture so this is what I'm using.

Thank you also for describing a few of the options to tackle the problem. I
did consider writing some custom parsing code, but wanted to explore
existing options first rather than re-inventing the wheel. I've tinkered
with curl a bit and think that POSTing to Solr may be a suitable approach.

--
View this message in context: http://lucene.472066.n3.nabble.com/Populating-a-custom-Solr-field-with-text-extracted-from-document-tp3514857p3541066.html
Sent from the Lucene - General mailing list archive at Nabble.com.