Date: Mon, 29 Jan 2007 21:17:17 +0100
From: "Bertrand Delacretaz"
To: solr-user@lucene.apache.org
Subject: Re: How to Index Word, Excel, PDF files?

On 1/29/07, Leandro Saad wrote:

> ...I'd like to know if solr can index Word, Excel and PDF files or I must
> create a xml representation of those files matching my schema?...

Currently you must create the XML yourself outside of Solr.

This might change, see https://issues.apache.org/jira/browse/SOLR-104
and the recent related "update plugins" discussions.

-Bertrand
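
For illustration, here is a minimal sketch of what "creating the XML yourself" could look like once some external tool has already extracted the text from a Word, Excel or PDF file. It is not taken from the thread: the field names (id, title, text), the helper names and the update URL are all assumptions, and the fields have to match whatever your own schema defines.

```python
import urllib.request
import xml.etree.ElementTree as ET


def build_add_xml(doc_id, title, text):
    """Build a Solr <add> message for one document.

    The field names (id, title, text) are hypothetical; they must match
    the fields that the Solr schema in use actually defines.
    """
    add = ET.Element("add")
    doc = ET.SubElement(add, "doc")
    for name, value in (("id", doc_id), ("title", title), ("text", text)):
        field = ET.SubElement(doc, "field", name=name)
        field.text = value  # ElementTree escapes &, < and > when serializing
    return ET.tostring(add, encoding="utf-8")


def post_to_solr(payload, url="http://localhost:8983/solr/update"):
    """POST an XML update message; the URL is an assumed local install."""
    request = urllib.request.Request(
        url, data=payload, headers={"Content-Type": "text/xml; charset=utf-8"}
    )
    with urllib.request.urlopen(request) as response:
        return response.read()
```

With a Solr instance running at the assumed URL, something like post_to_solr(build_add_xml("doc-1", "Report", extracted_text)) would send the document, and a separate <commit/> message posted to the same URL makes it searchable. Extracting the text from the binary files is a separate step that this sketch does not cover.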