Return-Path: Delivered-To: apmail-lucene-general-archive@www.apache.org Received: (qmail 34216 invoked from network); 6 Sep 2007 13:58:44 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 6 Sep 2007 13:58:44 -0000 Received: (qmail 37120 invoked by uid 500); 6 Sep 2007 13:58:36 -0000 Delivered-To: apmail-lucene-general-archive@lucene.apache.org Received: (qmail 37102 invoked by uid 500); 6 Sep 2007 13:58:36 -0000 Mailing-List: contact general-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@lucene.apache.org Delivered-To: mailing list general@lucene.apache.org Received: (qmail 37091 invoked by uid 99); 6 Sep 2007 13:58:36 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 06 Sep 2007 06:58:36 -0700 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [208.69.42.181] (HELO radix.cryptio.net) (208.69.42.181) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 06 Sep 2007 13:59:52 +0000 Received: by radix.cryptio.net (Postfix, from userid 1007) id CFFC771C055; Thu, 6 Sep 2007 06:58:10 -0700 (PDT) Received: from localhost (localhost [127.0.0.1]) by radix.cryptio.net (Postfix) with ESMTP id 2AF7871C04A for ; Thu, 6 Sep 2007 06:58:10 -0700 (PDT) Date: Thu, 6 Sep 2007 06:58:10 -0700 (PDT) From: Chris Hostetter To: general@lucene.apache.org Subject: Re: How Index XML In-Reply-To: <12511507.post@talk.nabble.com> Message-ID: References: <12511507.post@talk.nabble.com> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII X-Virus-Checked: Checked by ClamAV on apache.org : Hi to all : : how i can index 1000 XML documents? : : exist some tool for make this? There are several Lucene based applications that might help you ... but it's hard to be sure without a better understanding of what it is you want to do, and how much control you wnat to have over the way it does it. I suggest you take a look at some of the various Lucene subprojects and decide if any of them meet your needs... http://lucene.apache.org/java/ - A library for writing your own application in Java http://lucene.apache.org/nutch/ - A web search engine for crawling documents on a web server http://lucene.apache.org/solr/ - A search server in which you explicitly feed documents to be indexed and decide how they are parsed. ...You should also take a look at the Lucene-Java PoewredBy page, which lists many applications (some free, some commercial) that are built using Lucene that may meet your needs... http://wiki.apache.org/lucene-java/PoweredBy -Hoss