Return-Path: Delivered-To: apmail-jackrabbit-users-archive@locus.apache.org Received: (qmail 62693 invoked from network); 29 Feb 2008 13:54:58 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 29 Feb 2008 13:54:58 -0000 Received: (qmail 43978 invoked by uid 500); 29 Feb 2008 13:54:52 -0000 Delivered-To: apmail-jackrabbit-users-archive@jackrabbit.apache.org Received: (qmail 43956 invoked by uid 500); 29 Feb 2008 13:54:52 -0000 Mailing-List: contact users-help@jackrabbit.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@jackrabbit.apache.org Delivered-To: mailing list users@jackrabbit.apache.org Received: (qmail 43947 invoked by uid 99); 29 Feb 2008 13:54:52 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 29 Feb 2008 05:54:52 -0800 X-ASF-Spam-Status: No, hits=2.0 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of seancallan@gmail.com designates 64.233.178.244 as permitted sender) Received: from [64.233.178.244] (HELO hs-out-0708.google.com) (64.233.178.244) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 29 Feb 2008 13:54:16 +0000 Received: by hs-out-0708.google.com with SMTP id h53so3186862hsh.11 for ; Fri, 29 Feb 2008 05:54:25 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; bh=l1F8X3gq+0TGqRLcP6TzYOMhg3KZQS8FYGcXQ5jpBrs=; b=nYB1Hut+bgXqwvjcIOu9axiDGWlV7Deed8KA6G62wMcElHohPmUVKIRpfJBu8I1Ih2lF2nvRE2oNNMQNhcj3stgDOGqGc4aSZ/n6BoxXLN/4yq0g91oES//dgyEcDAU0IYab+Np1edQZkEpN6doUmfxVW/QGo4rq21Pt87w0tXM= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=lCESXPs/3NrCZLNL1/M82cf/xY3arr1c3jjv7ftGk4NJ4trORpss1iYkaeOTxJilB7duEsMGEAbAsIyF02LffMyFXBTl3OKtxCS0wVxx++WztMwNdCPBYHHN/sqjDYhXojz8BXb65FG/Dh++ortOFpw9AxtVhOa34lg7GgHQpxY= Received: by 10.100.195.15 with SMTP id s15mr12225142anf.28.1204293263692; Fri, 29 Feb 2008 05:54:23 -0800 (PST) Received: by 10.100.44.16 with HTTP; Fri, 29 Feb 2008 05:54:23 -0800 (PST) Message-ID: <245ef33b0802290554n16cfac35w7eafc481a8783c61@mail.gmail.com> Date: Fri, 29 Feb 2008 08:54:23 -0500 From: "Sean Callan" To: users@jackrabbit.apache.org Subject: Re: Proper Workspace/Indexing Configuration In-Reply-To: <47C7D8AB.3020105@gmx.net> MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_17366_30519427.1204293263715" References: <245ef33b0802270928ja197797rc9b93f5ce35f3644@mail.gmail.com> <47C7D8AB.3020105@gmx.net> X-Virus-Checked: Checked by ClamAV on apache.org ------=_Part_17366_30519427.1204293263715 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline Hi Marcel, You've answered all my questions in one morning, what a great thing! I did realize my issue with the dependencies for text extraction the other day by stumbling upon the pom.xml. In addition I found there are some other dependencies necessary for PDF extraction listed on the PDFBox website. Thanks for much for all your help, I owe you a beer, let me know when you're in Washington, DC! Thanks, Sean On Fri, Feb 29, 2008 at 5:04 AM, Marcel Reutegger wrote: > Hi Sean, > > did you include all required jar files into your classpath? e.g. text > extraction > from MS office documents requires apache poi. see dependencies here: > > http://svn.apache.org/repos/asf/jackrabbit/tags/1.4/jackrabbit-text-extractors/pom.xml > > regards > marcel > > Sean Callan wrote: > > Hi guys, > > > > Would anyone be so kind as to send me a functional repository > configuration > > that indexes a variety of nt:files types? I'm using the follow > > configuration and I am unable to search for a term within any of my > binary > > content (nt:files > jcr:content). > > > > At this point I'm out of ideas, the correct jars are in place, searching > > works on all my plain text nodes, I can even see that the index is > updated > > when I add in new nt:files nodes. But a search returns nothing. At > this > > point search is the only thing holding back my development and client's > > acceptance of JackRabbit as our repository. > > > > Any help would be greatly appreciated! > > > > > > > > > class="org.apache.jackrabbit.core.fs.local.LocalFileSystem > "> > > > > > > > > > class=" > > org.apache.jackrabbit.core.security.SimpleAccessManager"/> > > > > > > > > > > > rootPath="${rep.home}/workspaces" > > defaultWorkspace="default" /> > > > > > class=" > org.apache.jackrabbit.core.fs.local.LocalFileSystem > > "> > > > > > > > class=" > > org.apache.jackrabbit.core.state.xml.XMLPersistenceManager" /> > > > class="org.apache.jackrabbit.core.query.lucene.SearchIndex"> > > > > > > > > > > > > > > > class=" > org.apache.jackrabbit.core.fs.local.LocalFileSystem > > "> > > > > > > > class=" > > org.apache.jackrabbit.core.state.xml.XMLPersistenceManager" /> > > > > > > > > Thanks, > > Sean > > > > ------=_Part_17366_30519427.1204293263715--