From: Leif Hetlesæther
To: solr-user@lucene.apache.org
Date: Sun, 03 Mar 2013 21:09:57 +0100
Subject: Making tika process mail attachments eludes me
Message-ID: <5133AE15.6000903@gurusoft.no>

I have been trying for a while to create an index of a mailbox. I downloaded solr-4.1.0.tgz and configured example/example-DIH/solr/mail/conf/data-config.xml. Emails are indexed, but the attachments elude me.

The config says:

"Note - In order to index attachments, set processAttachement="true" and drop Tika and its dependencies to example-DIH/solr/mail/lib directory"

I have tried dropping in the files from contrib/extract/lib, but no luck. My friend Google seems unable to help me.

Do I need to modify schema.xml or solrconfig.xml? I cannot see any trace of Tika, or any errors, in my logfile.

Is there a working example for indexing mails and attachments available to download somewhere?

--
Regards
Leif Hetlesæther