Date: Tue, 11 Dec 2012 11:08:45 -0800 (PST)
From: eShard
To: solr-user@lucene.apache.org
Subject: Too many Tika errors

I'm running Solr 4.0 on Tomcat 7.0.8, using the solr/example single core together with ManifoldCF v1.1. I had everything working, but then the crawler stopped and I have Tika
errors in the Solr log.

With Tika 1.1 I get these errors:

org.apache.solr.common.SolrException: org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@17bc9c03

So I upgraded to Tika 1.2, and again everything seemed to be working (I indexed 24,000 files). Then I recrawled the repository and it stopped again; this time the Tika errors are:

null:java.lang.RuntimeException: java.lang.NoClassDefFoundError: org/mozilla/universalchardet/CharsetListener
    at org.apache.solr.servlet.SolrDispatchFilter.sendError(SolrDispatchFilter.java:456)

What's going on here? Which version of Tika should I use?
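
A NoClassDefFoundError such as the one quoted in this message means the JVM could not load the named class at runtime. One way to narrow it down is to check whether any jar in the webapp's library directory actually contains that class. The following is only a rough sketch of such a check: the function name `jars_containing` and the directory passed to it are illustrative assumptions, not part of Solr, Tika, or ManifoldCF.

```python
import zipfile
from pathlib import Path


def jars_containing(lib_dir, class_name):
    """Return the jar files under lib_dir that contain the given class.

    class_name may be dotted (a.b.C) or slash-separated (a/b/C), as it
    appears in a NoClassDefFoundError message.
    """
    # A class a.b.C is stored inside a jar as the entry a/b/C.class.
    entry = class_name.replace(".", "/") + ".class"
    matches = []
    for jar in sorted(Path(lib_dir).rglob("*.jar")):
        try:
            with zipfile.ZipFile(jar) as zf:
                if entry in zf.namelist():
                    matches.append(jar)
        except zipfile.BadZipFile:
            # Skip files that are not valid zip archives.
            continue
    return matches
```

For example, calling `jars_containing(lib_dir, "org/mozilla/universalchardet/CharsetListener")` with `lib_dir` set to wherever the Solr webapp keeps its jars (a location that depends on the installation) returns an empty list if no jar there provides the class.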