Date: Fri, 19 Apr 2013 07:53:15 +0000 (UTC)
From: "Karl Wright (JIRA)"
To: dev@lucene.apache.org
Subject: [jira] [Commented] (SOLR-4358) SolrJ, by preventing multi-part post, loses key information about file name that Tika needs

    [ https://issues.apache.org/jira/browse/SOLR-4358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13636166#comment-13636166 ]

Karl Wright commented on SOLR-4358:
-----------------------------------

The test that fails for me is:

{code}
[junit4:junit4] Tests with failures:
[junit4:junit4]   - org.apache.solr.cloud.AliasIntegrationTest.testDistribSearch
[junit4:junit4]   - org.apache.solr.cloud.AliasIntegrationTest (suite)
[junit4:junit4]
{code}

This is on branch_4x. Perhaps the test was renamed on trunk?
Anyhow, the failure looks suspiciously like I tripped over a local disk error:

{code}
2> Creating dataDir: C:\wip\solr\branch_4x\solr\build\solr-core\test\J0\.\solrtest-AliasIntegrationTest-1366356503950
2> !!!! WARNING: best effort to remove C:\wip\solr\branch_4x\solr\build\solr-core\test\J0\.\org.apache.solr.cloud.AliasIntegrationTest-1366356503947\jetty1\index\org.apache.lucene.store.RAMDirectory@1f24a78 lockFactory=org.apache.lucene.store.NativeFSLockFactory@15b0c83-write.lock FAILED !!!!!
2> !!!! WARNING: best effort to remove C:\wip\solr\branch_4x\solr\build\solr-core\test\J0\.\org.apache.solr.cloud.AliasIntegrationTest-1366356503947\jetty1\index FAILED !!!!!
2> !!!! WARNING: best effort to remove C:\wip\solr\branch_4x\solr\build\solr-core\test\J0\.\org.apache.solr.cloud.AliasIntegrationTest-1366356503947\jetty1 FAILED !!!!!
2> !!!! WARNING: best effort to remove C:\wip\solr\branch_4x\solr\build\solr-core\test\J0\.\org.apache.solr.cloud.AliasIntegrationTest-1366356503947 FAILED !!!!!
2> NOTE: reproduce with: ant test -Dtestcase=AliasIntegrationTest -Dtests.method=testDistribSearch -Dtests.seed=964155E88FA3F7F -Dtests.slow=true -Dtests.locale=en_PH -Dtests.timezone=Etc/GMT0 -Dtests.file.encoding=Cp1252
[03:28:23.944] ERROR 53.9s | AliasIntegrationTest.testDistribSearch <<<
> Throwable #1: org.apache.solr.client.solrj.impl.HttpSolrServer$RemoteSolrException: Server at http://127.0.0.1:56486/_hfw/testalias returned non ok status:500, message:Server Error
>     at __randomizedtesting.SeedInfo.seed([964155E88FA3F7F:88829B46FFA55F43]:0)
>     at org.apache.solr.client.solrj.impl.HttpSolrServer.request(HttpSolrServer.java:385)
>     at org.apache.solr.client.solrj.impl.HttpSolrServer.request(HttpSolrServer.java:180)
>     at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:117)
>     at org.apache.solr.client.solrj.SolrServer.add(SolrServer.java:116)
>     at org.apache.solr.client.solrj.SolrServer.add(SolrServer.java:102)
>     at org.apache.solr.cloud.AliasIntegrationTest.doTest(AliasIntegrationTest.java:213)
>     at org.apache.solr.BaseDistributedSearchTestCase.testDistribSearch(BaseDistributedSearchTestCase.java:806)
{code}

On repeat, the test succeeded. I think there is something nondeterministic about the SolrCloud tests in general, so I will rerun until I'm satisfied that no test fails repeatably.

> SolrJ, by preventing multi-part post, loses key information about file name that Tika needs
> -------------------------------------------------------------------------------------------
>
>                 Key: SOLR-4358
>                 URL: https://issues.apache.org/jira/browse/SOLR-4358
>             Project: Solr
>          Issue Type: Bug
>          Components: clients - java
>    Affects Versions: 4.0
>            Reporter: Karl Wright
>            Assignee: Ryan McKinley
>         Attachments: additional_changes.diff, SOLR-4358.patch, SOLR-4358.patch, SOLR-4358.patch
>
>
> SolrJ accepts a ContentStream, which has a name field.
> Within HttpSolrServer.java, if SolrJ makes the decision to use multipart posts, this filename is transmitted as part of the form boundary information. However, if SolrJ chooses not to use multipart post, the filename information is lost.
> This information is used by SolrCell (Tika) to make decisions about content extraction, so it is very important that it makes it into Solr in one way or another. Either SolrJ should set appropriate equivalent headers to send the filename automatically, or it should force multipart posts when this information is present.
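
To make the issue description concrete, here is a minimal, library-free Java sketch of the difference it describes. It is not SolrJ's code: the class and method names (MultipartNameDemo, multipartPartHeaders, rawPostHeaders) and the sample field and file names are hypothetical. It only illustrates that a multipart/form-data part has a place for the file name (the filename parameter of its Content-Disposition header), while a plain POST of the same bytes carries just a content type and length unless the client adds the name some other way.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical illustration only; this does not reproduce HttpSolrServer.
class MultipartNameDemo {

    // Headers that precede the payload of one multipart/form-data part.
    // The stream name travels in the "filename" parameter.
    static String multipartPartHeaders(String fieldName, String fileName, String contentType) {
        return "Content-Disposition: form-data; name=\"" + fieldName
                + "\"; filename=\"" + fileName + "\"\r\n"
                + "Content-Type: " + contentType + "\r\n\r\n";
    }

    // Headers of a plain, non-multipart POST of the same bytes.
    // Nothing here records the stream name.
    static Map<String, String> rawPostHeaders(String contentType, long contentLength) {
        Map<String, String> headers = new LinkedHashMap<>();
        headers.put("Content-Type", contentType);
        headers.put("Content-Length", Long.toString(contentLength));
        return headers;
    }

    public static void main(String[] args) {
        // Sample values are made up for the demonstration.
        System.out.print(multipartPartHeaders("myfile", "report.pdf", "application/pdf"));
        System.out.println(rawPostHeaders("application/pdf", 1024L));
    }
}
```

Either remedy proposed in the issue amounts to getting the name onto the wire: through a part header like the first one, or through an extra header or parameter added to the second.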