lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hoss Man (JIRA)" <j...@apache.org>
Subject [jira] [Reopened] (SOLR-4358) SolrJ, by preventing multi-part post, loses key information about file name that Tika needs
Date Tue, 16 Apr 2013 22:49:16 GMT

     [ https://issues.apache.org/jira/browse/SOLR-4358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Hoss Man reopened SOLR-4358:
----------------------------


As noted by Uwe on the dev list, the changes committed for this issue seem ot have caused
various test failures, as well as sending ChaosMonkeySafeLeader into an infinite loop(?!)

Re-opening to investigate - likely need to roll back these changes if we can't get to the
bottom of things right away
                
> SolrJ, by preventing multi-part post, loses key information about file name that Tika
needs
> -------------------------------------------------------------------------------------------
>
>                 Key: SOLR-4358
>                 URL: https://issues.apache.org/jira/browse/SOLR-4358
>             Project: Solr
>          Issue Type: Bug
>          Components: clients - java
>    Affects Versions: 4.0
>            Reporter: Karl Wright
>            Assignee: Ryan McKinley
>         Attachments: SOLR-4358.patch, SOLR-4358.patch
>
>
> SolrJ accepts a ContentStream, which has a name field.  Within HttpSolrServer.java, if
SolrJ makes the decision to use multipart posts, this filename is transmitted as part of the
form boundary information.  However, if SolrJ chooses not to use multipart post, the filename
information is lost.
> This information is used by SolrCell (Tika) to make decisions about content extraction,
so it is very important that it makes it into Solr in one way or another.  Either SolrJ should
set appropriate equivalent headers to send the filename automatically, or it should force
multipart posts when this information is present.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message