manifoldcf-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Karl Wright (JIRA)" <j...@apache.org>
Subject [jira] [Resolved] (CONNECTORS-1434) Bad characters in file name can cause Solr 500 errors
Date Thu, 22 Jun 2017 09:57:01 GMT

     [ https://issues.apache.org/jira/browse/CONNECTORS-1434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Karl Wright resolved CONNECTORS-1434.
-------------------------------------
    Resolution: Fixed

r1799511 (also includes another fix for ES)


> Bad characters in file name can cause Solr 500 errors
> -----------------------------------------------------
>
>                 Key: CONNECTORS-1434
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1434
>             Project: ManifoldCF
>          Issue Type: Bug
>          Components: Lucene/SOLR connector
>    Affects Versions: ManifoldCF 2.7
>            Reporter: Karl Wright
>            Assignee: Karl Wright
>             Fix For: ManifoldCF 2.8
>
>         Attachments: CONNECTORS-1434.patch
>
>
> There are reports that quotes or spaces in a file name can blow up the Solr indexing
of the document and cause it to throw a 500 error.
> The code in question (from ModifiedHttpSolrClient) is the following:
> {code}
>             String name = content.getName();
>             if (name == null) {
>               name = "";
>             }
>             parts.add(new FormBodyPart(name,
>                 new InputStreamBody(
>                     content.getStream(),
>                     contentType,
>                     content.getName())));
> {code}
> ... where content.getName() would be returning a name with illegal characters.  The question
is, what does httpclient do with this name, and should it be escaping it in some way?



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message