manifoldcf-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Karl Wright (JIRA)" <>
Subject [jira] [Created] (CONNECTORS-1434) Bad characters in file name can cause Solr 500 errors
Date Wed, 21 Jun 2017 09:56:00 GMT
Karl Wright created CONNECTORS-1434:

             Summary: Bad characters in file name can cause Solr 500 errors
                 Key: CONNECTORS-1434
             Project: ManifoldCF
          Issue Type: Bug
          Components: Lucene/SOLR connector
    Affects Versions: ManifoldCF 2.7
            Reporter: Karl Wright
            Assignee: Karl Wright
             Fix For: ManifoldCF 2.8

There are reports that quotes or spaces in a file name can blow up the Solr indexing of the
document and cause it to throw a 500 error.

The code in question (from ModifiedHttpSolrClient) is the following:

            String name = content.getName();
            if (name == null) {
              name = "";
            parts.add(new FormBodyPart(name,
                new InputStreamBody(

... where content.getName() would be returning a name with illegal characters.  The question
is, what does httpclient do with this name, and should it be escaping it in some way?

This message was sent by Atlassian JIRA

View raw message