From: Karl Wright
To: "user@manifoldcf.apache.org"
Date: Tue, 14 Jan 2014 12:36:20 -0500
Subject: Re: ManifoldCF SOLR request default Content-Type

Hi Paul,

When there is no content type on a web crawl, the ManifoldCF web connector does not default anything -- it sets null as the content type.

The Solr output connector also does not default anything; it returns null to SolrJ when SolrJ requests the content type. What SolrJ does under those conditions is anyone's guess, but I suspect that is where the application/octet-stream content type is getting set. I'd have to look at that code to be sure.

Karl

On Tue, Jan 14, 2014 at 12:29 PM, Paul Bieles wrote:
> Does ManifoldCF default Content-Type to application/octet-stream for file
> types that it doesn't know? If so, is there a way to set it to something
> else? The reason I ask is I've got a load of kml files that I'm pushing
> into solr.
>
> Cheers,
>
> Paul
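For illustration only, here is a minimal sketch of the behaviour described in the reply: the connector passes a missing content type through as null, and a later step picks a type. The function names and the extension table are hypothetical, and the idea that the downstream client falls back to application/octet-stream is the suspicion voiced above, not something confirmed against the ManifoldCF or SolrJ code.

```python
from typing import Optional

# Hypothetical extension table; the KML entry is the registered KML media type.
EXTENSION_TYPES = {
    ".kml": "application/vnd.google-earth.kml+xml",
}

# Assumed generic fallback, matching the type seen in the original question.
GENERIC_TYPE = "application/octet-stream"


def connector_content_type(header_value: Optional[str]) -> Optional[str]:
    """Mimic a connector that passes the crawled type through and never defaults it."""
    return header_value


def client_content_type(content_type: Optional[str], url: str) -> str:
    """Hypothetical downstream step: choose a type when the connector supplied none."""
    if content_type:
        return content_type
    lowered = url.lower()
    for extension, mime_type in EXTENSION_TYPES.items():
        if lowered.endswith(extension):
            return mime_type
    return GENERIC_TYPE
```

In this sketch, a KML file crawled without a Content-Type header would be sent with the KML media type rather than the generic fallback, which is one possible way to handle the case raised in the question.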