Return-Path: Delivered-To: apmail-couchdb-user-archive@www.apache.org Received: (qmail 2381 invoked from network); 5 Sep 2009 14:42:55 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 5 Sep 2009 14:42:55 -0000 Received: (qmail 49050 invoked by uid 500); 5 Sep 2009 14:42:54 -0000 Delivered-To: apmail-couchdb-user-archive@couchdb.apache.org Received: (qmail 48980 invoked by uid 500); 5 Sep 2009 14:42:54 -0000 Mailing-List: contact user-help@couchdb.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@couchdb.apache.org Delivered-To: mailing list user@couchdb.apache.org Received: (qmail 48970 invoked by uid 99); 5 Sep 2009 14:42:54 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 05 Sep 2009 14:42:54 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_HELO_PASS,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of tom@thomas-harding.name designates 88.170.168.121 as permitted sender) Received: from [88.170.168.121] (HELO smtp.thomas-harding.name) (88.170.168.121) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 05 Sep 2009 14:42:40 +0000 Received: from localhost (localhost [127.0.0.1]) by smtp.thomas-harding.name (Postfix) with ESMTP id AD1BE21609 for ; Sat, 5 Sep 2009 16:42:19 +0200 (CEST) X-Virus-Scanned: Debian amavisd-new at smtp.thomas-harding.name Received: from smtp.thomas-harding.name ([127.0.0.1]) by localhost (thomas-harding.name [127.0.0.1]) (amavisd-new, port 10024) with LMTP id t0fNQgKy13C5 for ; Sat, 5 Sep 2009 16:42:18 +0200 (CEST) Received: from [IPv6:2a01:e35:8aaa:8790:222:15ff:fe64:e855] (blackberry.hdglocal [IPv6:2a01:e35:8aaa:8790:222:15ff:fe64:e855]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (Client CN "Thomas Harding, smtp", Issuer "Thomas Harding" (verified OK)) (Authenticated sender: tom@smtp) by smtp.thomas-harding.name (Postfix) with ESMTPSA id 57E5621607 for ; Sat, 5 Sep 2009 16:42:18 +0200 (CEST) Message-ID: <4AA278C8.40302@thomas-harding.name> Date: Sat, 05 Sep 2009 16:42:16 +0200 From: Thomas Harding User-Agent: Mozilla-Thunderbird 2.0.0.22 (X11/20090706) MIME-Version: 1.0 To: user@couchdb.apache.org Subject: Re: [HALF-SOLVED] Re: how to handle attachments with couchdb-lucene? References: <4AA1C2BA.1070008@thomas-harding.name> <39BB2977-3502-4E68-A2DE-DF3EE04C43DA@gmail.com> <46aeb24f0909050232p5893bbc9k14845ae9e163c674@mail.gmail.com> <4AA242A8.1070306@thomas-harding.name> <46aeb24f0909050510u7509903dmc3620a1b06f00020@mail.gmail.com> In-Reply-To: <46aeb24f0909050510u7509903dmc3620a1b06f00020@mail.gmail.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org Thanks for your response. So, for text/plain guessed as application/octet-stream I suppose the problem comes from couchdb itself, as: "tom@blackberry:~/couchdb-lucene$ file test test: UTF-8 Unicode text". On the other hand, for "text/x-patch" and "text/whatether", Metadata.CONTENT_TYPE could be filled in tika calls with "text/plain" via a matching table. 'Just an idea... :) Robert Newson wrote: > couchdb-lucene uses the content-type stored in couchdb when parsing > attachments. couchdb-lucene then uses Apache Tika to parse the > attachments, and it is there that support for new MIME types should be > requested. > > A list of currently supported MIME types is available at; > > http://github.com/rnewson/couchdb-lucene > > B. >