Return-Path: Delivered-To: apmail-jakarta-lucene-dev-archive@apache.org Received: (qmail 40221 invoked from network); 1 Jul 2003 04:12:31 -0000 Received: from exchange.sun.com (192.18.33.10) by daedalus.apache.org with SMTP; 1 Jul 2003 04:12:31 -0000 Received: (qmail 25347 invoked by uid 97); 1 Jul 2003 04:15:04 -0000 Delivered-To: qmlist-jakarta-archive-lucene-dev@nagoya.betaversion.org Received: (qmail 25340 invoked from network); 1 Jul 2003 04:15:04 -0000 Received: from daedalus.apache.org (HELO apache.org) (208.185.179.12) by nagoya.betaversion.org with SMTP; 1 Jul 2003 04:15:04 -0000 Received: (qmail 39926 invoked by uid 500); 1 Jul 2003 04:12:28 -0000 Mailing-List: contact lucene-dev-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Developers List" Reply-To: "Lucene Developers List" Delivered-To: mailing list lucene-dev@jakarta.apache.org Received: (qmail 39915 invoked from network); 1 Jul 2003 04:12:28 -0000 Received: from natsmtp01.webmailer.de (HELO post.webmailer.de) (192.67.198.81) by daedalus.apache.org with SMTP; 1 Jul 2003 04:12:28 -0000 Received: from dstc.edu.au (m080-108.nv.iinet.net.au [203.217.80.108]) by post.webmailer.de (8.12.8/8.8.7) with ESMTP id h614CZqi015576 for ; Tue, 1 Jul 2003 06:12:36 +0200 (MEST) Message-ID: <3F010A4A.6000505@dstc.edu.au> Date: Tue, 01 Jul 2003 14:12:58 +1000 From: Peter Becker User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.3) Gecko/20030312 X-Accept-Language: en-us, en MIME-Version: 1.0 To: Lucene Developers List Subject: Re: Lucene crawler plan References: <5.1.1.6.0.20030630204025.03806418@thinkalong.com> <3F0105B7.2070805@dstc.edu.au> In-Reply-To: <3F0105B7.2070805@dstc.edu.au> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N Peter Becker wrote: [...about the UNIX "file" command...] > The idea is to recognize files by certain parts in them instead of > using the extensions. The result of the classic file command is a > user-readable string, although there have been extensions to MIME > types. Unfortunately I can't find a pointer for the latter. I should read the documents I cite :-) The man page for file contains the information about the MIME versions -- it is the "-i" option (see OPTIONS, or search for " -i "). Peter --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-dev-help@jakarta.apache.org