Return-Path: Delivered-To: apmail-lucene-nutch-dev-archive@www.apache.org Received: (qmail 17297 invoked from network); 11 Apr 2006 19:59:56 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 11 Apr 2006 19:59:56 -0000 Received: (qmail 4854 invoked by uid 500); 11 Apr 2006 19:59:54 -0000 Delivered-To: apmail-lucene-nutch-dev-archive@lucene.apache.org Received: (qmail 4834 invoked by uid 500); 11 Apr 2006 19:59:54 -0000 Mailing-List: contact nutch-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: nutch-dev@lucene.apache.org Delivered-To: mailing list nutch-dev@lucene.apache.org Received: (qmail 4823 invoked by uid 99); 11 Apr 2006 19:59:54 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 11 Apr 2006 12:59:54 -0700 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (asf.osuosl.org: domain of jerome.charron@gmail.com designates 66.249.82.200 as permitted sender) Received: from [66.249.82.200] (HELO xproxy.gmail.com) (66.249.82.200) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 11 Apr 2006 12:59:52 -0700 Received: by xproxy.gmail.com with SMTP id i26so803011wxd for ; Tue, 11 Apr 2006 12:59:31 -0700 (PDT) DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=AXIjElLprvO7OVfjsblWyYFufDYj+1+9hnTzaft+7daXTPtGYNCdpjKwh9+v7H1dkiKum14mOSMbCgZYWIozEc6y3Qsz3upztnibh8SYds0e0/uULKpgbDjYqlWr+b+7lK7/+a8ulqT/paY8YsHkxrzw1ck02qdB3e9hhsKfQW8= Received: by 10.70.27.2 with SMTP id a2mr368338wxa; Tue, 11 Apr 2006 12:59:31 -0700 (PDT) Received: by 10.70.110.4 with HTTP; Tue, 11 Apr 2006 12:59:31 -0700 (PDT) Message-ID: Date: Tue, 11 Apr 2006 21:59:31 +0200 From: "=?ISO-8859-1?Q?J=E9r=F4me_Charron?=" To: nutch-dev@lucene.apache.org Subject: Re: Microformats Support - HReview In-Reply-To: <3868806.post@talk.nabble.com> MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_10696_22442602.1144785571902" References: <3868806.post@talk.nabble.com> X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N ------=_Part_10696_22442602.1144785571902 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Content-Disposition: inline > I have noticed that there are the beginnings of microformats support > (rel-tag) in nutch version 0.8. Hi Mike, I have created this plugin for playing a little around microformats. It can be a kind of "tutorial" for people who want to add support for further microformats. > Is anyone still working on adding other > microformats (hreview, hcard)? I don't remember somebody spoke about this on the lists. > If so, I would be interested in helping and/or collaborating. I already > created a simple hreview parser using nutch version 0.7. You can for instance adapt it for nutch 0.8 and then attach the patch to a JIRA issue. (I will be interested in committing it in nutch) Regards -- http://motrech.free.fr/ http://www.frutch.org/ ------=_Part_10696_22442602.1144785571902--