Return-Path: X-Original-To: apmail-lucene-dev-archive@www.apache.org Delivered-To: apmail-lucene-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 6CBCC10A32 for ; Sun, 23 Feb 2014 08:13:54 +0000 (UTC) Received: (qmail 44462 invoked by uid 500); 23 Feb 2014 08:13:51 -0000 Delivered-To: apmail-lucene-dev-archive@lucene.apache.org Received: (qmail 44411 invoked by uid 500); 23 Feb 2014 08:13:50 -0000 Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@lucene.apache.org Delivered-To: mailing list dev@lucene.apache.org Received: (qmail 44404 invoked by uid 99); 23 Feb 2014 08:13:49 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 23 Feb 2014 08:13:49 +0000 X-ASF-Spam-Status: No, hits=0.5 required=5.0 tests=FUZZY_VPILL,SPF_HELO_PASS,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of uwe@thetaphi.de designates 85.25.204.22 as permitted sender) Received: from [85.25.204.22] (HELO mail.sd-datasolutions.de) (85.25.204.22) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 23 Feb 2014 08:13:39 +0000 Received: from VEGA (unknown [IPv6:2001:1a80:2b03:7301:8e70:5aff:fed1:75a4]) by mail.sd-datasolutions.de (Postfix) with ESMTPSA id A1D073320062 for ; Sun, 23 Feb 2014 08:13:18 +0000 (UTC) X-NSA-Greeting: Dear NSA, have fun with reading and analyzing this e-mail! From: "Uwe Schindler" To: References: <20140223022203.7E24323889F7@eris.apache.org> In-Reply-To: <20140223022203.7E24323889F7@eris.apache.org> Subject: RE: svn commit: r1570955 [1/3] - in /lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files: ./ test-documents/ test-morphlines/ Date: Sun, 23 Feb 2014 09:13:19 +0100 Message-ID: <08a001cf306f$1cdbbe60$56933b20$@thetaphi.de> MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Mailer: Microsoft Outlook 14.0 Thread-Index: AQJVQjOWzhos5Ni7tsmNQIdrau3WZJm2WcqA Content-Language: de X-Virus-Checked: Checked by ClamAV on apache.org Thanks! ----- Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: uwe@thetaphi.de > -----Original Message----- > From: markrmiller@apache.org [mailto:markrmiller@apache.org] > Sent: Sunday, February 23, 2014 3:22 AM > To: commits@lucene.apache.org > Subject: svn commit: r1570955 [1/3] - in > /lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files: ./ = test- > documents/ test-morphlines/ >=20 > Author: markrmiller > Date: Sun Feb 23 02:22:02 2014 > New Revision: 1570955 >=20 > URL: http://svn.apache.org/r1570955 > Log: > SOLR-5764: Set eol-style on test resources >=20 > Modified: > = lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/morphlines- > core.marker (contents, props changed) > lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/cars.csv (contents, props changed) > lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/complex.mbox (contents, props changed) > lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/email.eml (contents, props changed) > lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/rsstest.rss (contents, props changed) > lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/sample-statuses-20120906-141433 (contents, props changed) > lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/testEMLX.emlx (contents, props changed) > lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/testRFC822 (contents, props changed) > lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/testRTFVarious.rtf (contents, props changed) > lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/testSVG.svg (contents, props changed) > lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > morphlines/loadSolrBasic.conf (contents, props changed) > lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > morphlines/solrCellDocumentTypes.conf (contents, props changed) > lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > morphlines/solrCellJPGCompressed.conf (contents, props changed) > lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > morphlines/solrCellXML.conf (contents, props changed) > lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > morphlines/tokenizeText.conf (contents, props changed) > lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > morphlines/tutorialReadAvroContainer.conf (contents, props changed) >=20 > Modified: lucene/dev/trunk/solr/contrib/morphlines-core/src/test- > files/morphlines-core.marker > URL: > http://svn.apache.org/viewvc/lucene/dev/trunk/solr/contrib/morphlines- > core/src/test-files/morphlines- > core.marker?rev=3D1570955&r1=3D1570954&r2=3D1570955&view=3Ddiff > = =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > (empty) >=20 > Modified: lucene/dev/trunk/solr/contrib/morphlines-core/src/test- > files/test-documents/cars.csv > URL: > http://svn.apache.org/viewvc/lucene/dev/trunk/solr/contrib/morphlines- > core/src/test-files/test- > documents/cars.csv?rev=3D1570955&r1=3D1570954&r2=3D1570955&view=3Ddiff > = =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > --- lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/cars.csv (original) > +++ lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/cars.csv Sun Feb 23 02:22:02 2014 > @@ -1,6 +1,6 @@ > -Age,Color,Extras,Type,Used > -2,blue,GPS,"Gas, with electric","" > -10,green,"Labeled ""Vintage, 1913""",,yes > -100,red,"Labeled ""Vintage 1913""",yes > -5,orange,none,"This is a > +Age,Color,Extras,Type,Used > +2,blue,GPS,"Gas, with electric","" > +10,green,"Labeled ""Vintage, 1913""",,yes > +100,red,"Labeled ""Vintage 1913""",yes > +5,orange,none,"This is a > multi, line text",no > \ No newline at end of file >=20 > Modified: lucene/dev/trunk/solr/contrib/morphlines-core/src/test- > files/test-documents/complex.mbox > URL: > http://svn.apache.org/viewvc/lucene/dev/trunk/solr/contrib/morphlines- > core/src/test-files/test- > = documents/complex.mbox?rev=3D1570955&r1=3D1570954&r2=3D1570955&view=3Ddi > ff > = =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > --- lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/complex.mbox (original) > +++ lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/complex.mbox Sun Feb 23 02:22:02 2014 > @@ -1,291 +1,291 @@ > -From core-user-return-14700-apmail-hadoop-core-user- > archive=3Dhadoop.apache.org@hadoop.apache.org Mon Jun 01 04:28:28 2009 > -Return-Path: archive=3Dhadoop.apache.org@hadoop.apache.org> > -Delivered-To: apmail-hadoop-core-user-archive@www.apache.org > -Received: (qmail 19921 invoked from network); 1 Jun 2009 04:28:28 = -0000 > -Received: from hermes.apache.org (HELO mail.apache.org) = (140.211.11.3) > - by minotaur.apache.org with SMTP; 1 Jun 2009 04:28:28 -0000 > -Received: (qmail 84995 invoked by uid 500); 1 Jun 2009 04:28:38 -0000 > -Delivered-To: apmail-hadoop-core-user-archive@hadoop.apache.org > -Received: (qmail 84895 invoked by uid 500); 1 Jun 2009 04:28:38 -0000 > -Mailing-List: contact core-user-help@hadoop.apache.org; run by ezmlm > -Precedence: bulk > -List-Help: > -List-Unsubscribe: > -List-Post: > -List-Id: > -Reply-To: core-user@hadoop.apache.org > -Delivered-To: mailing list core-user@hadoop.apache.org > -Received: (qmail 84885 invoked by uid 99); 1 Jun 2009 04:28:38 -0000 > -Received: from athena.apache.org (HELO athena.apache.org) > (140.211.11.136) > - by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 01 Jun 2009 = 04:28:38 > +0000 > -X-ASF-Spam-Status: No, hits=3D1.2 required=3D10.0 > - tests=3DSPF_NEUTRAL > -X-Spam-Check-By: apache.org > -Received-SPF: neutral (athena.apache.org: local policy) > -Received: from [69.147.107.21] (HELO mrout2-b.corp.re1.wahoo.com) > (69.147.107.21) > - by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 01 Jun 2009 = 04:28:26 > +0000 > -Received: from SNV-EXPF01.ds.corp.wahoo.com (snv- > expf01.ds.corp.wahoo.com [207.126.227.250]) > - by mrout2-b.corp.re1.wahoo.com (8.13.8/8.13.8/y.out) with ESMTP > id n514QYA6099963 > - for ; Sun, 31 May 2009 21:26:35 - > 0700 (PDT) > -DomainKey-Signature: a=3Drsa-sha1; s=3Dserpent; d=3Dwahoo-inc.com; = c=3Dnofws; > q=3Ddns; > - h=3Dreceived:user-agent:date:subject:from:to:message-id: > - thread-topic:thread-index:in-reply-to:mime-version:content-type: > - content-transfer-encoding:x-originalarrivaltime; > - > b=3DYVtSNdgjeeSBS1yY3XDolul49i+HrgNG7QszMo9LzGnrwejjgsl5+iUM > 6EiQgEpV > -Received: from SNV-EXVS08.ds.corp.wahoo.com ([207.126.227.9]) by SNV- > EXPF01.ds.corp.wahoo.com with Microsoft SMTPSVC(6.0.3790.3959); > - Sun, 31 May 2009 21:26:34 -0700 > -Received: from 10.66.92.213 ([10.66.92.213]) by SNV- > EXVS08.ds.corp.wahoo.com ([207.126.227.58]) with Microsoft Exchange > Server HTTP-DAV ; > - Mon, 1 Jun 2009 04:26:33 +0000 > -User-Agent: Microsoft-Entourage/12.17.0.090302 > -Date: Mon, 01 Jun 2009 09:56:31 +0530 > -Subject: Re: question about when shuffle/sort start working > -From: Sam Judgement > -To: > -Message-ID: > -Thread-Topic: question about when shuffle/sort start working > -Thread-Index: AcnicSNoBw19cMU8UEaXwAdZ1YYhuw=3D=3D > -In-Reply-To: <440622.41041.qm@web111005.mail.gq1.wahoo.com> > -Mime-version: 1.0 > -Content-type: text/plain; > - charset=3D"US-ASCII" > -Content-transfer-encoding: 7bit > -X-OriginalArrivalTime: 01 Jun 2009 04:26:34.0501 (UTC) > FILETIME=3D[257EAB50:01C9E271] > -X-Virus-Checked: Checked by ClamAV on apache.org > - > -When a Mapper completes, MapCompletionEvents are generated. > Reducers try to > -fetch map outputs for a given map only on the receipt of such events. > - > -Sam > - > - > -On 5/30/09 10:00 AM, "Jianmin Foo" wrote: > - > -> Hi, > -> I am being confused by the protocol between mapper and reducer. = When > mapper > -> emitting the (key,value) pair done, is there any signal the mapper = send > out to > -> hadoop framework in protocol to indicate that map is done and the > shuffle/sort > -> can begin for reducer? If there is no this signal in protocol, when = the > -> framework begin the shuffle/sort? > -> > -> Thanks, > -> Jianmin > -> > -> > -> > -> > - > - > -From core-user-return-14701-apmail-hadoop-core-user- > archive=3Dhadoop.apache.org@hadoop.apache.org Mon Jun 01 05:31:14 2009 > -Return-Path: archive=3Dhadoop.apache.org@hadoop.apache.org> > -Delivered-To: apmail-hadoop-core-user-archive@www.apache.org > -Received: (qmail 38243 invoked from network); 1 Jun 2009 05:31:14 = -0000 > -Received: from hermes.apache.org (HELO mail.apache.org) = (140.211.11.3) > - by minotaur.apache.org with SMTP; 1 Jun 2009 05:31:14 -0000 > -Received: (qmail 15621 invoked by uid 500); 1 Jun 2009 05:31:24 -0000 > -Delivered-To: apmail-hadoop-core-user-archive@hadoop.apache.org > -Received: (qmail 15557 invoked by uid 500); 1 Jun 2009 05:31:24 -0000 > -Mailing-List: contact core-user-help@hadoop.apache.org; run by ezmlm > -Precedence: bulk > -List-Help: > -List-Unsubscribe: > -List-Post: > -List-Id: > -Reply-To: core-user@hadoop.apache.org > -Delivered-To: mailing list core-user@hadoop.apache.org > -Received: (qmail 15547 invoked by uid 99); 1 Jun 2009 05:31:24 -0000 > -Received: from nike.apache.org (HELO nike.apache.org) = (192.87.106.230) > - by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 01 Jun 2009 = 05:31:24 > +0000 > -X-ASF-Spam-Status: No, hits=3D2.2 required=3D10.0 > - tests=3DHTML_MESSAGE,SPF_PASS > -X-Spam-Check-By: apache.org > -Received-SPF: pass (nike.apache.org: local policy) > -Received: from [68.142.237.94] (HELO n9.bullet.re3.wahoo.com) > (68.142.237.94) > - by apache.org (qpsmtpd/0.29) with SMTP; Mon, 01 Jun 2009 05:31:11 > +0000 > -Received: from [68.142.237.88] by n9.bullet.re3.wahoo.com with NNFMP; = 01 > Jun 2009 05:30:50 -0000 > -Received: from [67.195.9.82] by t4.bullet.re3.wahoo.com with NNFMP; = 01 > Jun 2009 05:30:49 -0000 > -Received: from [67.195.9.99] by t2.bullet.mail.gq1.wahoo.com with = NNFMP; > 01 Jun 2009 05:30:49 -0000 > -Received: from [127.0.0.1] by omp103.mail.gq1.wahoo.com with NNFMP; = 01 > Jun 2009 05:28:01 -0000 > -X-wahoo-Newman-Property: ymail-3 > -X-wahoo-Newman-Id: 796121.97519.bm@omp103.mail.gq1.wahoo.com > -Received: (qmail 35264 invoked by uid 60001); 1 Jun 2009 05:30:49 = -0000 > -DKIM-Signature: v=3D1; a=3Drsa-sha256; c=3Drelaxed/relaxed; = d=3Dwahoo.com; > s=3Ds1024; t=3D1243834249; > bh=3DR8qzdi/IbLyO8UwpnaujDpT9E+6bJ7nkmZN2803EmRk=3D; h=3DMessage-ID:X- > YMail-OSG:Received:X-Mailer:References:Date:From:Subject:To:In-Reply- > To:MIME-Version:Content-Type; > b=3Dvq4c6RIDbkuLPYd8mirusIXf6DqTb/IeT55In7W00Y5Sxx1ZiXBb78yE9+TDfXJ0 > elsEZvqv4ocyvolGE0eGtyYeJA0mZikpRNu6pidxPNpCplOcLHBRz7YQ7iERwv3T > agRlWy2Xd3oD9ZeV0A05P7WUOiNNX1PUUJD1IVdrEZo=3D > -DomainKey-Signature:a=3Drsa-sha1; q=3Ddns; c=3Dnofws; > - s=3Ds1024; d=3Dwahoo.com; > - h=3DMessage-ID:X-YMail-OSG:Received:X- > Mailer:References:Date:From:Subject:To:In-Reply-To:MIME- > Version:Content-Type; > - > b=3D6HXZV98ON5vBwmE/xS8stVD0D2F4dkMY7a0suX5KVTb736JdR8G59mqBq/ > dWcpbFTLiCLtxi18LMb/dU1RKRGOEdn3l3j/jKXhBrhIgfg3qtNskPedXDKBvn7JG > XiSkqpA/tUtPjvc0Uuk8/LaA01SQTz40Engg7nD8/EJdIAhA=3D; > -Message-ID: <592088.35091.qm@web111010.mail.gq1.wahoo.com> > -X-YMail-OSG: > KzhhrJYVM1m.MCS6vRpRP2ZZO2PrfnbngosELDCIa91ZqvhJph4RdmzfUW0jw > 9W04RCSch1K730bPohwNpNBIk2QR_zt4_mfbhfq7YEPkSoz9LSXG90P9vIo5Fc > 8qyZN0U6vA9gtdyGQTpN5ahvillUH9nAF0TMWv2SvZJLjPlQ0Z0p8oK8ltBwGTg > LrM8Jtdn9D29yoRyi3_EpVOfdD9OP.EK50Vr1XwSUYMbnpZ0WGHMwd.Yig7A > 6Elwadm3YVbfOdx2mfrG.jQsUAxQjRBNvbrOM57.FaE11kHTe9aoBWSeihNg-- > -Received: from [216.145.54.7] by web111010.mail.gq1.wahoo.com via = HTTP; > Sun, 31 May 2009 22:30:49 PDT > -X-Mailer: wahooMailRC/1277.43 wahooMailWebService/0.7.289.10 > -References: > -Date: Sun, 31 May 2009 22:30:49 -0700 (PDT) > -From: Jianmin Foo > -Subject: Re: question about when shuffle/sort start working > -To: core-user@hadoop.apache.org > -In-Reply-To: > -MIME-Version: 1.0 > -Content-Type: multipart/alternative; boundary=3D"0-1193839393- > 1243834249=3D:35091" > -X-Virus-Checked: Checked by ClamAV on apache.org > - > ---0-1193839393-1243834249=3D:35091 > -Content-Type: text/plain; charset=3Dus-ascii > - > -Thanks a lot for your explanation, Sam. > - > -So is this event generated by hadoop framework? Is there any API in > mapper to fire this event? Actually, I am thinking to implement a = mapper that > will emit some pairs, then fire this event to let the = reducer > works, the same mapper task then emit some other pairs = and > repeat. Do you think is this logic feasible by current API? > - > -Thanks, > -Jianmin > - > - > - > - > - > -________________________________ > -From: Sam Judgement > -To: core-user@hadoop.apache.org > -Sent: Monday, June 1, 2009 12:26:31 PM > -Subject: Re: question about when shuffle/sort start working > - > -When a Mapper completes, MapCompletionEvents are generated. > Reducers try to > -fetch map outputs for a given map only on the receipt of such events. > - > -Sam > - > - > -On 5/30/09 10:00 AM, "Jianmin Foo" wrote: > - > -> Hi, > -> I am being confused by the protocol between mapper and reducer. = When > mapper > -> emitting the (key,value) pair done, is there any signal the mapper = send > out to > -> hadoop framework in protocol to indicate that map is done and the > shuffle/sort > -> can begin for reducer? If there is no this signal in protocol, when = the > -> framework begin the shuffle/sort? > -> > -> Thanks, > -> Jianmin > -> > -> > -> > -> > - > - > - > ---0-1193839393-1243834249=3D:35091-- > - > - > -From core-user-return-14702-apmail-hadoop-core-user- > archive=3Dhadoop.apache.org@hadoop.apache.org Mon Jun 01 06:04:30 2009 > -Return-Path: archive=3Dhadoop.apache.org@hadoop.apache.org> > -Delivered-To: apmail-hadoop-core-user-archive@www.apache.org > -Received: (qmail 53387 invoked from network); 1 Jun 2009 06:04:29 = -0000 > -Received: from hermes.apache.org (HELO mail.apache.org) = (140.211.11.3) > - by minotaur.apache.org with SMTP; 1 Jun 2009 06:04:29 -0000 > -Received: (qmail 39066 invoked by uid 500); 1 Jun 2009 06:04:39 -0000 > -Delivered-To: apmail-hadoop-core-user-archive@hadoop.apache.org > -Received: (qmail 38970 invoked by uid 500); 1 Jun 2009 06:04:39 -0000 > -Mailing-List: contact core-user-help@hadoop.apache.org; run by ezmlm > -Precedence: bulk > -List-Help: > -List-Unsubscribe: > -List-Post: > -List-Id: > -Reply-To: core-user@hadoop.apache.org > -Delivered-To: mailing list core-user@hadoop.apache.org > -Received: (qmail 38955 invoked by uid 99); 1 Jun 2009 06:04:39 -0000 > -Received: from athena.apache.org (HELO athena.apache.org) > (140.211.11.136) > - by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 01 Jun 2009 = 06:04:39 > +0000 > -X-ASF-Spam-Status: No, hits=3D1.2 required=3D10.0 > - tests=3DSPF_NEUTRAL > -X-Spam-Check-By: apache.org > -Received-SPF: neutral (athena.apache.org: local policy) > -Received: from [216.145.54.172] (HELO mrout2.wahoo.com) > (216.145.54.172) > - by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 01 Jun 2009 = 06:04:28 > +0000 > -Received: from SNV-EXBH01.ds.corp.wahoo.com (snv- > exbh01.ds.corp.wahoo.com [207.126.227.249]) > - by mrout2.wahoo.com (8.13.6/8.13.6/y.out) with ESMTP id > n5163FGq038852 > - for ; Sun, 31 May 2009 23:03:15 - > 0700 (PDT) > -DomainKey-Signature: a=3Drsa-sha1; s=3Dserpent; d=3Dwahoo-inc.com; = c=3Dnofws; > q=3Ddns; > - h=3Dreceived:user-agent:date:subject:from:to:message-id: > - thread-topic:thread-index:in-reply-to:mime-version:content-type: > - content-transfer-encoding:x-originalarrivaltime; > - > b=3DrChE4SCnwtWaZpjhovkiXDKfDiVNdRRvsadSGG9S9bgvOexn/9/5JjE > Qx1pOR7Nb > -Received: from SNV-EXVS08.ds.corp.wahoo.com ([207.126.227.9]) by SNV- > EXBH01.ds.corp.wahoo.com with Microsoft SMTPSVC(6.0.3790.3959); > - Sun, 31 May 2009 23:03:15 -0700 > -Received: from 10.66.92.213 ([10.66.92.213]) by SNV- > EXVS08.ds.corp.wahoo.com ([207.126.227.58]) with Microsoft Exchange > Server HTTP-DAV ; > - Mon, 1 Jun 2009 06:03:15 +0000 > -User-Agent: Microsoft-Entourage/12.17.0.090302 > -Date: Mon, 01 Jun 2009 11:33:13 +0530 > -Subject: Re: question about when shuffle/sort start working > -From: Sam Judgement > -To: > -Message-ID: > -Thread-Topic: question about when shuffle/sort start working > -Thread-Index: AcnifqWrLG6N7GAk7kqy9QalVWfegQ=3D=3D > -In-Reply-To: <592088.35091.qm@web111010.mail.gq1.wahoo.com> > -Mime-version: 1.0 > -Content-type: text/plain; > - charset=3D"US-ASCII" > -Content-transfer-encoding: 7bit > -X-OriginalArrivalTime: 01 Jun 2009 06:03:15.0462 (UTC) > FILETIME=3D[A7231260:01C9E27E] > -X-Virus-Checked: Checked by ClamAV on apache.org > - > - > -No you cannot raise this event yourself, this event is generated = internally > -by the framework. > - > -I am guessing that what you probably want is to have a chain of = MapReduce > -Jobs where the output of one is automatically fed as input to = another. You > -can look at these classes: JobControl and ChainMapper/ChainReducer. > - > -Sam > - > -On 6/1/09 11:00 AM, "Jianmin Foo" wrote: > - > -> Thanks a lot for your explanation, Sam. > -> > -> So is this event generated by hadoop framework? Is there any API in > mapper to > -> fire this event? Actually, I am thinking to implement a mapper that = will > emit > -> some pairs, then fire this event to let the reducer = works, the > -> same mapper task then emit some other pairs and = repeat. > Do you > -> think is this logic feasible by current API? > -> > -> Thanks, > -> Jianmin > -> > -> > -> > -> > -> > -> ________________________________ > -> From: Sam Judgement > -> To: core-user@hadoop.apache.org > -> Sent: Monday, June 1, 2009 12:26:31 PM > -> Subject: Re: question about when shuffle/sort start working > -> > -> When a Mapper completes, MapCompletionEvents are generated. > Reducers try to > -> fetch map outputs for a given map only on the receipt of such = events. > -> > -> Sam > -> > -> > -> On 5/30/09 10:00 AM, "Jianmin Foo" wrote: > -> > ->> Hi, > ->> I am being confused by the protocol between mapper and reducer. > When mapper > ->> emitting the (key,value) pair done, is there any signal the mapper = send > out > ->> to > ->> hadoop framework in protocol to indicate that map is done and the > ->> shuffle/sort > ->> can begin for reducer? If there is no this signal in protocol, = when the > ->> framework begin the shuffle/sort? > ->> > ->> Thanks, > ->> Jianmin > ->> > ->> > ->> > ->> > -> > -> > -> > - > - > +From core-user-return-14700-apmail-hadoop-core-user- > archive=3Dhadoop.apache.org@hadoop.apache.org Mon Jun 01 04:28:28 2009 > +Return-Path: archive=3Dhadoop.apache.org@hadoop.apache.org> > +Delivered-To: apmail-hadoop-core-user-archive@www.apache.org > +Received: (qmail 19921 invoked from network); 1 Jun 2009 04:28:28 = -0000 > +Received: from hermes.apache.org (HELO mail.apache.org) = (140.211.11.3) > + by minotaur.apache.org with SMTP; 1 Jun 2009 04:28:28 -0000 > +Received: (qmail 84995 invoked by uid 500); 1 Jun 2009 04:28:38 -0000 > +Delivered-To: apmail-hadoop-core-user-archive@hadoop.apache.org > +Received: (qmail 84895 invoked by uid 500); 1 Jun 2009 04:28:38 -0000 > +Mailing-List: contact core-user-help@hadoop.apache.org; run by ezmlm > +Precedence: bulk > +List-Help: > +List-Unsubscribe: > +List-Post: > +List-Id: > +Reply-To: core-user@hadoop.apache.org > +Delivered-To: mailing list core-user@hadoop.apache.org > +Received: (qmail 84885 invoked by uid 99); 1 Jun 2009 04:28:38 -0000 > +Received: from athena.apache.org (HELO athena.apache.org) > (140.211.11.136) > + by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 01 Jun 2009 = 04:28:38 > +0000 > +X-ASF-Spam-Status: No, hits=3D1.2 required=3D10.0 > + tests=3DSPF_NEUTRAL > +X-Spam-Check-By: apache.org > +Received-SPF: neutral (athena.apache.org: local policy) > +Received: from [69.147.107.21] (HELO mrout2-b.corp.re1.wahoo.com) > (69.147.107.21) > + by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 01 Jun 2009 = 04:28:26 > +0000 > +Received: from SNV-EXPF01.ds.corp.wahoo.com (snv- > expf01.ds.corp.wahoo.com [207.126.227.250]) > + by mrout2-b.corp.re1.wahoo.com (8.13.8/8.13.8/y.out) with ESMTP > id n514QYA6099963 > + for ; Sun, 31 May 2009 21:26:35 - > 0700 (PDT) > +DomainKey-Signature: a=3Drsa-sha1; s=3Dserpent; d=3Dwahoo-inc.com; = c=3Dnofws; > q=3Ddns; > + h=3Dreceived:user-agent:date:subject:from:to:message-id: > + thread-topic:thread-index:in-reply-to:mime-version:content-type: > + content-transfer-encoding:x-originalarrivaltime; > + > b=3DYVtSNdgjeeSBS1yY3XDolul49i+HrgNG7QszMo9LzGnrwejjgsl5+iUM > 6EiQgEpV > +Received: from SNV-EXVS08.ds.corp.wahoo.com ([207.126.227.9]) by SNV- > EXPF01.ds.corp.wahoo.com with Microsoft SMTPSVC(6.0.3790.3959); > + Sun, 31 May 2009 21:26:34 -0700 > +Received: from 10.66.92.213 ([10.66.92.213]) by SNV- > EXVS08.ds.corp.wahoo.com ([207.126.227.58]) with Microsoft Exchange > Server HTTP-DAV ; > + Mon, 1 Jun 2009 04:26:33 +0000 > +User-Agent: Microsoft-Entourage/12.17.0.090302 > +Date: Mon, 01 Jun 2009 09:56:31 +0530 > +Subject: Re: question about when shuffle/sort start working > +From: Sam Judgement > +To: > +Message-ID: > +Thread-Topic: question about when shuffle/sort start working > +Thread-Index: AcnicSNoBw19cMU8UEaXwAdZ1YYhuw=3D=3D > +In-Reply-To: <440622.41041.qm@web111005.mail.gq1.wahoo.com> > +Mime-version: 1.0 > +Content-type: text/plain; > + charset=3D"US-ASCII" > +Content-transfer-encoding: 7bit > +X-OriginalArrivalTime: 01 Jun 2009 04:26:34.0501 (UTC) > FILETIME=3D[257EAB50:01C9E271] > +X-Virus-Checked: Checked by ClamAV on apache.org > + > +When a Mapper completes, MapCompletionEvents are generated. > Reducers try to > +fetch map outputs for a given map only on the receipt of such events. > + > +Sam > + > + > +On 5/30/09 10:00 AM, "Jianmin Foo" wrote: > + > +> Hi, > +> I am being confused by the protocol between mapper and reducer. = When > mapper > +> emitting the (key,value) pair done, is there any signal the mapper = send > out to > +> hadoop framework in protocol to indicate that map is done and the > shuffle/sort > +> can begin for reducer? If there is no this signal in protocol, when = the > +> framework begin the shuffle/sort? > +> > +> Thanks, > +> Jianmin > +> > +> > +> > +> > + > + > +From core-user-return-14701-apmail-hadoop-core-user- > archive=3Dhadoop.apache.org@hadoop.apache.org Mon Jun 01 05:31:14 2009 > +Return-Path: archive=3Dhadoop.apache.org@hadoop.apache.org> > +Delivered-To: apmail-hadoop-core-user-archive@www.apache.org > +Received: (qmail 38243 invoked from network); 1 Jun 2009 05:31:14 = -0000 > +Received: from hermes.apache.org (HELO mail.apache.org) = (140.211.11.3) > + by minotaur.apache.org with SMTP; 1 Jun 2009 05:31:14 -0000 > +Received: (qmail 15621 invoked by uid 500); 1 Jun 2009 05:31:24 -0000 > +Delivered-To: apmail-hadoop-core-user-archive@hadoop.apache.org > +Received: (qmail 15557 invoked by uid 500); 1 Jun 2009 05:31:24 -0000 > +Mailing-List: contact core-user-help@hadoop.apache.org; run by ezmlm > +Precedence: bulk > +List-Help: > +List-Unsubscribe: > +List-Post: > +List-Id: > +Reply-To: core-user@hadoop.apache.org > +Delivered-To: mailing list core-user@hadoop.apache.org > +Received: (qmail 15547 invoked by uid 99); 1 Jun 2009 05:31:24 -0000 > +Received: from nike.apache.org (HELO nike.apache.org) = (192.87.106.230) > + by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 01 Jun 2009 = 05:31:24 > +0000 > +X-ASF-Spam-Status: No, hits=3D2.2 required=3D10.0 > + tests=3DHTML_MESSAGE,SPF_PASS > +X-Spam-Check-By: apache.org > +Received-SPF: pass (nike.apache.org: local policy) > +Received: from [68.142.237.94] (HELO n9.bullet.re3.wahoo.com) > (68.142.237.94) > + by apache.org (qpsmtpd/0.29) with SMTP; Mon, 01 Jun 2009 05:31:11 > +0000 > +Received: from [68.142.237.88] by n9.bullet.re3.wahoo.com with NNFMP; > 01 Jun 2009 05:30:50 -0000 > +Received: from [67.195.9.82] by t4.bullet.re3.wahoo.com with NNFMP; = 01 > Jun 2009 05:30:49 -0000 > +Received: from [67.195.9.99] by t2.bullet.mail.gq1.wahoo.com with = NNFMP; > 01 Jun 2009 05:30:49 -0000 > +Received: from [127.0.0.1] by omp103.mail.gq1.wahoo.com with NNFMP; > 01 Jun 2009 05:28:01 -0000 > +X-wahoo-Newman-Property: ymail-3 > +X-wahoo-Newman-Id: 796121.97519.bm@omp103.mail.gq1.wahoo.com > +Received: (qmail 35264 invoked by uid 60001); 1 Jun 2009 05:30:49 = -0000 > +DKIM-Signature: v=3D1; a=3Drsa-sha256; c=3Drelaxed/relaxed; = d=3Dwahoo.com; > s=3Ds1024; t=3D1243834249; > bh=3DR8qzdi/IbLyO8UwpnaujDpT9E+6bJ7nkmZN2803EmRk=3D; h=3DMessage-ID:X- > YMail-OSG:Received:X-Mailer:References:Date:From:Subject:To:In-Reply- > To:MIME-Version:Content-Type; > b=3Dvq4c6RIDbkuLPYd8mirusIXf6DqTb/IeT55In7W00Y5Sxx1ZiXBb78yE9+TDfXJ0 > elsEZvqv4ocyvolGE0eGtyYeJA0mZikpRNu6pidxPNpCplOcLHBRz7YQ7iERwv3T > agRlWy2Xd3oD9ZeV0A05P7WUOiNNX1PUUJD1IVdrEZo=3D > +DomainKey-Signature:a=3Drsa-sha1; q=3Ddns; c=3Dnofws; > + s=3Ds1024; d=3Dwahoo.com; > + h=3DMessage-ID:X-YMail-OSG:Received:X- > Mailer:References:Date:From:Subject:To:In-Reply-To:MIME- > Version:Content-Type; > + > b=3D6HXZV98ON5vBwmE/xS8stVD0D2F4dkMY7a0suX5KVTb736JdR8G59mqBq/ > dWcpbFTLiCLtxi18LMb/dU1RKRGOEdn3l3j/jKXhBrhIgfg3qtNskPedXDKBvn7JG > XiSkqpA/tUtPjvc0Uuk8/LaA01SQTz40Engg7nD8/EJdIAhA=3D; > +Message-ID: <592088.35091.qm@web111010.mail.gq1.wahoo.com> > +X-YMail-OSG: > KzhhrJYVM1m.MCS6vRpRP2ZZO2PrfnbngosELDCIa91ZqvhJph4RdmzfUW0jw > 9W04RCSch1K730bPohwNpNBIk2QR_zt4_mfbhfq7YEPkSoz9LSXG90P9vIo5Fc > 8qyZN0U6vA9gtdyGQTpN5ahvillUH9nAF0TMWv2SvZJLjPlQ0Z0p8oK8ltBwGTg > LrM8Jtdn9D29yoRyi3_EpVOfdD9OP.EK50Vr1XwSUYMbnpZ0WGHMwd.Yig7A > 6Elwadm3YVbfOdx2mfrG.jQsUAxQjRBNvbrOM57.FaE11kHTe9aoBWSeihNg-- > +Received: from [216.145.54.7] by web111010.mail.gq1.wahoo.com via = HTTP; > Sun, 31 May 2009 22:30:49 PDT > +X-Mailer: wahooMailRC/1277.43 wahooMailWebService/0.7.289.10 > +References: > +Date: Sun, 31 May 2009 22:30:49 -0700 (PDT) > +From: Jianmin Foo > +Subject: Re: question about when shuffle/sort start working > +To: core-user@hadoop.apache.org > +In-Reply-To: > +MIME-Version: 1.0 > +Content-Type: multipart/alternative; boundary=3D"0-1193839393- > 1243834249=3D:35091" > +X-Virus-Checked: Checked by ClamAV on apache.org > + > +--0-1193839393-1243834249=3D:35091 > +Content-Type: text/plain; charset=3Dus-ascii > + > +Thanks a lot for your explanation, Sam. > + > +So is this event generated by hadoop framework? Is there any API in > mapper to fire this event? Actually, I am thinking to implement a = mapper that > will emit some pairs, then fire this event to let the = reducer > works, the same mapper task then emit some other pairs = and > repeat. Do you think is this logic feasible by current API? > + > +Thanks, > +Jianmin > + > + > + > + > + > +________________________________ > +From: Sam Judgement > +To: core-user@hadoop.apache.org > +Sent: Monday, June 1, 2009 12:26:31 PM > +Subject: Re: question about when shuffle/sort start working > + > +When a Mapper completes, MapCompletionEvents are generated. > Reducers try to > +fetch map outputs for a given map only on the receipt of such events. > + > +Sam > + > + > +On 5/30/09 10:00 AM, "Jianmin Foo" wrote: > + > +> Hi, > +> I am being confused by the protocol between mapper and reducer. = When > mapper > +> emitting the (key,value) pair done, is there any signal the mapper = send > out to > +> hadoop framework in protocol to indicate that map is done and the > shuffle/sort > +> can begin for reducer? If there is no this signal in protocol, when = the > +> framework begin the shuffle/sort? > +> > +> Thanks, > +> Jianmin > +> > +> > +> > +> > + > + > + > +--0-1193839393-1243834249=3D:35091-- > + > + > +From core-user-return-14702-apmail-hadoop-core-user- > archive=3Dhadoop.apache.org@hadoop.apache.org Mon Jun 01 06:04:30 2009 > +Return-Path: archive=3Dhadoop.apache.org@hadoop.apache.org> > +Delivered-To: apmail-hadoop-core-user-archive@www.apache.org > +Received: (qmail 53387 invoked from network); 1 Jun 2009 06:04:29 = -0000 > +Received: from hermes.apache.org (HELO mail.apache.org) = (140.211.11.3) > + by minotaur.apache.org with SMTP; 1 Jun 2009 06:04:29 -0000 > +Received: (qmail 39066 invoked by uid 500); 1 Jun 2009 06:04:39 -0000 > +Delivered-To: apmail-hadoop-core-user-archive@hadoop.apache.org > +Received: (qmail 38970 invoked by uid 500); 1 Jun 2009 06:04:39 -0000 > +Mailing-List: contact core-user-help@hadoop.apache.org; run by ezmlm > +Precedence: bulk > +List-Help: > +List-Unsubscribe: > +List-Post: > +List-Id: > +Reply-To: core-user@hadoop.apache.org > +Delivered-To: mailing list core-user@hadoop.apache.org > +Received: (qmail 38955 invoked by uid 99); 1 Jun 2009 06:04:39 -0000 > +Received: from athena.apache.org (HELO athena.apache.org) > (140.211.11.136) > + by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 01 Jun 2009 = 06:04:39 > +0000 > +X-ASF-Spam-Status: No, hits=3D1.2 required=3D10.0 > + tests=3DSPF_NEUTRAL > +X-Spam-Check-By: apache.org > +Received-SPF: neutral (athena.apache.org: local policy) > +Received: from [216.145.54.172] (HELO mrout2.wahoo.com) > (216.145.54.172) > + by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 01 Jun 2009 = 06:04:28 > +0000 > +Received: from SNV-EXBH01.ds.corp.wahoo.com (snv- > exbh01.ds.corp.wahoo.com [207.126.227.249]) > + by mrout2.wahoo.com (8.13.6/8.13.6/y.out) with ESMTP id > n5163FGq038852 > + for ; Sun, 31 May 2009 23:03:15 - > 0700 (PDT) > +DomainKey-Signature: a=3Drsa-sha1; s=3Dserpent; d=3Dwahoo-inc.com; = c=3Dnofws; > q=3Ddns; > + h=3Dreceived:user-agent:date:subject:from:to:message-id: > + thread-topic:thread-index:in-reply-to:mime-version:content-type: > + content-transfer-encoding:x-originalarrivaltime; > + > b=3DrChE4SCnwtWaZpjhovkiXDKfDiVNdRRvsadSGG9S9bgvOexn/9/5JjE > Qx1pOR7Nb > +Received: from SNV-EXVS08.ds.corp.wahoo.com ([207.126.227.9]) by SNV- > EXBH01.ds.corp.wahoo.com with Microsoft SMTPSVC(6.0.3790.3959); > + Sun, 31 May 2009 23:03:15 -0700 > +Received: from 10.66.92.213 ([10.66.92.213]) by SNV- > EXVS08.ds.corp.wahoo.com ([207.126.227.58]) with Microsoft Exchange > Server HTTP-DAV ; > + Mon, 1 Jun 2009 06:03:15 +0000 > +User-Agent: Microsoft-Entourage/12.17.0.090302 > +Date: Mon, 01 Jun 2009 11:33:13 +0530 > +Subject: Re: question about when shuffle/sort start working > +From: Sam Judgement > +To: > +Message-ID: > +Thread-Topic: question about when shuffle/sort start working > +Thread-Index: AcnifqWrLG6N7GAk7kqy9QalVWfegQ=3D=3D > +In-Reply-To: <592088.35091.qm@web111010.mail.gq1.wahoo.com> > +Mime-version: 1.0 > +Content-type: text/plain; > + charset=3D"US-ASCII" > +Content-transfer-encoding: 7bit > +X-OriginalArrivalTime: 01 Jun 2009 06:03:15.0462 (UTC) > FILETIME=3D[A7231260:01C9E27E] > +X-Virus-Checked: Checked by ClamAV on apache.org > + > + > +No you cannot raise this event yourself, this event is generated = internally > +by the framework. > + > +I am guessing that what you probably want is to have a chain of = MapReduce > +Jobs where the output of one is automatically fed as input to = another. You > +can look at these classes: JobControl and ChainMapper/ChainReducer. > + > +Sam > + > +On 6/1/09 11:00 AM, "Jianmin Foo" wrote: > + > +> Thanks a lot for your explanation, Sam. > +> > +> So is this event generated by hadoop framework? Is there any API in > mapper to > +> fire this event? Actually, I am thinking to implement a mapper that = will > emit > +> some pairs, then fire this event to let the reducer = works, the > +> same mapper task then emit some other pairs and = repeat. > Do you > +> think is this logic feasible by current API? > +> > +> Thanks, > +> Jianmin > +> > +> > +> > +> > +> > +> ________________________________ > +> From: Sam Judgement > +> To: core-user@hadoop.apache.org > +> Sent: Monday, June 1, 2009 12:26:31 PM > +> Subject: Re: question about when shuffle/sort start working > +> > +> When a Mapper completes, MapCompletionEvents are generated. > Reducers try to > +> fetch map outputs for a given map only on the receipt of such = events. > +> > +> Sam > +> > +> > +> On 5/30/09 10:00 AM, "Jianmin Foo" wrote: > +> > +>> Hi, > +>> I am being confused by the protocol between mapper and reducer. > When mapper > +>> emitting the (key,value) pair done, is there any signal the mapper = send > out > +>> to > +>> hadoop framework in protocol to indicate that map is done and the > +>> shuffle/sort > +>> can begin for reducer? If there is no this signal in protocol, = when the > +>> framework begin the shuffle/sort? > +>> > +>> Thanks, > +>> Jianmin > +>> > +>> > +>> > +>> > +> > +> > +> > + > + >=20 > Modified: lucene/dev/trunk/solr/contrib/morphlines-core/src/test- > files/test-documents/email.eml > URL: > http://svn.apache.org/viewvc/lucene/dev/trunk/solr/contrib/morphlines- > core/src/test-files/test- > = documents/email.eml?rev=3D1570955&r1=3D1570954&r2=3D1570955&view=3Ddiff > = =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > --- lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/email.eml (original) > +++ lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/email.eml Sun Feb 23 02:22:02 2014 > @@ -1,40 +1,40 @@ > -MIME-Version: 1.0 > -Received: by 10.216.199.5 with HTTP; Wed, 27 Nov 2013 12:01:23 -0800 > -(PST) > -Date: Wed, 27 Nov 2013 13:01:23 -0700 > -Delivered-To: foo@cloudera.com > -Message-ID: > - 8cmAgK6w@mail.gmail.com> > -Subject: Test EML > -From: Patrick Foo > -To: Patrick Foo > -Content-Type: multipart/alternative; > -boundary=3D001a11c3815cb55dda04ec2e0f3b > - > ---001a11c3815cb55dda04ec2e0f3b > -Content-Type: text/plain; charset=3DISO-8859-1 > - > -This is a test > - > --- > -Patrick Foo > -Customer Operations Engineer > - > - > - > ---001a11c3815cb55dda04ec2e0f3b > -Content-Type: text/html; charset=3DISO-8859-1 > -Content-Transfer-Encoding: quoted-printable > - > -
This is a test

-- > -
- dir=3D3D"ltr">Patrick Foo
Customer Operations > -Engineer

=3D > -
> - > - > ---001a11c3815cb55dda04ec2e0f3b-- > +MIME-Version: 1.0 > +Received: by 10.216.199.5 with HTTP; Wed, 27 Nov 2013 12:01:23 -0800 > +(PST) > +Date: Wed, 27 Nov 2013 13:01:23 -0700 > +Delivered-To: foo@cloudera.com > +Message-ID: > + 8cmAgK6w@mail.gmail.com> > +Subject: Test EML > +From: Patrick Foo > +To: Patrick Foo > +Content-Type: multipart/alternative; > +boundary=3D001a11c3815cb55dda04ec2e0f3b > + > +--001a11c3815cb55dda04ec2e0f3b > +Content-Type: text/plain; charset=3DISO-8859-1 > + > +This is a test > + > +-- > +Patrick Foo > +Customer Operations Engineer > + > + > + > +--001a11c3815cb55dda04ec2e0f3b > +Content-Type: text/html; charset=3DISO-8859-1 > +Content-Transfer-Encoding: quoted-printable > + > +
This is a test

-- > +
+ dir=3D3D"ltr">Patrick Foo
Customer Operations > +Engineer

=3D > +
> + > + > +--001a11c3815cb55dda04ec2e0f3b-- >=20 > Modified: lucene/dev/trunk/solr/contrib/morphlines-core/src/test- > files/test-documents/rsstest.rss > URL: > http://svn.apache.org/viewvc/lucene/dev/trunk/solr/contrib/morphlines- > core/src/test-files/test- > = documents/rsstest.rss?rev=3D1570955&r1=3D1570954&r2=3D1570955&view=3Ddiff= > = =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > --- lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/rsstest.rss (original) > +++ lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/rsstest.rss Sun Feb 23 02:22:02 2014 > @@ -1,36 +1,36 @@ > - > - > - > - > - TestChannel > - http://test.channel.com/ > - Sample RSS File for Junit test > - en-us > - > - > - Home Page of Chris Mattmann > - http://www-scf.usc.edu/~mattmann/ > - Chris Mattmann's home page > - > - > - Awesome Open Source Search Engine > - http://www.nutch.org/ > - Yup, that's what it is > - > - > - > + > + > + > + > + TestChannel > + http://test.channel.com/ > + Sample RSS File for Junit test > + en-us > + > + > + Home Page of Chris Mattmann > + http://www-scf.usc.edu/~mattmann/ > + Chris Mattmann's home page > + > + > + Awesome Open Source Search Engine > + http://www.nutch.org/ > + Yup, that's what it is > + > + > + >=20 > Modified: lucene/dev/trunk/solr/contrib/morphlines-core/src/test- > files/test-documents/sample-statuses-20120906-141433 > URL: > http://svn.apache.org/viewvc/lucene/dev/trunk/solr/contrib/morphlines- > core/src/test-files/test-documents/sample-statuses-20120906- > 141433?rev=3D1570955&r1=3D1570954&r2=3D1570955&view=3Ddiff > = =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > --- lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/sample-statuses-20120906-141433 (original) > +++ lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/sample-statuses-20120906-141433 Sun Feb 23 02:22:02 2014 > @@ -1,4 +1,4 @@ > -1000 > -{"text":"sample tweet > = one","retweet_count":0,"in_reply_to_user_id":null,"retweeted":false,"trun= > = cated":false,"source":"href=3D\"http:\/\/sample.com\"","id_str":"12345678= 91" > = ,"entities":{"user_mentions":[],"hashtags":[],"urls":[]},"in_reply_to_sta= tus_i > = d":null,"place":null,"in_reply_to_status_id_str":null,"coordinates":null,= "creat > ed_at":"Wed Sep 05 01:01:01 +0000 > = 1985","in_reply_to_screen_name":null,"favorited":false,"in_reply_to_user_= > = id_str":null,"user":{"default_profile_image":false,"friends_count":111,"p= rofil > e_background_color":"3C0C29","location":"Palo > = Alto","is_translator":false,"profile_background_tile":true,"favourites_co= unt" > = :11,"verified":false,"profile_sidebar_fill_color":"efefef","follow_reques= t_se > = nt":null,"contributors_enabled":false,"description":"desc1","profile_side= bar > = _border_color":"eeeeee","profile_image_url_https":"https:\/\/si0.twimg.co= > = m\/profile_images\/1\/normal.jpg","id_str":"1111111","listed_count":1,"la= n > g":"en","screen_name":"fake_user1","show_all_inline_media":fals >=20 > = e,"profile_use_background_image":true,"profile_image_url":"http:\/\/a0.t > = wimg.com\/profile_images\/1111111\/normal.jpg","default_profile":false,"s= > tatuses_count":11111,"created_at":"Thu Apr 07 11:04:54 +0000 > = 1985","profile_text_color":"333333","followers_count":111,"protected":fal= se > = ,"following":null,"notifications":null,"profile_background_image_url":"ht= tp:\ > = /\/a0.twimg.com\/images\/themes\/theme1\/bg.gif","time_zone":null,"url" > = :null,"name":"name1","geo_enabled":false,"profile_link_color":"009999","i= d > = ":1111112,"profile_background_image_url_https":"https:\/\/si0.twimg.com\ > = /images\/themes\/theme1\/bg.gif","utc_offset":null},"id":11111112,"contri= > butors":null,"geo":null} > -2000 > +1000 > +{"text":"sample tweet > = one","retweet_count":0,"in_reply_to_user_id":null,"retweeted":false,"trun= > = cated":false,"source":"href=3D\"http:\/\/sample.com\"","id_str":"12345678= 91" > = ,"entities":{"user_mentions":[],"hashtags":[],"urls":[]},"in_reply_to_sta= tus_i > = d":null,"place":null,"in_reply_to_status_id_str":null,"coordinates":null,= "creat > ed_at":"Wed Sep 05 01:01:01 +0000 > = 1985","in_reply_to_screen_name":null,"favorited":false,"in_reply_to_user_= > = id_str":null,"user":{"default_profile_image":false,"friends_count":111,"p= rofil > e_background_color":"3C0C29","location":"Palo > = Alto","is_translator":false,"profile_background_tile":true,"favourites_co= unt" > = :11,"verified":false,"profile_sidebar_fill_color":"efefef","follow_reques= t_se > = nt":null,"contributors_enabled":false,"description":"desc1","profile_side= bar > = _border_color":"eeeeee","profile_image_url_https":"https:\/\/si0.twimg.co= > = m\/profile_images\/1\/normal.jpg","id_str":"1111111","listed_count":1,"la= n > g":"en","screen_name":"fake_user1","show_all_inline_media":fals >=20 > = e,"profile_use_background_image":true,"profile_image_url":"http:\/\/a0.t > = wimg.com\/profile_images\/1111111\/normal.jpg","default_profile":false,"s= > tatuses_count":11111,"created_at":"Thu Apr 07 11:04:54 +0000 > = 1985","profile_text_color":"333333","followers_count":111,"protected":fal= se > = ,"following":null,"notifications":null,"profile_background_image_url":"ht= tp:\ > = /\/a0.twimg.com\/images\/themes\/theme1\/bg.gif","time_zone":null,"url" > = :null,"name":"name1","geo_enabled":false,"profile_link_color":"009999","i= d > = ":1111112,"profile_background_image_url_https":"https:\/\/si0.twimg.com\ > = /images\/themes\/theme1\/bg.gif","utc_offset":null},"id":11111112,"contri= > butors":null,"geo":null} > +2000 > {"text":"sample tweet > = two","retweet_count":0,"in_reply_to_user_id":null,"retweeted":false,"trun= > = cated":false,"source":"href=3D\"http:\/\/sample.com\"","id_str":"23456789= 02" > = ,"entities":{"user_mentions":[],"hashtags":[],"urls":[]},"in_reply_to_sta= tus_i > = d":null,"place":null,"in_reply_to_status_id_str":null,"coordinates":null,= "creat > ed_at":"Wed Sep 05 02:14:34 +0000 > = 1985","in_reply_to_screen_name":null,"favorited":false,"in_reply_to_user_= > = id_str":null,"user":{"default_profile_image":false,"friends_count":222,"p= rofil > e_background_color":"3C0C29","location":"San > = Francisco","is_translator":false,"profile_background_tile":false,"favouri= tes_c > = ount":22,"verified":false,"profile_sidebar_fill_color":"B2D948","follow_r= equ > = est_sent":null,"contributors_enabled":false,"description":"desc2","profil= e_si > = debar_border_color":"8EC63D","profile_image_url_https":"https:\/\/si0.twi= > = mg.com\/profile_images\/22222222\/image_normal.jpg","id_str":"2222222", > "listed_count":0,"lang":"en","screen_name":"fake_user2","show_all_ >=20 > = inline_media":false,"profile_use_background_image":true,"profile_image_u > = rl":"http:\/\/a0.twimg.com\/profile_images\/2222222\/image_normal.jpg"," > default_profile":false,"statuses_count":222222,"created_at":"Thu Aug = 04 > 11:33:28 +0000 > = 1985","profile_text_color":"444444","followers_count":222,"protected":fal= se > = ,"following":null,"notifications":null,"profile_background_image_url":"ht= tp:\ > /\/a0.twimg.com\/profile_background_images\/222222\/222222.jpg","time_ > zone":"Central Time (US & > = Canada)","url":null,"name":"name2","geo_enabled":false,"profile_link_colo= r > = ":"9A0057","id":2222223,"profile_background_image_url_https":"https:\/\/s= i > 0.twimg.com\/profile_background_images\/2222222\/22222.jpg","utc_offse > t":-21600},"id":222223,"contributors":null,"geo":null} > \ No newline at end of file >=20 > Modified: lucene/dev/trunk/solr/contrib/morphlines-core/src/test- > files/test-documents/testEMLX.emlx > URL: > http://svn.apache.org/viewvc/lucene/dev/trunk/solr/contrib/morphlines- > core/src/test-files/test- > = documents/testEMLX.emlx?rev=3D1570955&r1=3D1570954&r2=3D1570955&view=3Ddi= > ff > = =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > --- lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/testEMLX.emlx (original) > +++ lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/testEMLX.emlx Sun Feb 23 02:22:02 2014 > @@ -1,72 +1,72 @@ > - > - > -1795 > -From: "Julien Nioche (JIRA)" > -To: dev@tika.apache.org > -Subject: [jira] Commented: (TIKA-461) RFC822 messages not parsed > -Reply-To: dev@tika.apache.org > -Delivered-To: mailing list dev@tika.apache.org > -Date: Mon, 6 Sep 2010 05:25:34 -0400 (EDT) > -In-Reply-To: <6089099.260231278600349994.JavaMail.jira@thor> > -MIME-Version: 1.0 > -Content-Type: text/plain; charset=3Dutf-8 > -Content-Transfer-Encoding: 7bit > -X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 > -X-Virus-Checked: Checked by ClamAV on apache.org > - > - > - [ https://issues.apache.org/jira/browse/TIKA- > 461?page=3Dcom.atlassian.jira.plugin.system.issuetabpanels:comment- > tabpanel&focusedCommentId=3D12906468#action_12906468 ] > - > -Julien Nioche commented on TIKA-461: > ------------------------------------- > - > -I'll have a look at mime4j and try to use it in Tika > - > -> RFC822 messages not parsed > -> -------------------------- > -> > -> Key: TIKA-461 > -> URL: https://issues.apache.org/jira/browse/TIKA-461 > -> Project: Tika > -> Issue Type: Bug > -> Components: parser > -> Affects Versions: 0.7 > -> Reporter: Joshua Turner > -> Assignee: Julien Nioche > -> > -> Presented with an RFC822 message exported from Thunderbird, > AutodetectParser produces an empty body, and a Metadata containing = only > one key-value pair: "Content-Type=3Dmessage/rfc822". Directly calling > MboxParser likewise gives an empty body, but with two metadata pairs: > "Content-Encoding=3Dus-ascii Content-Type=3Dapplication/mbox". > -> A quick peek at the source of MboxParser shows that the = implementation > is pretty naive. If the wiring can be sorted out, something like = Apache James' > mime4j might be a better bet. > - > --- > -This message is automatically generated by JIRA. > -- > -You can reply to this email to add a comment to the issue online. > - > - > - "http://www.apple.com/DTDs/PropertyList-1.0.dtd"> > - > - > - flags > - 0 > - sender > - "Julien Nioche (JIRA)" <jira@apache.org> > - subject > - [jira] Commented: (TIKA-461) RFC822 messages not > parsed > - to > - dev@tika.apache.org > - > + > + > +1795 > +From: "Julien Nioche (JIRA)" > +To: dev@tika.apache.org > +Subject: [jira] Commented: (TIKA-461) RFC822 messages not parsed > +Reply-To: dev@tika.apache.org > +Delivered-To: mailing list dev@tika.apache.org > +Date: Mon, 6 Sep 2010 05:25:34 -0400 (EDT) > +In-Reply-To: <6089099.260231278600349994.JavaMail.jira@thor> > +MIME-Version: 1.0 > +Content-Type: text/plain; charset=3Dutf-8 > +Content-Transfer-Encoding: 7bit > +X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 > +X-Virus-Checked: Checked by ClamAV on apache.org > + > + > + [ https://issues.apache.org/jira/browse/TIKA- > 461?page=3Dcom.atlassian.jira.plugin.system.issuetabpanels:comment- > tabpanel&focusedCommentId=3D12906468#action_12906468 ] > + > +Julien Nioche commented on TIKA-461: > +------------------------------------ > + > +I'll have a look at mime4j and try to use it in Tika > + > +> RFC822 messages not parsed > +> -------------------------- > +> > +> Key: TIKA-461 > +> URL: https://issues.apache.org/jira/browse/TIKA-461 > +> Project: Tika > +> Issue Type: Bug > +> Components: parser > +> Affects Versions: 0.7 > +> Reporter: Joshua Turner > +> Assignee: Julien Nioche > +> > +> Presented with an RFC822 message exported from Thunderbird, > AutodetectParser produces an empty body, and a Metadata containing = only > one key-value pair: "Content-Type=3Dmessage/rfc822". Directly calling > MboxParser likewise gives an empty body, but with two metadata pairs: > "Content-Encoding=3Dus-ascii Content-Type=3Dapplication/mbox". > +> A quick peek at the source of MboxParser shows that the = implementation > is pretty naive. If the wiring can be sorted out, something like = Apache James' > mime4j might be a better bet. > + > +-- > +This message is automatically generated by JIRA. > +- > +You can reply to this email to add a comment to the issue online. > + > + > + "http://www.apple.com/DTDs/PropertyList-1.0.dtd"> > + > + > + flags > + 0 > + sender > + "Julien Nioche (JIRA)" <jira@apache.org> > + subject > + [jira] Commented: (TIKA-461) RFC822 messages not > parsed > + to > + dev@tika.apache.org > + >=20 > Modified: lucene/dev/trunk/solr/contrib/morphlines-core/src/test- > files/test-documents/testRFC822 > URL: > http://svn.apache.org/viewvc/lucene/dev/trunk/solr/contrib/morphlines- > core/src/test-files/test- > = documents/testRFC822?rev=3D1570955&r1=3D1570954&r2=3D1570955&view=3Ddiff > = =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > --- lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/testRFC822 (original) > +++ lucene/dev/trunk/solr/contrib/morphlines-core/src/test-files/test- > documents/testRFC822 Sun Feb 23 02:22:02 2014 > @@ -1,41 +1,41 @@ > -From: "Julien Nioche (JIRA)" > -To: dev@tika.apache.org > -Subject: [jira] Commented: (TIKA-461) RFC822 messages not parsed > -Reply-To: dev@tika.apache.org > -Delivered-To: mailing list dev@tika.apache.org > -Date: Mon, 6 Sep 2010 05:25:34 -0400 (EDT) > -In-Reply-To: <6089099.260231278600349994.JavaMail.jira@thor> > -MIME-Version: 1.0 > -Content-Type: text/plain; charset=3Dutf-8 > -Content-Transfer-Encoding: 7bit > -X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 > -X-Virus-Checked: Checked by ClamAV on apache.org > - > - > - [ https://issues.apache.org/jira/browse/TIKA- > 461?page=3Dcom.atlassian.jira.plugin.system.issuetabpanels:comment- > tabpanel&focusedCommentId=3D12906468#action_12906468 ] > - > -Julien Nioche commented on TIKA-461: > ------------------------------------- > - > -I'll have a look at mime4j and try to use it in Tika > - > -> RFC822 messages not parsed > -> -------------------------- > -> > -> Key: TIKA-461 > -> URL: https://issues.apache.org/jira/browse/TIKA-461 > -> Project: Tika > -> Issue Type: Bug > -> Components: parser > -> Affects Versions: 0.7 > -> Reporter: Joshua Turner > -> Assignee: Julien Nioche > -> > -> Presented with an RFC822 message exported from Thunderbird, > AutodetectParser produces an empty body, and a Metadata containing = only > one key-value pair: "Content-Type=3Dmessage/rfc822". Directly calling > MboxParser likewise gives an empty body, but with two metadata pairs: > "Content-Encoding=3Dus-ascii Content-Type=3Dapplication/mbox". > -> A quick peek at the source of MboxParser shows that the = implementation > is pretty naive. If the wiring can be sorted out, something like = Apache James' > mime4j might be a better bet. > - > --- > -This message is automatically generated by JIRA. > -- > -You can reply to this email to add a comment to the issue online. > - > +From: "Julien Nioche (JIRA)" > +To: dev@tika.apache.org > +Subject: [jira] Commented: (TIKA-461) RFC822 messages not parsed > +Reply-To: dev@tika.apache.org > +Delivered-To: mailing list dev@tika.apache.org > +Date: Mon, 6 Sep 2010 05:25:34 -0400 (EDT) > +In-Reply-To: <6089099.260231278600349994.JavaMail.jira@thor> > +MIME-Version: 1.0 > +Content-Type: text/plain; charset=3Dutf-8 > +Content-Transfer-Encoding: 7bit > +X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 > +X-Virus-Checked: Checked by ClamAV on apache.org > + > + > + [ https://issues.apache.org/jira/browse/TIKA- > 461?page=3Dcom.atlassian.jira.plugin.system.issuetabpanels:comment- > tabpanel&focusedCommentId=3D12906468#action_12906468 ] > + > +Julien Nioche commented on TIKA-461: > +------------------------------------ > + > +I'll have a look at mime4j and try to use it in Tika > + > +> RFC822 messages not parsed > +> -------------------------- > +> > +> Key: TIKA-461 > +> URL: https://issues.apache.org/jira/browse/TIKA-461 > +> Project: Tika > +> Issue Type: Bug > +> Components: parser > +> Affects Versions: 0.7 > +> Reporter: Joshua Turner > +> Assignee: Julien Nioche > +> > +> Presented with an RFC822 message exported from Thunderbird, > AutodetectParser produces an empty body, and a Metadata containing = only > one key-value pair: "Content-Type=3Dmessage/rfc822". Directly calling > MboxParser likewise gives an empty body, but with two metadata pairs: > "Content-Encoding=3Dus-ascii Content-Type=3Dapplication/mbox". > +> A quick peek at the source of MboxParser shows that the = implementation > is pretty naive. If the wiring can be sorted out, something like = Apache James' > mime4j might be a better bet. > + > +-- > +This message is automatically generated by JIRA. > +- > +You can reply to this email to add a comment to the issue online. > + --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org For additional commands, e-mail: dev-help@lucene.apache.org