From: Corbin Hoenes
To: chukwa-user@hadoop.apache.org
Subject: Re: Custom Processor
Date: Fri, 5 Feb 2010 14:47:35 -0700
Message-Id: <8B90078D-0BD9-4143-9F91-A999CDC15584@tynt.com>

Thanks Eric... things are looking much more sane now after understanding
what a recordEntry is. :)

On Feb 5, 2010, at 12:57 PM, Eric Yang wrote:

> Log output from the ChukwaLog4J appender has a special property: it groups
> the output of the same log statement, delimited by the ^A character. Hence,
> the UTF8 adaptor will not break up the structure and sends the entire output
> as one record. When the custom processor gets a recordEntry (part of a
> chunk), it will contain the full output. The nextLine function gets the next
> record shipped in the same chunk. It should not be relied upon unless a
> specially crafted adaptor is shipping multiple records as a chunk and that
> chunk has no dependency outside itself. The current UTF8 adaptor sends
> multiple records depending on how fast the tail is happening. The UTF8
> adaptor should not be relied upon if you want to use the nextLine feature to
> build your parser state.
>
> Regards,
> Eric
>
> On 2/5/10 10:48 AM, "Corbin Hoenes" wrote:
>
>> So I'm implementing a custom processor, and my processor gets a recordEntry
>> passed to its parse method. It has more than a single log line in it. Is it
>> the responsibility of my processor to split it up into lines? The nextLine()
>> method in AbstractProcessor made me think that I'd always be passed a single
>> line.
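
For reference, here is a minimal sketch of how a processor might split a multi-line recordEntry itself instead of relying on nextLine(). It is plain Java with no Chukwa dependency. The class RecordEntrySplitter and the method splitLines are hypothetical names, not part of Chukwa, and the sketch assumes the lines inside a recordEntry are separated by ordinary newlines.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Chukwa: splits one recordEntry into log lines.
class RecordEntrySplitter {

    // Returns the non-empty lines of recordEntry.
    // Assumption: lines end with "\n" or "\r\n".
    static List<String> splitLines(String recordEntry) {
        List<String> lines = new ArrayList<>();
        if (recordEntry == null) {
            return lines;
        }
        for (String line : recordEntry.split("\r?\n")) {
            if (!line.isEmpty()) {
                lines.add(line);
            }
        }
        return lines;
    }

    public static void main(String[] args) {
        // Stand-in for the string handed to a processor's parse method.
        String recordEntry = "first log line\nsecond log line\n";
        for (String line : splitLines(recordEntry)) {
            System.out.println(line);
        }
    }
}
```

A custom processor could call something like splitLines(recordEntry) at the top of its parse method and handle each line in turn, so that any parser state stays within the one recordEntry it was given.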