From: "Devaraj Das"
Subject: RE: is that possible to make MapFile "mutable" ?
Date: Wed, 27 Jun 2007 00:00:15 +0530

No, you cannot append to a file on the DFS, so your app needs to be able to treat multiple files as one single logical file (as you point out). In your case, though, it seems you could design your app to do some buffering: for example, keep buffers for the n different files and flush them to files on the DFS only when the buffered data reaches a certain limit.

I am not sure whether fault handling is a concern for you, but there is the danger of losing the buffered messages if your app goes down. One way to handle this, assuming you have the ability to reprocess messages, is to checkpoint the state of the message processor in the DFS. The state could include the ID of the last message you flushed; the next time your app starts up, it reads the checkpoint file from the DFS, gets the ID, and processes messages starting from (ID + 1).

-----Original Message-----
From: Open Study [mailto:open.study@gmail.com]
Sent: Tuesday, June 26, 2007 8:42 PM
To: hadoop-user@lucene.apache.org
Subject: is that possible to make MapFile "mutable" ?

Hi all,

MapFile doesn't support an append mode, so an existing MapFile is overwritten whenever a new one with the same name is created. Is there any way I can append to a MapFile (or something similar) without erasing the old content? Or does that not make sense at all?

In my scenario I need to split a large number of messages (tens of millions) according to certain rules and put them into different MapFiles, which are supposed to be updated when new messages come in.
Since I didn't find a way to make a MapFile appendable, I have to create new MapFiles, so in the worst case one MapFile can contain as little as one message, and I will later have to merge them with their proper siblings.

Regards
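
The buffering and checkpointing idea from the reply above can be sketched roughly as follows. This is only an illustration under several assumptions, not Hadoop code: a local directory stands in for the DFS, messages carry increasing integer IDs and can be replayed, and the names `BufferedSplitter`, `route`, `read_logical`, and `checkpoint.json` are hypothetical. All buffers are flushed together so that a single "last flushed ID" is a safe restart point.

```python
import json
import os
from collections import defaultdict


class BufferedSplitter:
    """Buffer messages per target and flush them as new files (no appends)."""

    def __init__(self, root, route, limit=1000):
        self.root = root      # local directory standing in for the DFS (assumption)
        self.route = route    # hypothetical rule: message text -> target name
        self.limit = limit    # flush once this many messages are buffered
        self.buffers = defaultdict(list)
        self.buffered = 0
        os.makedirs(root, exist_ok=True)
        self.checkpoint_path = os.path.join(root, "checkpoint.json")
        self.last_flushed_id = self._read_checkpoint()
        self.last_seen_id = self.last_flushed_id

    def _read_checkpoint(self):
        try:
            with open(self.checkpoint_path) as f:
                return json.load(f)["last_flushed_id"]
        except FileNotFoundError:
            return -1  # nothing has been flushed yet

    def process(self, msg_id, text):
        if msg_id <= self.last_flushed_id:
            return  # already flushed before a restart; skip on replay
        self.buffers[self.route(text)].append((msg_id, text))
        self.buffered += 1
        self.last_seen_id = msg_id
        if self.buffered >= self.limit:
            self.flush()

    def flush(self):
        if self.buffered == 0:
            return
        # Write one new part file per target; existing parts are never appended to.
        for target, items in self.buffers.items():
            target_dir = os.path.join(self.root, target)
            os.makedirs(target_dir, exist_ok=True)
            part = os.path.join(target_dir, f"part-{self.last_seen_id:012d}")
            with open(part, "w") as f:
                for msg_id, text in items:
                    f.write(json.dumps([msg_id, text]) + "\n")
        # Record the checkpoint only after the data files are written.
        tmp = self.checkpoint_path + ".tmp"
        with open(tmp, "w") as f:
            json.dump({"last_flushed_id": self.last_seen_id}, f)
        os.replace(tmp, self.checkpoint_path)
        self.last_flushed_id = self.last_seen_id
        self.buffers.clear()
        self.buffered = 0


def read_logical(root, target):
    """Read all part files of one target, in name order, as one logical file."""
    target_dir = os.path.join(root, target)
    out = []
    for name in sorted(os.listdir(target_dir)):
        with open(os.path.join(target_dir, name)) as f:
            out.extend(tuple(json.loads(line)) for line in f)
    return out
```

On restart, constructing a new `BufferedSplitter` on the same directory reads the checkpoint, and replayed messages up to that ID are skipped, so processing effectively resumes at (ID + 1). If the app dies after writing part files but before the checkpoint, the replay may rewrite those parts; a real deployment would have to decide how to handle that, and would still want to merge small part files periodically.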