From: priyank sharma
Date: Thu, 09 Nov 2017 15:49:15 +0530
To: user@uima.apache.org
Subject: DUCC's job goes into infinite loop
Message-ID: <5A042BA3.1080509@orkash.com>

Hi all,

I have a problem with our DUCC cluster: a job process gets stuck and keeps processing the same batch again and again. Due to the maximum duration, the batch gets the reason or extraordinary status "CanceledByUser" and is then restarted with the same IDs. This usually happens after 15 to 20 days and goes away after restarting the DUCC cluster.

Looking at the data store that the CAS consumer uses to ingest data, the data for this batch never gets ingested, so most probably it is not being processed.

How can we check whether this data is being processed or not? Are resources the issue, and why is the batch processed after restarting the cluster?

We have a three-node cluster; the nodes have 32 GB, 40 GB, and 28 GB of RAM.

-- 
Thanks and Regards
Priyank Sharma