From: Lou DeGenaro
Date: Thu, 9 Nov 2017 06:03:14 -0500
Subject: Re: DUCC's job goes into infintie loop
To: user@uima.apache.org

The first place to look is your job's logs. Visit the ducc-mon jobs page at
ducchost:42133/jobs.jsp, then click the ID of your job. Examine the logs by
clicking each log file name and looking for any revealing information. Feel
free to post non-confidential snippets here, or, if you'd like to chat in real
time, we can use HipChat.

Lou.

On Thu, Nov 9, 2017 at 5:19 AM, priyank sharma wrote:

> All!
>
> I have a problem with our DUCC cluster: a job process gets stuck and keeps
> processing the same batch again and again. Because of the maximum duration,
> the batch gets the reason or extraordinary status "CanceledByUser" and is
> then restarted with the same IDs. This usually happens after 15 to 20 days
> and goes away after restarting the DUCC cluster. Looking at the data store
> that the CAS consumer uses to ingest data, the data for this batch never
> gets ingested. So most probably this data is not being processed.
>
> How can I check whether this data is being processed or not?
>
> Are resources the issue, and why is the batch processed after restarting
> the cluster?
>
> We have a three-node cluster with 32 GB, 40 GB, and 28 GB of RAM.
>
> --
> Thanks and Regards
> Priyank Sharma
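
As a rough aid to the log check suggested in the reply above, the logs can also be scanned offline once they are copied to a local directory. The sketch below is only an illustration, not a DUCC tool: the function name `count_marker`, the `*.log` file pattern, and the directory layout are all assumptions. It makes no assumption about the log line format beyond searching for the status string "CanceledByUser" mentioned in the original question.

```python
from collections import Counter
from pathlib import Path


def count_marker(log_dir, marker="CanceledByUser", pattern="*.log"):
    """Count lines containing `marker` in each log file under `log_dir`.

    Returns a dict mapping each file's path (relative to log_dir) to its
    number of matching lines. Files with no matches are left out.
    The "*.log" pattern is an assumption; adjust it to the actual file names.
    """
    root = Path(log_dir)
    hits = Counter()
    for path in sorted(root.rglob(pattern)):
        # errors="replace" keeps the scan going if a log has odd bytes in it.
        with path.open(errors="replace") as handle:
            for line in handle:
                if marker in line:
                    hits[str(path.relative_to(root))] += 1
    return dict(hits)
```

Calling `count_marker("/path/to/copied/job/logs")` (a placeholder path) shows which files mention the status and how often; files with many hits may be a sensible place to start reading.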