incubator-bigtop-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Roman Shaposhnik (Created) (JIRA)" <j...@apache.org>
Subject [jira] [Created] (BIGTOP-436) flume-node stop seems to mistarget some other java process on lucid
Date Thu, 08 Mar 2012 16:47:59 GMT
flume-node stop seems to mistarget some other java process on lucid
-------------------------------------------------------------------

                 Key: BIGTOP-436
                 URL: https://issues.apache.org/jira/browse/BIGTOP-436
             Project: Bigtop
          Issue Type: Bug
          Components: Debian
    Affects Versions: 0.3.0
            Reporter: Roman Shaposhnik
            Assignee: Bruno Mahé
             Fix For: 0.3.0


It seems that the following bit of code from flume-node reliably kills our jenkins slave process
on lucid:

{noformat}
  # FLUME-919 will put an end to such extreme violence
  FLUME_PID=`cat $FLUME_PID_FILE`
  if [ -n $FLUME_PID ]; then
    FLUME_PID_GROUP=$(ps -o pgrp -p ${FLUME_PID} h)

    if [ -n $FLUME_PID_GROUP ]; then
      kill -TERM -${FLUME_PID_GROUP} &>/dev/null
      sleep 5
      kill -KILL -${FLUME_PID_GROUP} &>/dev/null

      rm -f $LOCKFILE $FLUME_PID_FILE
    fi
  fi
  return 0
{noformat}

Here's how it happens:
   http://bigtop01.cloudera.org:8080/view/Bigtop-trunk/job/Bigtop-trunk-packagetest-lucid/label=lucid-slave/1/console

We need to investigate and possibly fix this.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

Mime
View raw message