From: fab wol <darkwolli32@gmail.com>
Date: Thu, 27 Mar 2014 23:36:32 +0100
To: user@hive.apache.org
Subject: Re: MSCK REPAIR TABLE

Hey Stephen,

thanks for the advice, but as I wrote in my first post, I wanted to do that anyway. Thanks also for the explanation of why this is indeed the best way to go for a production system ...

Cheers
Wolli

On 27.03.2014 16:05, Stephen Sprague wrote:
FWIW, I would not have the REPAIR TABLE statement as part of a production job stream. That's kind of a poor man's way to employ dynamic partitioning off the back end.

Why not either use Hive's dynamic partitioning features or pre-declare your partitions? That way you are explicitly coding for your purpose, rather than running a general REPAIR TABLE on the back end knowing you "broke it" up front.
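Both alternatives can be sketched in HiveQL; the table, column, and partition names below are hypothetical placeholders, not from the original thread:

```sql
-- Option 1: pre-declare each partition explicitly before loading into it
-- (my_table / dt are hypothetical names)
ALTER TABLE my_table ADD IF NOT EXISTS PARTITION (dt='2014-03-27');

-- Option 2: let Hive create the partitions dynamically at insert time
SET hive.exec.dynamic.partition=true;
SET hive.exec.dynamic.partition.mode=nonstrict;
INSERT OVERWRITE TABLE my_table PARTITION (dt)
SELECT col_a, col_b, dt FROM staging_table;
```

Either way the metastore is updated as part of the load itself, so no repair pass is needed afterwards.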

just a suggestion!


On Thu, Mar 27, 2014 at 3:18 AM, fab wol <darkwolli32@gmail.com> wrote:
Hey Nitin and everyone else,

From memory, the Hive CLI error was much the same as the Beeline error and just as uninformative, so there would have been no uplift there.

I restarted the cluster (it is a cloud cluster provided by http://www.unbelievable-machine.net) to get the HiveServer2 log and to be sure that everything is set up correctly. During this, all TaskTrackers are deleted and newly set up (HDFS and storage are not touched at all, nor are the configs). After that, the MSCK REPAIR TABLE statement runs fine, and it is actually not as slow as I thought it might be (ca. 110 secs per table). I guess some logs/tmp/cache data had stacked up, and that might have caused the errors ...

Slightly confusing, but I will post here if I find out in the future what exactly was throwing the error ...

Cheers for the help
Wolli


2014-03-27 11:03 GMT+01:00 Nitin Pawar <nitinpawar432@gmail.com>:

Without an error stack, it is very hard to tell what's wrong.

Would it be possible for you to run it via the Hive CLI and grab some logs there?


On Thu, Mar 27, 2014 at 3:29 PM, fab wol <darkwolli32@gmail.com> wrote:
Hey Nitin,

The HiveServer2 log unfortunately says nothing:

Mon Mar 24 17:41:18 CET 2014 hiveserver2 stopped, pid 2540
Mon Mar 24 17:43:22 CET 2014 hiveserver2 started, pid 2554
Hive history file=/tmp/mapr/hive_job_log_97715747-63cd-4789-9b2e-a8b0d544cdf9_2102956370.txt
OK
Thu Mar 27 10:52:48 CET 2014 hiveserver2 stopped, pid 2554
Thu Mar 27 10:55:52 CET 2014 hiveserver2 started, pid 2597

Cheers
Wolli


2014-03-27 10:04 GMT+01:00 Nitin Pawar <nitinpawar432@gmail.com>:

Can you grab more logs from the HiveServer2 log file?


On Thu, Mar 27, 2014 at 2:31 PM, fab wol <darkwolli32@gmail.com> wrote:
Hey everyone,

I have a table with currently 5541 partitions, and 14 partitions are added daily. I will switch the metastore update from "msck repair table" to "alter table add partition", since it performs better, but that may sometimes fail, and then I need the "msck repair table" command. Unfortunately, it seems to no longer work at this table size:

0: jdbc:hive2://clusterXYZ-> use <DB_NAME>;
No rows affected (1.082 seconds)
0: jdbc:hive2://clusterXYZ-> set hive.metastore.client.socket.timeout=6000;
No rows affected (0.029 seconds)
0: jdbc:hive2://clusterXYZ-> MSCK REPAIR TABLE <TABLENAME>;
Error: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.DDLTask (state=08S01,code=1)
Error: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.DDLTask (state=08S01,code=1)

Has anyone had luck getting this to work? As you can see, I already raised the time until the Thrift timeout kicks in, but this error happens even before that time runs out ...
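For reference, the explicit registration mentioned above could look like this; Hive allows several PARTITION clauses in a single statement, so all 14 daily partitions can be added in one call (table, partition key, and paths are hypothetical):

```sql
-- Register only the new partitions instead of re-scanning all 5541
ALTER TABLE my_table ADD IF NOT EXISTS
  PARTITION (dt='2014-03-27', part=1) LOCATION '/data/my_table/dt=2014-03-27/part=1'
  PARTITION (dt='2014-03-27', part=2) LOCATION '/data/my_table/dt=2014-03-27/part=2';
```

MSCK REPAIR TABLE, by contrast, has to walk the whole table location and compare it against the metastore, which is why it degrades as the partition count grows.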

Cheers
Wolli



--
Nitin Pawar




--
Nitin Pawar


