hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hive QA (JIRA)" <>
Subject [jira] [Commented] (HIVE-13275) Add a toString method to BytesRefArrayWritable
Date Thu, 02 Jun 2016 02:53:59 GMT


Hive QA commented on HIVE-13275:

Here are the results of testing the latest attachment:

{color:green}SUCCESS:{color} +1 due to 1 test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 9 failed/errored test(s), 10198 tests executed
*Failed tests:*
TestHWISessionManager - did not produce a TEST-*.xml file
TestJdbcWithMiniHA - did not produce a TEST-*.xml file
TestJdbcWithMiniMr - did not produce a TEST-*.xml file
TestOperationLoggingAPIWithTez - did not produce a TEST-*.xml file

Test results:
Console output:
Test logs:

Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 9 tests failed

This message is automatically generated.

ATTACHMENT ID: 12793199 - PreCommit-HIVE-MASTER-Build

> Add a toString method to BytesRefArrayWritable
> ----------------------------------------------
>                 Key: HIVE-13275
>                 URL:
>             Project: Hive
>          Issue Type: Improvement
>          Components: File Formats, Serializers/Deserializers
>    Affects Versions: 1.1.0
>            Reporter: Harsh J
>            Assignee: Harsh J
>            Priority: Trivial
>         Attachments: HIVE-13275.000.patch
> RCFileInputFormat cannot be used externally for Hadoop Streaming today cause Streaming
generally relies on the K/V pairs to be able to emit text representations (via toString()).
> Since BytesRefArrayWritable has no toString() methods, the usage of the RCFileInputFormat
causes object representation prints which are not useful.
> Also, unlike SequenceFiles, RCFiles store multiple "values" per row (i.e. an array),
so its important to output them in a valid/parseable manner, as opposed to choosing a simple
joining delimiter over the string representations of the inner elements.
> I propose adding a standardised CSV formatting of the array data, such that users of
Streaming can then parse the results in their own script. Since we have OpenCSV as a dependency
already, we can make use of it for this purpose.

This message was sent by Atlassian JIRA

View raw message