tajo-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jihoon Son (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TAJO-736) Add table management documentation
Date Fri, 04 Apr 2014 05:25:21 GMT

    [ https://issues.apache.org/jira/browse/TAJO-736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13959651#comment-13959651

Jihoon Son commented on TAJO-736:

[~hyunsik] and [~jhkim],
first of all, appreciate for your efforts.
These documents will be very useful and helpful to Tajo users.

Documents look nice, but I have some simple suggestions.
* In CSV, there are some characters which are forbidden for delimiters. For example, the line
feed (\n) cannot be used as the delimiter, because it is used to distinguish each line. It
would be great to add some descriptions about this.
* In RCFile, you may miss to put a period at the end of the first paragraph. 
* In Parquet, it would be great to add an example of DDL that creates a table with compression.
* In Column Partitioning, the "Todo" section should be removed. Also, I think that there is
a compatibility issue with Hive. For example, can Tajo directly read partitioned tables of
Hive? Whether it can or cannot, it would be better to add a simple description of the compatibility.

In addition, I think that [~davidzchen]'s review will be very helpful for the Parquet document.
[~davidzchen], would you mind reviewing the Parquet document, please?

Best regards,
Jihoon Son

> Add table management documentation
> ----------------------------------
>                 Key: TAJO-736
>                 URL: https://issues.apache.org/jira/browse/TAJO-736
>             Project: Tajo
>          Issue Type: Sub-task
>          Components: documentation
>            Reporter: Hyunsik Choi
>            Assignee: Hyunsik Choi
>             Fix For: 0.8-incubating, 1.0-incubating
>         Attachments: TAJO-736.patch
> Jinho and I wrote some user documentations for file formats. This patch contains documentations
for CSV file, RCFile, and Parquet file.

This message was sent by Atlassian JIRA

View raw message