avro-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mike Hurley (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (AVRO-1858) Update DataFileReadTool (tojson) to support a "head" concept
Date Mon, 06 Jun 2016 21:54:21 GMT

    [ https://issues.apache.org/jira/browse/AVRO-1858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15317337#comment-15317337
] 

Mike Hurley commented on AVRO-1858:
-----------------------------------

I thought about int vs long. Since there's somebody else pointing that out, it's probably
a good idea to use a long.

When I added the head operation for my team, we talked about if we also wanted tail. Our consensus
was tail would be too expensive to implement (performance, not code). Or, we just don't understand
the Avro lib well enough. We just wanted a feature to allow taking a quick peek into an Avro
file.

I think the best option is to keep AVRO-1858 as it is. If you or others think it's worthwhile,
add tail and "substring" (or whatever better name for pos+length dumps) JIRA items.

> Update DataFileReadTool (tojson) to support a "head" concept
> ------------------------------------------------------------
>
>                 Key: AVRO-1858
>                 URL: https://issues.apache.org/jira/browse/AVRO-1858
>             Project: Avro
>          Issue Type: Improvement
>          Components: java
>    Affects Versions: 1.8.1
>            Reporter: Mike Hurley
>
> It would be nice if the tojson operator supported a "head" concept in order to get a
sampling of records in an Avro file.
> Allow specifying a maximum record count to display. If no max is given in head mode,
use a reasonable default (like 10).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message