crunch-dev mailing list archives

From "Matthias Friedrich (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CRUNCH-69) it would be useful to include sample data for AverageBytesByIP and TotalBytesByIP examples
Date Sat, 22 Sep 2012 15:05:08 GMT

    [ https://issues.apache.org/jira/browse/CRUNCH-69?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13461181#comment-13461181 ]

Matthias Friedrich commented on CRUNCH-69:
------------------------------------------

I think we have to anonymize these logs before committing them or, preferably, make up
artificial data. I don't know about US law, but in Germany IP addresses are considered private
data; it would be illegal to store them for longer than a few days, much less publish them.
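
As a rough illustration of the anonymization option, here is a minimal sketch in Python. It
assumes each log line starts with an IPv4 address, as in the common access log format; the
function name, the 10.0.0.x placeholder range, and the sample lines are all made up for this
example and are not taken from the attached access_log.zip. Each distinct address gets the same
placeholder every time, so per-IP totals and averages stay meaningful.

```python
import ipaddress
import re

# Assumption: the client address is a dotted-quad IPv4 value at the start of the line.
LEADING_IPV4 = re.compile(r"^(\d{1,3}(?:\.\d{1,3}){3})\b")


def anonymize_lines(lines):
    """Replace each distinct leading IPv4 address with a stable placeholder."""
    mapping = {}  # real address -> placeholder address
    base = int(ipaddress.IPv4Address("10.0.0.1"))  # hypothetical placeholder range

    def substitute(match):
        real = match.group(1)
        if real not in mapping:
            # Placeholders are handed out in order of first appearance.
            mapping[real] = str(ipaddress.IPv4Address(base + len(mapping)))
        return mapping[real]

    return [LEADING_IPV4.sub(substitute, line, count=1) for line in lines]


# Sample lines use documentation-only addresses (RFC 5737), not real clients.
sample = [
    '203.0.113.7 - - [22/Sep/2012:15:05:08 +0000] "GET / HTTP/1.1" 200 512',
    '198.51.100.9 - - [22/Sep/2012:15:05:09 +0000] "GET /a HTTP/1.1" 200 100',
    '203.0.113.7 - - [22/Sep/2012:15:05:10 +0000] "GET /b HTTP/1.1" 200 300',
]
for line in anonymize_lines(sample):
    print(line)
```

The mapping lives only in memory and would have to be discarded after the run. Whether this
kind of pseudonymization is legally sufficient is a separate question, and addresses that appear
elsewhere in a line (for example in a referrer field) are not touched by this sketch.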
                
> it would be useful to include sample data for AverageBytesByIP and TotalBytesByIP examples
> ------------------------------------------------------------------------------------------
>
>                 Key: CRUNCH-69
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-69
>             Project: Crunch
>          Issue Type: Improvement
>          Components: Core
>    Affects Versions: 0.3.0
>            Reporter: Roman Shaposhnik
>            Assignee: Josh Wills
>            Priority: Minor
>         Attachments: access_log.zip
>
>
> Currently one has to guess what kind of input to give those examples. It would be very
> nice if a canonical set of input files existed as part of the examples' resources.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators.
For more information on JIRA, see: http://www.atlassian.com/software/jira
