mahout-dev mailing list archives

From "Grant Ingersoll (Commented) (JIRA)" <>
Subject [jira] [Commented] (MAHOUT-947) Improvements to seqdumper
Date Fri, 10 Feb 2012 12:27:59 GMT


Grant Ingersoll commented on MAHOUT-947:

bq. I wasn't suggesting supporting multiple args, just quoted globs - since HDFS FileSystem
supports strings with glob patterns in them...

Makes sense.  I can update.  In any case, I changed Tom's patch to use our standard --input
flag for both and then just check to see whether it is a directory or not.  We could just
as well check to see if it is a glob.
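
For illustration only, here is a minimal sketch of that check: take one --input value and decide whether it is a glob, a directory, or a single file. It is not the code from the patch. It uses java.nio.file on the local filesystem as a stand-in for the HDFS FileSystem calls, and the names InputResolver, looksLikeGlob and resolve are made up for this example. It also assumes glob characters only appear in the last path segment.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical helper, not part of Mahout: resolves a single input string
// to the list of files a dumper would read.
public class InputResolver {

    // Treat the input as a glob if it contains a common glob metacharacter.
    static boolean looksLikeGlob(String input) {
        return input.chars().anyMatch(c -> "*?[{".indexOf(c) >= 0);
    }

    // Collect the regular files in dir, optionally filtered by a glob pattern.
    private static void addFiles(Path dir, String pattern, List<Path> out)
            throws IOException {
        try (DirectoryStream<Path> stream = (pattern == null)
                ? Files.newDirectoryStream(dir)
                : Files.newDirectoryStream(dir, pattern)) {
            for (Path p : stream) {
                if (Files.isRegularFile(p)) {
                    out.add(p);
                }
            }
        }
    }

    // Glob: match files in the parent directory (pattern in last segment only).
    // Directory: every regular file directly inside it.
    // Anything else: the path itself.
    static List<Path> resolve(String input) throws IOException {
        List<Path> result = new ArrayList<>();
        if (looksLikeGlob(input)) {
            int slash = input.lastIndexOf('/');
            String parent = slash < 0 ? "." : (slash == 0 ? "/" : input.substring(0, slash));
            addFiles(Paths.get(parent), input.substring(slash + 1), result);
        } else {
            Path p = Paths.get(input);
            if (Files.isDirectory(p)) {
                addFiles(p, null, result);
            } else {
                result.add(p);
            }
        }
        Collections.sort(result); // stable order for printing
        return result;
    }

    public static void main(String[] args) throws IOException {
        for (String arg : args) {
            for (Path p : resolve(arg)) {
                System.out.println(p);
            }
        }
    }
}
```

The glob has to be quoted on the command line (for example 'output/part-*') so that the shell passes it through as a single argument instead of expanding it.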

> Improvements to seqdumper
> -------------------------
>                 Key: MAHOUT-947
>                 URL:
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: tom pierce
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 0.7
>         Attachments: MAHOUT-947-2.patch, MAHOUT-947.patch, MAHOUT-947.patch, MAHOUT-947.patch
> I've put together a few handy additions to seqdumper:
> * Ability to dump all sequence files in a directory.
> * A quiet flag to attenuate the non-data output.
> * A flag to toggle name-only printing for NamedVector values.
> * An option to print only the N highest-valued elements in WeightedVector values.
> Seems like others will probably find some of these to be helpful.

This message is automatically generated by JIRA.

