lucene-general mailing list archives

From Grant Ingersoll <gsing...@apache.org>
Subject Re: Mahout and lucence and data formats
Date Fri, 19 Nov 2010 14:20:13 GMT

On Nov 19, 2010, at 8:36 AM, Shivani Rao wrote:

> I see that there are two functions,
> vectordump and seqdump.
> Does either of these help with dumping sparse vectors/matrices from Mahout
> to a text format?

Yes, they do.  You can dump JSON using VectorDump.

Also, I just opened https://issues.apache.org/jira/browse/MAHOUT-548
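
For reference, the "bit of code" Ted mentions further down could look roughly like the sketch below. It is only an illustration under stated assumptions: it does not use the Mahout API and is not what MAHOUT-548 implements. The class name SparseCsvSketch, the method toCsv, and the map-of-maps stand-in for a sparse matrix are all hypothetical; in a real job the non-zero entries would come from the vectors stored in Mahout's sequence files. It writes one "row,col,value" line per non-zero entry, a layout R can load with read.csv.

```java
import java.util.Map;
import java.util.TreeMap;

public class SparseCsvSketch {

    // Hypothetical helper, not a Mahout call.
    // rows maps a row index to (column index -> non-zero value).
    static String toCsv(Map<Integer, Map<Integer, Double>> rows) {
        StringBuilder sb = new StringBuilder("row,col,value\n");
        for (Map.Entry<Integer, Map<Integer, Double>> row : rows.entrySet()) {
            for (Map.Entry<Integer, Double> cell : row.getValue().entrySet()) {
                // Emit only the entries that are actually stored (the non-zeros).
                sb.append(row.getKey()).append(',')
                  .append(cell.getKey()).append(',')
                  .append(cell.getValue()).append('\n');
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Assumed toy data standing in for vectors read from Mahout output.
        Map<Integer, Map<Integer, Double>> rows = new TreeMap<>();
        rows.computeIfAbsent(0, k -> new TreeMap<>()).put(2, 1.5);
        rows.computeIfAbsent(3, k -> new TreeMap<>()).put(0, 4.0);
        System.out.print(toCsv(rows));
    }
}
```

The indices here are zero-based, so they would need shifting by one before being used as matrix positions in R.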

> 
> Regards,
> Shivani
> 
> On Fri, Nov 19, 2010 at 7:10 AM, Grant Ingersoll <gsingers@apache.org> wrote:
> 
>> 
>> On Nov 18, 2010, at 4:59 PM, Ted Dunning wrote:
>> 
>>> On Thu, Nov 18, 2010 at 1:15 PM, Shivani Rao <raoshivani@gmail.com> wrote:
>>> 
>>>> Ted,
>>>> Thanks for your email. I understand that R reads values in text format.
>>>> My problem is with dumping data out of Mahout.
>>>> My main concern is the format of the data Apache Mahout emits and how to
>>>> convert it to a text format, not the R side of the story.
>>>> 
>>>> 
>>> Right.  You will have to write a bit of code to output CSV from Mahout.
>>> There isn't any native support for it.
>> 
>> We should remedy this.  I might have some cycles today to do this.
>> 
>> 
>> 
> 
> 
> -- 
> Research Scholar,
> School of Electrical and Computer Engineering
> Purdue University
> West Lafayette IN
> web.ics.purdue.edu/~sgrao

--------------------------
Grant Ingersoll
http://www.lucidimagination.com

