incubator-hama-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Praveen Sripati <praveensrip...@gmail.com>
Subject Re: InputFormats for Hama
Date Sun, 25 Mar 2012 12:56:32 GMT
Thomas et al,

> Would someone please open JIRAs for that?

I can open a JIRA. I need input on what all InputFormat makes sense and the
their priority. Some we can port from Hadoop.

> Based on XML we can implement a format that parses DMOZ or commoncrawl on
Amzon S3.

Could you please elaborate on this?

Praveen


On Sun, Mar 25, 2012 at 5:14 PM, Chia-Hung Lin <clin4j@googlemail.com>wrote:

> As I understand, many iterative applications don't require key value
> input/ output and additionally need random access (read/ write) to
> particular file. I/O interface e.g. mpi may increase flexibility here.
>
> https://issues.apache.org/jira/browse/MAPREDUCE-2911
>
> On 25 March 2012 10:01, Praveen Sripati <praveensripati@gmail.com> wrote:
> > Hi,
> >
> > For Hama there are limited input formats
> >
> > CombineFileInputFormat, FileInputFormat, NullInputFormat,
> > SequenceFileInputFormat, TextInputFormat
> >
> > Does it make sense to have to have more input formats? I was thinking
> > InputFormats for Graph Databases.
> >
> > Any feedback for the different input formats is welcome.
> >
> > I quickly glanced Giraph and Hadoop and they have more InputFormats which
> > makes it easy to plug them with external systems.
> >
> > Praveen
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message