gora-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Alfonso Nishikawa (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (GORA-109) Pig Adapter for Gora
Date Thu, 03 May 2018 18:40:00 GMT

    [ https://issues.apache.org/jira/browse/GORA-109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16462909#comment-16462909

Alfonso Nishikawa commented on GORA-109:

Hi, Pals.

It has been a lot of time since this issue. I finally got a _quite_-successful implementation
of a GoraStorage for Apache Pig. At this moment it is in my own branch since I impemented
it for my degree proyect. [https://github.com/alfonsonishikawa/gora/tree/goraexplorer/gora-pig]

It allows to load data from -any- HBase (I can only claim this) through Gora in Pig scripts
set job.name 'GoraPig LOAD test';
register gora/*.jar;
webpage = LOAD '.' USING org.apache.gora.pig.GoraStorage('{
        "persistentClass": "admin.WebPage",
        "fields": "baseUrl,contentType"
}') ;
DUMP webpage;
I will update my page at [http://people.apache.org/~alfonsonishikawa/gora-109.html] with details
in the incoming days (the speed depends on my spare time), but please, ask me in advance the
first questions you have so I can write a FAQ too. If I have to write by my own ideas, I tend
to write EVERYTHING and I end NEVER :)

I anyone is a Pig user and wants to test it ask too. Questions priotize what I explain :)

I am thinking about some Nutch jobs in Pig :P


Sorry for the inmense delay :(. Just know that this _"complete"_ implementation (actually
nothing ever is complete) took me more than 300 hours :(. It has much details everywhere.

> Pig Adapter for Gora
> --------------------
>                 Key: GORA-109
>                 URL: https://issues.apache.org/jira/browse/GORA-109
>             Project: Apache Gora
>          Issue Type: New Feature
>          Components: gora-pig
>    Affects Versions: 0.2
>            Reporter: Lewis John McGibbney
>            Assignee: Alfonso Nishikawa
>            Priority: Minor
>             Fix For: 0.9
> This has been on the agenda from the start and I suppose will be purely use case driven.
Persoanlly I haven't touched Pig as I don't have a use-case driven requirement to do so, however
there is no harm in this being logged if in the future it attracts interest.  

This message was sent by Atlassian JIRA

View raw message